Further details contact at subiksha@tsmspl.com || +91 6364 922 002
Responsibilities:
1. Infrastructure Design and Automation: Design, implement, and manage our AWS
infrastructure using Infrastructure as Code (IaC) tools such as CloudFormation
or Terraform. Automate deployment and scaling processes to achieve high
availability and fault tolerance.
2. System Monitoring and Optimization: Implement robust monitoring solutions to
proactively identify performance bottlenecks, security vulnerabilities, and
reliability issues. Continuously optimize the infrastructure for
cost-efficiency, performance, and scalability.
3. Incident Management: Respond to incidents, troubleshoot system outages,
and participate in post-incident reviews. Implement measures to minimize
downtime and prevent recurrence of issues.
4. Release Management: Collaborate with the development team to
streamline the release process. Implement continuous integration and continuous
delivery (CI/CD) pipelines to ensure smooth and reliable software deployments.
5. Security and Compliance: Implement best practices for security, access
control, and data protection in the AWS environment. Stay current with industry
trends and ensure compliance with relevant regulations.
6. Capacity Planning: Analyze system performance data to anticipate
future resource needs. Scale the infrastructure to accommodate growth while
maintaining optimal performance.
7. Documentation and Knowledge Sharing: Maintain comprehensive documentation of the
infrastructure, processes, and troubleshooting guides. Share knowledge with
team members to foster continuous learning and improvement.
Qualifications:
·
Bachelor's degree in Computer Science,
Engineering, or a related field (or equivalent experience).
·
Proven experience as a DevOps/SRE
engineer, with a strong focus on AWS cloud services.
·
Proficiency with IaC tools like
CloudFormation, Terraform, or similar.
·
Hands-on experience with containerization
and orchestration tools (e.g., Docker, Kubernetes).
·
Solid understanding of Agile methodologies
and experience working in Agile teams.
·
Strong scripting and programming skills
(e.g., Python, Bash, NodeJS, or similar).
·
Familiarity with monitoring tools (e.g.,
CloudWatch) and APM solutions.
·
Familiarity with Event Driven Design and
tooling (Kafka, RabbitMQ, SQS)
·
Understanding of taking Domain Driven
Design documents and guiding the development work to align with the business
Domains.
· Excellent problem-solving skills and the ability to work effectively in a fast-paced, collaborative environment.
No comments:
Post a Comment