Tuesday, August 15, 2023

DevOps Site Reliability Engineer (SRE)

 Hiring: DevOps Site Reliability Engineer (SRE)

 Further details contact at subiksha@tsmspl.com   || +91 6364 922 002

 

 Position Overview: As a DevOps Site Reliability Engineer (SRE) at Ampliforce, you'll play a critical role in designing, building, and maintaining the infrastructure that powers our applications on the AWS platform. You'll collaborate closely with our development and operations teams, working in an Agile environment to ensure our systems are reliable, scalable, and performant. Your expertise in cloud technologies and DevOps practices will drive the stability and resilience of our services


Responsibilities:

1.     Infrastructure Design and Automation: Design, implement, and manage our AWS infrastructure using Infrastructure as Code (IaC) tools such as CloudFormation or Terraform. Automate deployment and scaling processes to achieve high availability and fault tolerance.

2.     System Monitoring and Optimization: Implement robust monitoring solutions to proactively identify performance bottlenecks, security vulnerabilities, and reliability issues. Continuously optimize the infrastructure for cost-efficiency, performance, and scalability.

3.     Incident Management: Respond to incidents, troubleshoot system outages, and participate in post-incident reviews. Implement measures to minimize downtime and prevent recurrence of issues.

4.     Release Management: Collaborate with the development team to streamline the release process. Implement continuous integration and continuous delivery (CI/CD) pipelines to ensure smooth and reliable software deployments.

5.     Security and Compliance: Implement best practices for security, access control, and data protection in the AWS environment. Stay current with industry trends and ensure compliance with relevant regulations.

6.     Capacity Planning: Analyze system performance data to anticipate future resource needs. Scale the infrastructure to accommodate growth while maintaining optimal performance.

7.     Documentation and Knowledge Sharing: Maintain comprehensive documentation of the infrastructure, processes, and troubleshooting guides. Share knowledge with team members to foster continuous learning and improvement.

 

Qualifications:

·       Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).

·       Proven experience as a DevOps/SRE engineer, with a strong focus on AWS cloud services.

·       Proficiency with IaC tools like CloudFormation, Terraform, or similar.

·       Hands-on experience with containerization and orchestration tools (e.g., Docker, Kubernetes).

·       Solid understanding of Agile methodologies and experience working in Agile teams.

·       Strong scripting and programming skills (e.g., Python, Bash, NodeJS, or similar).

·       Familiarity with monitoring tools (e.g., CloudWatch) and APM solutions.

·       Familiarity with Event Driven Design and tooling (Kafka, RabbitMQ, SQS)

·       Understanding of taking Domain Driven Design documents and guiding the development work to align with the business Domains.

·       Excellent problem-solving skills and the ability to work effectively in a fast-paced, collaborative environment.

No comments:

Post a Comment