Thursday, November 30, 2023

Principal Cloud Operations Infrastructure Engineer

 

We are looking for a Principal Cloud Operations Infrastructure Engineer (Linux/Storage)

For further details please drop email with updated profile at aravinthu@tsmspl.com

Location: Remote/Hybrid


General Summary:

·       The Cloud Operations team provides 24x7x365 support for all Company SaaS & Hosting customers globally.

·       This business unit is responsible for the day-to-day management and support of the cloud operations environment including the uptime, performance and high availability of all customers supporting systems inside of the SaaS & Hosted environments.

·       The SaaS & hosted ecosystem is comprised of multi-tiered applications, microservice architectures, containers & virtual servers as well as large & complex multi-terabyte SQL database systems.

·       The Principal Cloud Operations Infrastructure Engineer will be responsible for designing, implementing & managing key infrastructure systems supporting Cloud Linux Systems, Storage and hyperscaler hosted instance backups.

·       Additional responsibilities may include DR event planning, future expansion into COLO hosted infrastructure and, and other centralized shared services that will need to be implemented in support of customer systems across private and public cloud environments including AWS.

 

 

Key Responsibilities

• Responsible for working with the Cloud Operations Support, Infrastructure, DBA, and SRE teams to collect requirements for system backups.

• Serve as the SME for Linux matters including the establishment and maintenance of SUSE and other linux VM templates

• Responsible for hardening linux systems per CIS standards through infrastructure as code

• Design the new backup solution that fits our needs, ensuring alignment with key deliverables from leadership.

• Follow established solution development processes to select and implement appropriate solutions across the relevant technology domains of linux, storage, backups, and disaster recovery to meet Company’s needs.

• Responsible for working with Cloud Operations Support, Infrastructure, DBA, and SRE teams to document and deploy new solutions

• Responsible for training on a cadence to the Cloud Operations Organization on how newly developed solutions operate.

• Provide escalation support on related technology issues to the Cloud Operations Organization serving in a 24x7 on-call rotation.

• Participate in Support ticket resolution for areas of responsibility.

• Manage Cloud based hyperscaler storage, backup and related offerings to maximize the value of the leveraged hyperscaler while minimizing the cost to the business. • Participate in on-call rotation every month.

• Responsible for engagement in the project deployment activities related to job function. Professional Skills & Abilities

• Desire and ability to thrive in a fast-paced, highly demanding, dynamic business and cloud operations environment.

• The role requires analytical acumen and solution orientation to probe for understanding and to make appropriate decisions to address the nuances of technical and business challenges in order to achieve the targeted outcome

• Strong customer service orientation

• Excellent communication skills and experience in driving cross department initiatives to obtain organizational objectives & meet customer needs

• Strong communication, presentation, business and technical writing skills

• The ability to provide excellent customer service as well as manage and build strong relationships both internally and externally

• Strong interest in further developing and integrating operations with technology in business value creating ways

• Awareness of emerging issues, including regulations, industry practices and technology Technical Skills & Experience

• 10+ years of experience in job specific skills.

• 5+ years of experience in scalable infrastructure and platforms architecture, implementation, and management including Linux systems management and architecture, storage and backup management in AWS, as well as Database backups, and other infrastructure services

• 5+ years of experience in linux administration and management

• 3+ years of experience in Database backups, restores, and database server migrations.

• 3+ years of experience in script development in python or powershell related to infrastructure lifecycle management

• Degree in Computer Science or equivalent experience.

No comments:

Post a Comment