Cloud Platform Reliability Engineer


Date: Mar 26, 2021

Location: Alpharetta, GA, US

Company: New York Life Insurance Co


When you join New York Life, you’re joining a company that values development, career growth, collaboration, innovation, and diversity & inclusion. We want employees to feel proud about being part of a company that is committed to doing the right thing. Through various resources and programs, you can grow your career while developing personally and professionally.




The Cloud Reliability Engineer will be a member of the Cloud Services team and drive cross-functional technology for delivering projects on an IaaS platform to meet critical technology and business requirements. 


This is a specialized job function that focuses on the automation of availability, performance, maintainability and optimization of business applications on the NYL IaaS (AWS) platform. The Cloud Reliability Engineer is expected to follow an Infrastructure as Code (IaC) philosophy, as well as standard development practices, to provide consistency and repeatability with our deployments.


Key Duties and Responsibilities:

  • Primary function is to deliver IaC automation which provides repeatable, reliable solutions using Terraform and GitHub.
  • Enable tooling and process so that all L1/L2 operations can be done by more traditional NOC teams and remain the L3 escalation point for Cloud incidents and requests.
  • Drive automation to replace manual operations
  • Deliver automation to address new features and defects as part of our platform releases.
  • Aid with application implementations using primarily Linux technology on AWS
  • Assist and oversight production releases while partnering with internal teams to identify requirements for operational monitoring and optimization
  • Review and identify IaaS issues or concerns
  • Ensure alignment to cloud standards and best practices
  • Follow the enterprise change management process to deploy fully tested and documented solutions/applications to a production environment
  • Interprets and advise on usage to optimize, maximizing utilization of deployed resources and reduce spend
  • Maintain and develop our growing Terraform infrastructure-as-code library which we use to deploy infrastructure and applications
  • Collaborate with Cloud Architects and Solution Engineers to deliver projects
  • Train development teams and new users on services and automation capabilities
  • Provide tier 2 and 3 production operational support
  • Potential for off-hours incident response
  • Actively engage with peer technical teams as appropriate to ensure a holistic approach
  • Willingness to learn new and emerging technologies and implement in a short time
  • Ability to multi-task and manage tasks with varying priorities
  • Self-motivated, innovative and able to work across diverse technical and non-technical teams. 
  • Must be self-directed and willing to learn new tools and services to stay up to date with our evolving platforms
  • Ability to communicate to non-technical and technical resources

Required Qualifications:

  • MUST HAVE - Ability to write and implement infrastructure as code and platform automation (Terraform preferred)
  • Experience implementing Infrastructure as Code – Terraform, Ansible etc.
  • Strong public cloud provider experience (AWS certification a plus)
  • Strong operating system and admin knowledge of the Linux platform using shell scripting
  • Working knowledge of DevOps and delivery tools (GitHub)
  • Practical experience with modern scripting languages (Python, .Net, C#, Java)
  • Practical understanding of infrastructure technologies - compute, network, storage
  • Ability to relate to software engineering challenges and practices

Required Education/Experience

  • Education: BS degree in Computer Science or Engineering or equivalent on the job, hands on experience.
  • 3+ years of overall IT experience, proficient with the Linux platform
  • 2+ years with Cloud (IaaS, PaaS, SaaS) services and platforms
  • 2+ years of automated solutions and implementation of highly scalable, highly available services
  • Understanding of Software Development Lifecycle Methodologies
  • Understanding of application development including application servers, middleware, systems management, monitoring, configuration management, capacity planning and performance tuning
  • Self-motivated and able to work across diverse technical and non-technical teams

Education: BS degree in Computer Science or Engineering or the required on the job, hands on experience is acceptable. 





Recognized as one of Fortune’s World’s Most Admired Companies, New York Life is committed to improving local communities through a culture of employee giving and volunteerism, supported by our Foundation. We invite you to bring your talents to New York Life, so we can continue to help families and businesses “Be Good At Life.” To learn more, please visit LinkedIn, our Newsroom and the Careers page of

Job Requisition ID: 82974




Nearest Major Market: Alpharetta
Nearest Secondary Market: Atlanta

Job Segment: Cloud, Engineer, Social Media, Linux, Developer, Technology, Engineering, Marketing