The role involves ensuring system reliability, performance, and automation in hybrid cloud environments, with strong collaboration on SRE and DevOps methodologies.
Description and Requirements
Overview:
MetLife is seeking a highly skilled Site Reliability Engineer (SRE) to drive the reliability, scalability, and performance of our mission-critical systems across hybrid cloud environments, with a strong emphasis on Azure Cloud and Azure DevOps. You will play a key role in automation, observability, incident management, and continuous improvement, collaborating closely with engineering and operations teams.
Key Responsibilities:
Qualifications & Skills:
About MetLife
Recognized on Fortune magazine's list of the "World's Most Admired Companies" and Fortune World's 25 Best Workplaces™, MetLife, through its subsidiaries and affiliates, is one of the world's leading financial services companies; providing insurance, annuities, employee benefits and asset management to individual and institutional customers. With operations in more than 40 markets, we hold leading positions in the United States, Latin America, Asia, Europe, and the Middle East.
Our purpose is simple - to help our colleagues, customers, communities, and the world at large create a more confident future. United by purpose and guided by our core values - Win Together, Do the Right Thing, Deliver Impact Over Activity, and Think Ahead - we're inspired to transform the next century in financial services. At MetLife, it's #AllTogetherPossible . Join us!
#BI-Hybrid
Overview:
MetLife is seeking a highly skilled Site Reliability Engineer (SRE) to drive the reliability, scalability, and performance of our mission-critical systems across hybrid cloud environments, with a strong emphasis on Azure Cloud and Azure DevOps. You will play a key role in automation, observability, incident management, and continuous improvement, collaborating closely with engineering and operations teams.
Key Responsibilities:
- System Reliability & Performance:
- Ensure high availability and optimal performance of services across on-premises and Azure cloud platforms.
- Proactively identify, troubleshoot, and resolve system issues, minimizing downtime and impact.
- Automation:
- Design, develop, and maintain automation scripts and tools (Python, PowerShell, Bash) to streamline operations and deployments.
- Monitoring, Observability & Incident Management:
- Architect and maintain robust monitoring, logging, and alerting solutions using Grafana, Splunk, and Azure Monitor/Application Insights.
- Lead incident response, root cause analysis, and post-mortem processes, driving corrective and preventive actions.
- Cloud & Containerization:
- Deploy, manage, and optimize workloads on Azure, leveraging services such as AKS (Kubernetes), Azure Functions, and App Services.
- Build and maintain containerized environments using Docker and Kubernetes.
- Collaboration & Best Practices:
- Partner with engineering teams to align system architecture and performance with business objectives.
- Champion SRE and DevOps best practices, fostering a culture of reliability, automation, and continuous improvement.
- Documentation & Knowledge Sharing:
- Maintain comprehensive system documentation and runbooks.
- Share knowledge and mentor team members to elevate operational excellence.
Qualifications & Skills:
- 3+ years of SRE or DevOps experience supporting hybrid cloud environments (On-Prem & Azure).
- Advanced proficiency in Azure Cloud services, Azure DevOps (Pipelines, Repos).
- Mandatory : Hands-on experience with Docker, AKS (Azure Kubernetes Service) and CI/CD automation.
- Deep experience with monitoring and observability tools: ELK Stack, Grafana, Splunk, Azure Monitor/Application Insights.
- Strong scripting skills: Python, PowerShell, Bash
- Good to have : Experience with configuration management tools such as Puppet or Ansible.
- Solid SQL and database troubleshooting skills.
- Familiarity with ITSM tools (e.g., ServiceNow).
- Business proficiency in English; Japanese language skills are a plus.
- Relevant certifications (e.g., Azure Administrator (AZ-104), Azure DevOps Engineer (AZ-400), CKA) are highly desirable.
About MetLife
Recognized on Fortune magazine's list of the "World's Most Admired Companies" and Fortune World's 25 Best Workplaces™, MetLife, through its subsidiaries and affiliates, is one of the world's leading financial services companies; providing insurance, annuities, employee benefits and asset management to individual and institutional customers. With operations in more than 40 markets, we hold leading positions in the United States, Latin America, Asia, Europe, and the Middle East.
Our purpose is simple - to help our colleagues, customers, communities, and the world at large create a more confident future. United by purpose and guided by our core values - Win Together, Do the Right Thing, Deliver Impact Over Activity, and Think Ahead - we're inspired to transform the next century in financial services. At MetLife, it's #AllTogetherPossible . Join us!
#BI-Hybrid
Top Skills
Aks
Azure Cloud
Azure Devops
Azure Monitor
Bash
Docker
Grafana
Kubernetes
Powershell
Python
Splunk
SQL
Similar Jobs at MetLife
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Join MetLife as a Jr. Software Platform Engineer to optimize production systems, manage incidents, and ensure operational efficiency while delivering excellent customer service and reporting metrics.
Top Skills:
AgileDevOpsItilSafe
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
As a Sr. Full Stack Software Engineer II at MetLife, you will be responsible for developing software solutions that meet business requirements, working collaboratively within teams, and ensuring the quality of applications.
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The Full Stack Software Engineer will develop and implement cloud-native software solutions and AI applications, requiring both front-end and back-end skills.
Top Skills:
Ai TechnologiesCloud-Native Software
What you need to know about the Kolkata Tech Scene
When considering the industries shaping India's tech scene, gaming might not immediately come to mind. However, in the last decade, increased internet usage and greater access to mobile devices have catapulted the industry to new heights, with Kolkata-based companies like Virtualinfocom, Red Apple Technologies and Digitoonz, at the forefront, driving the design and animation of new gaming titles for players.

