The Software Engineer II role focuses on ensuring system reliability and performance, incident response, automation, and service design for digital transformation at MetLife.
Description and Requirements
Site Reliability Engineer
Role Overview
MetLife is seeking experienced SRE Engineers to contribute to its digital and AI transformation. This role involves responsibilities for ensuring the availability, scalability, and performance of our systems and services.
Key Responsibilities
System Reliability and Performance: Ensure the reliability, scalability, and performance of our systems and services, including monitoring, troubleshooting, and resolving issues.
Service Design and Implementation: Collaborate with engineering teams to design, implement, and operate large-scale systems, including developing software that automates and streamlines our operations.
Automation and Scripting: Develop and maintain automation scripts and tools to streamline operations, improve efficiency, and reduce manual errors.
Monitoring and Alerting: Design and implement monitoring and alerting systems to ensure timely detection and resolution of issues.
Collaboration and Communication: Work closely with engineering teams, product managers, and other stakeholders to ensure that systems and services meet business requirements and are aligned with company goals.
Incident Response and Management: Participate in incident response and management, including root cause analysis, post-mortems, and implementation of corrective actions.
Candidate Qualifications
Skills & Competencies
Tech Stack:Java, Python, Bash, PowerShell, Docker, Kubernetes, Azure Kubernetes Service, Azure Application Insight, Azure Log Analytics Splunk, Grafana, AppDynamics, ELK, Azure Monitor, ITIL, ServiceNow
Language:Business proficiency in English; Business Proficiency in Japanese is added advantage.
This is a great opportunity to be part of MetLife's technology transformation journey.
About MetLife
Recognized on Fortune magazine's list of the "World's Most Admired Companies" and Fortune World's 25 Best Workplaces™, MetLife, through its subsidiaries and affiliates, is one of the world's leading financial services companies; providing insurance, annuities, employee benefits and asset management to individual and institutional customers. With operations in more than 40 markets, we hold leading positions in the United States, Latin America, Asia, Europe, and the Middle East.
Our purpose is simple - to help our colleagues, customers, communities, and the world at large create a more confident future. United by purpose and guided by our core values - Win Together, Do the Right Thing, Deliver Impact Over Activity, and Think Ahead - we're inspired to transform the next century in financial services. At MetLife, it's #AllTogetherPossible . Join us!
#BI-Hybrid
Site Reliability Engineer
Role Overview
MetLife is seeking experienced SRE Engineers to contribute to its digital and AI transformation. This role involves responsibilities for ensuring the availability, scalability, and performance of our systems and services.
Key Responsibilities
System Reliability and Performance: Ensure the reliability, scalability, and performance of our systems and services, including monitoring, troubleshooting, and resolving issues.
Service Design and Implementation: Collaborate with engineering teams to design, implement, and operate large-scale systems, including developing software that automates and streamlines our operations.
Automation and Scripting: Develop and maintain automation scripts and tools to streamline operations, improve efficiency, and reduce manual errors.
Monitoring and Alerting: Design and implement monitoring and alerting systems to ensure timely detection and resolution of issues.
Collaboration and Communication: Work closely with engineering teams, product managers, and other stakeholders to ensure that systems and services meet business requirements and are aligned with company goals.
Incident Response and Management: Participate in incident response and management, including root cause analysis, post-mortems, and implementation of corrective actions.
Candidate Qualifications
- Education:Bachelor's degree in Computer Science or equivalent.
- Experience:
- 2-4 years in as a Site Reliability engineer supporting Hybrid Cloud environment.
- Strong scripting and programming skills in languages such as Java, Python, Bash, or PowerShell
- Proficiency in CI/CD, containerization and container orchestration platforms (Docker, Kubernetes), Terraform etc.
- Hands on experience on usage of ELK stack, observability tools like Grafana, Kibana, Splunk, App Insight etc.
- Strong analytical and problem-solving skills to identify and resolve issues in Production.
Skills & Competencies
Tech Stack:Java, Python, Bash, PowerShell, Docker, Kubernetes, Azure Kubernetes Service, Azure Application Insight, Azure Log Analytics Splunk, Grafana, AppDynamics, ELK, Azure Monitor, ITIL, ServiceNow
Language:Business proficiency in English; Business Proficiency in Japanese is added advantage.
This is a great opportunity to be part of MetLife's technology transformation journey.
About MetLife
Recognized on Fortune magazine's list of the "World's Most Admired Companies" and Fortune World's 25 Best Workplaces™, MetLife, through its subsidiaries and affiliates, is one of the world's leading financial services companies; providing insurance, annuities, employee benefits and asset management to individual and institutional customers. With operations in more than 40 markets, we hold leading positions in the United States, Latin America, Asia, Europe, and the Middle East.
Our purpose is simple - to help our colleagues, customers, communities, and the world at large create a more confident future. United by purpose and guided by our core values - Win Together, Do the Right Thing, Deliver Impact Over Activity, and Think Ahead - we're inspired to transform the next century in financial services. At MetLife, it's #AllTogetherPossible . Join us!
#BI-Hybrid
Similar Jobs at MetLife
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The Software Platform Engineer II is responsible for developing software applications, utilizing Core Java and Spring Boot, and working with microservices architecture. The role involves building applications from scratch and collaborating with distributed teams in Agile environments.
Top Skills:
AWSAzureCore JavaGCPGitJavaScriptJSONKafkaKubernetesRest ApisSpring BootSQL
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Responsible for software application design and development, focusing on React Native for mobile apps, API integration, and production support.
Top Skills:
Api IntegrationCloud Native TechnologiesOracleReact Native
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The Software Platform Engineer II designs, develops, and maintains software solutions, leading requirements documentation and technical design for applications while ensuring integration with existing systems.
Top Skills:
APIsBack End DevelopmentCloud Native TechnologiesDatabase Management SystemsFront End DevelopmentMobile TechnologiesWeb Technologies
What you need to know about the Kolkata Tech Scene
When considering the industries shaping India's tech scene, gaming might not immediately come to mind. However, in the last decade, increased internet usage and greater access to mobile devices have catapulted the industry to new heights, with Kolkata-based companies like Virtualinfocom, Red Apple Technologies and Digitoonz, at the forefront, driving the design and animation of new gaming titles for players.

