Lead critical incident response and deep diagnostics across application, infrastructure, and network layers. Improve observability, reduce alert noise, drive RCA and preventive improvements, and partner with application, platform, and operations teams to improve system reliability at enterprise scale.
Requisition Number: 2346938
Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.
Role Summary
We are seeking a highly experienced SWAT Engineer to join an elite enterprise team specializing in critical incident response, deep-dive diagnostics, and system-wide reliability improvements. This role operates across application, infrastructure, network, and observability domains.
Primary Responsibilities:
Required Qualifications:
Preferred Qualifications:
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.
Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.
Role Summary
We are seeking a highly experienced SWAT Engineer to join an elite enterprise team specializing in critical incident response, deep-dive diagnostics, and system-wide reliability improvements. This role operates across application, infrastructure, network, and observability domains.
Primary Responsibilities:
- Lead/support P1/P2 incident war rooms and drive rapid triage
- Perform deep diagnostics across application, infra, and network layers
- Improve observability (logs, metrics, traces) and reduce alert noise
- Contribute to RCA and drive preventive improvements
- Partner across App, Platform, and Ops teams
- Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so
Required Qualifications:
- Solid distributed systems troubleshooting experience
- Expertise in Linux, networking, and cloud platforms
- Proficiency with observability tools (e.g., Splunk, Dynatrace)
- Proven ability to operate under high-pressure incident scenarios
Preferred Qualifications:
- Experience in enterprise-scale Tier-0 environments
- Background in SRE / Incident Management
- Familiarity with AIOps (InterLink, alert correlation tools)
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.
Similar Jobs at Optum
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Lead end-to-end data science projects: data ingestion, feature engineering, modeling (ML/DL/GenAI), deployment, performance tracking, analysis on large healthcare datasets, and communicate results to business partners.
Top Skills:
Deep LearningGenerative AiMachine LearningPythonSparkSQL
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design, build, and maintain full-stack web applications using Java backends and Angular/React frontends. Develop RESTful APIs and microservices, ensure performance, scalability, and security, perform code reviews and debugging, collaborate across product/QA/UX, and manage development through testing, deployment, and support within Agile teams.
Top Skills:
AngularCi/CdCSS3DevOpsGitHTML5JavaJavaScriptJwtMicroservicesMongoDBMySQLNoSQLOauthOracleReactRest ApisSpring BootSQLTypescript
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design, build, and deploy NLP/LLM and classical ML features for chat and voice workflows. End-to-end work: data preparation, modeling, evaluation, API integration, deployment, and monitoring in cloud. Implement responsible AI controls, maintain ML services and documentation, apply MLOps practices, and collaborate with cross-functional teams to improve reliability, latency, and cost per inference.
Top Skills:
AlertingAPIsAWSCi/CdContainersFeature StoresHugging FaceIamJavaLangchainLlmsLoggingMetricsModel RegistryMonitoringNlpNode.jsPythonPyTorchServerlessTensorFlowVector Databases
What you need to know about the Kolkata Tech Scene
When considering the industries shaping India's tech scene, gaming might not immediately come to mind. However, in the last decade, increased internet usage and greater access to mobile devices have catapulted the industry to new heights, with Kolkata-based companies like Virtualinfocom, Red Apple Technologies and Digitoonz, at the forefront, driving the design and animation of new gaming titles for players.

