Lead reliability and performance engineering for distributed systems: design and run performance tests (stress, load, endurance), define SLIs/SLOs, implement observability (OpenTelemetry, Grafana, Splunk), participate in on-call incident response, and drive operational excellence and performance strategy across teams.
Requisition Number: 2366671
Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.
Primary Responsibilities:
Required Qualifications:
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone - of every race, gender, sexuality, age, location and income - deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.
#NIC #NJP
Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.
Primary Responsibilities:
- Reliability & Performance Engineering
- Support the reliability, availability, and performance of distributed systems across cloud, edge, and device
- Create and execute test plans, conduct requirements reviews, perform test verification and validation, analyze results and document results.
- Collaborate closely with multifunctional teams to determine quality and suitability of cable assembly products. Maintain records for test execution, schedule, and reporting
- Help define, measure, and monitor SLIs and SLOs for services
- Identify reliability risks and collaborate with senior engineers on mitigation plans.
- Operational Excellence
- Participate in on-call rotations and assist with incident response and post-incident reviews
- Contribute improvements to runbooks, automation, and tooling that reduce alert noise and operational toil
- Help enhance detection, alerting, and response workflows
- Observability & Insight
- Implement and improve telemetry using OpenTelemetry, Grafana,splunk and related tools
- Build dashboards and tools that improve visibility into system health and AI service behavior
- Ensure observability data is complete, accurate, and actionable
- Collaboration & Best Practices
- Work closely with senior SREs, DevOps engineers, AI/ML teams, and platform engineers
- Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regard to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so
Required Qualifications:
- Undergraduate degree or equivalent experience
- 10+ years of Performance testing experience
- Solid knowledge of Performance Bottleneck analysis using various tools
- Knowledge of Neoload Tool (is positive)
- Knowledge of Cyara Voice Testing tool
- Leadership knowledge - Performance strategy, Plan and Reporting
- Engineering concepts of working with dev for Stress, Load and Endurance testing
- Collaboration with other team
- Proven excellent communication skills
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone - of every race, gender, sexuality, age, location and income - deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.
#NIC #NJP
Similar Jobs at Optum
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Lead Full Stack Engineer to guide technical design and delivery across architecture, product, and engineering. Responsibilities include full lifecycle development, building and deploying AI/ML features, cloud-native solutions, CI/CD, mentoring engineers, managing technical roadmaps, and improving scalable processes to meet business outcomes.
Top Skills:
Ai/MlAWSAzureC++Ci/CdGCPGitJavaNoSQLPythonSQL
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design, build, deploy, and support scalable full-stack applications using Angular frontend and .NET/C# backend. Lead cloud architecture (Azure/AWS), CI/CD with GitHub workflows and JFROG, integrate security (SAST/DAST), implement observability with Datadog, and develop AI-enabled solutions. Mentor teammates and own application health from design through production.
Top Skills:
.NetAngularAWSAzureC#Ci/CdDastDatadogGithub WorkflowsJfrog ArtifactorySast
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Senior full-stack engineer owning React frontend and Java/Spring Boot backend. Build modular React components (TypeScript preferred), design and implement REST APIs and microservices, integrate with AWS services (API Gateway, IAM, S3, DynamoDB), use Kafka for eventing, apply CI/CD (GitHub Actions/CodePipeline), ensure secure API consumption (JWT/CORS), test automation, and integrate agentic frameworks like AWS Bedrock Agents within Agile teams.
Top Skills:
Aws Api GatewayAws Bedrock AgentsAws CodepipelineAws IamCi/CdCorsDynamoDBGitGithub ActionsIntegration TestingJavaJavaScriptJwtKafkaMicroservicesNoSQLPresigned UrlsReactRest ApisS3Spring BootTest AutomationTypescriptUnit Testing
What you need to know about the Kolkata Tech Scene
When considering the industries shaping India's tech scene, gaming might not immediately come to mind. However, in the last decade, increased internet usage and greater access to mobile devices have catapulted the industry to new heights, with Kolkata-based companies like Virtualinfocom, Red Apple Technologies and Digitoonz, at the forefront, driving the design and animation of new gaming titles for players.

