Lead - SRE (Site Reliability Engineering)

Posted 18 Hours Ago
Be an Early Applicant
560042, Shivaji Nagar, Karnataka
Senior level
Information Technology
The Role
As a Tech Lead SRE, you will oversee site reliability engineers to enhance platform performance and reliability. You will manage production environments, optimize system performance, implement automation, and improve incident management processes for a seamless customer experience.
Summary Generated by Built In

At First Advantage (Nasdaq: FA), people are at the heart of everything we do. From our customers and partners to our greatest advantage — our team members. Operating with empathy and compassion, First Advantage fosters a global inclusive workforce devoted to the diverse voices that make up our talent and products. Our team members empower each other to be their authentic selves and treat all with respect, integrity, and fairness.
Say hello to a rewarding career and come join a leading provider of mission-critical background screening solutions to some of the most recognized Fortune 100 and Global 500 brands.
We are seeking a Tech Lead SRE to empower our platforms with high availability, and stellar performance level.
What We Do:
We are on the frontline of recruitment enabling organizations to Hire Smarter. Onboard Faster™ First Advantage is an HR Tech company delivering innovative solutions and insights to enable our clients to manage risk and hire the best talent. Leveraging an advanced technology platform, First Advantage builds fully scalable, configurable screening programs that meet the unique needs of over 33,000 clients. Headquartered in Atlanta, GA and with an internationally distributed workforce spanning 19 countries with about 5,500 employees, First Advantage performs over 93 million screens in over 200 countries and territories annually.
Who You Are:
You are self-motivated and ready to “roll up your sleeves." While you are an independent contributor, you are also collaborative. You can spearhead a project and see it through from start to completion.
As a team player, you navigate cross-functional teams and work well with team members in other business units and departments toward a common goal.
An Innovator — you see gaps in current processes or workflows as an opportunity to improve and try something new.
A lifelong learner and always seeking out opportunities to learn and upskill, you understand the importance of thorough and secure screenings and are interested in the Human Capital sector and the confluence of people, process, and technology.

What You'll Do 

A successful Technical Lead of site reliability engineers (SREs) to empower our platforms with high availability, and stellar performance level. As we expand our SRE team, we are currently seeking an experienced SRE to deliver reliable and scalable Technology solutions to our clients that enable best in class customer experience. Specifically, we are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.

Responsibilities:

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Drive implementation of automation and monitoring to promote early detection, self-healing, improved availability, and decreased number of outages
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Reduce operational inefficiencies in the incident management process to ensure the fastest path to recovery through automation and continuous process improvement. Identify when escalation is required and trigger such escalation accordingly.
  • This role will be strategic in nature implementing best in class Incident response and communications through modern solutions such as Teams, SharePoint, etc. This will ensure our internal stakeholders and customers have accurate communications of any ongoing outages and what we are doing to restore as well as prevent it from occurring again. This includes driving Incident bridges to resolution with highest sense of urgency
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
  • Create and maintain recovery playbooks for commonly occurring customer patterns and issues. Drive down resolution times by improving alert coverage and accuracy.
  • Create sustainable systems and services through automation and uplifts
  • Implement Automated Recovery Scripts and other monitoring enhancements
  • Participate in system design consulting, platform management, and capacity planning
  • Provide primary operational support and engineering for multiple large, distributed software applications
  • Lead after action reviews and root cause analysis on a timely basis that identify repair items preventing future customer impact. Ensure resolution of product/service defects, process improvements and documentation enhancement to address live site or customer reported incidents

What You May Need to be Successful

4-year College minimum in related technology field (Computer, Engineering, Science, etc.) or comparable job experience. SRE (Site Reliability Engineering) related Certification.

  • 7+ years of experience in information technology preferably managing large-scale environments
  • Recent work experience in an SRE role implementing best in class Reliability solutions in a Large Product development organization
  • 3+ years of work experience with public cloud platform Azure
  • Experience with Azure Monitor & AppInsights is preferred
  • Ability to program (structured and OO) with one or more high level languages, such as Python, Java, C/C++, and JavaScript
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
  • Outstanding communication and presentation skills, written and verbal. Excellent listening skills and a high degree of empathy.
  • Proficient in quick problem-solving skills with attention to detail.
  • You must be able to work outside of normal business hours (weekend shifts, holidays, & evenings)
  • Excellent managerial skills and ability to collaborate with team members.
  • Strong analytical, and time management skills.
  • Incorporate various software engineering aspects to develop and implement services that improve IT and support teams. Services can range from production code changes to alerting and monitoring adjustments

Why First Advantage is Your Next Big Career Move

First Advantage is going through a technology transformation! We are looking for experts who are excited to work with advanced technologies and provide best-in-class user experiences, drive the development and deployment of scalable solutions, and smoothly guide our agile teams and clients through meaningful changes as we continue to expand our impact.

More About Our Values Code

  • Honor Honesty, Consistency, Responsibility: Do the right thing
  • Cultivate an environment of dignity: Show respect for the individual
  • Take an Outside-In approach: Put the client first
  • Think out-of-the-box: Innovate and create
  • Stay Team-Oriented: Collaborate and appreciate each other

What Are You Waiting For? Apply Today!

You have learned a little about us today – we want to learn about you! If you think this position and our company are a great fit for your areas of interest and expertise, tell us about you by applying now!

EMPLOYEE BENEFITS – India Region:

  • Most of the roles are enabled with the ability to work remotely with occasional business travel. Hybrid working model
  • Comprehensive employee Leave policy
  • Career progressions through Internal job opportunities and Global Talent mobility programs
  • Career Development: Mentoring Program, People Management Program, cross-functions training, soft skills training.
  • Continuous learning and development opportunities. Upskilling and reskilling opportunities mobilized through e-learning platforms
  • Training and Certification reimbursement programs
  • Medical Insurance coverage for employees and parental insurance benefits available. Calendarized Employee Wellness programs
  • Quarterly Rewards and Recognition program to recognize exemplary performance
  • Other attractive allowances – Weekend working, Holiday pay, Relocation assistance, Maternity bonus, Creche allowance, Shift allowance etc.

Top Skills

Sre
The Company
HQ: Atlanta, GA
3,712 Employees
On-site Workplace
Year Founded: 2003

What We Do

First Advantage delivers comprehensive background check solutions and insights that enable employers and housing providers to make confident choices, reduce risk, and maintain compliance.

With offices in 26 locations and a staff of 4,000+ employees, First Advantage leverages leading technology and the industry’s largest global capabilities to complete background checks in 200+ countries and territories. If you’re looking for employee or tenant background check solutions that enable fast and reliable decision making, we’re your First Advantage.

For more detailed information on First Advantage products and services, visit fadv.com.

Similar Jobs

Easy Apply
Bangalore, Bengaluru, Karnataka, IND
1100 Employees

Take-Two Interactive Software Logo Take-Two Interactive Software

SRE I

Gaming • Information Technology • Mobile • Software
Hybrid
Bengaluru, Karnataka, IND
6500 Employees

NVIDIA Logo NVIDIA

Senior Site Reliability Engineer - GPU Cloud

Artificial Intelligence • Hardware • Robotics • Software • Metaverse
Bengaluru, Bengaluru Urban, Karnataka, IND
21960 Employees

Granicus LLC Logo Granicus LLC

Site Reliability Engineer 1

Cloud • Marketing Tech • Professional Services • Social Impact • Software
Hybrid
Bengaluru, Karnataka, IND
1500 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account