Senior System Software Engineer, Conversational AI

Posted 16 Hours Ago
Īnd, Chamba, Himāchal Pradesh
Senior level
Artificial Intelligence • Hardware • Robotics • Software • Metaverse
The Role
The Senior System Software Engineer for Conversational AI at NVIDIA will architect and optimize scalable systems for AI agents using Retrieval Augmented Generation. Responsibilities include implementing multi-turn conversation workflows, analyzing system performance, collaborating on product features, and integrating AI frameworks with existing products, all within a dynamic team environment.
Summary Generated by Built In

NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars and robotics to intelligent assistants. Come join the team and see how you can make a lasting impact on the world! We're looking to grow our company and build our teams with the smartest people in the world. Join us at the forefront of technological advancement. NVIDIA is looking for a System Software Engineer to develop tools for building powerful, flexible, multi-modal AI agents driven by Large Language Models (LLMs) and to improve the experience of millions of customers. If you're creative and passionate about solving real-world conversational AI problems, come join us.

What you’ll be doing:

  • Architect, implement, and optimize GPU-accelerated, scalable Retrieval Augmented Generation (RAG) workflows. Build a scalable, microservice-based architecture deployable in multi-node, multi-cloud environments.

  • Design, implement, and test domain-specific agents and workflows, along with a framework that can support multi-turn, multi-modal, multi-user conversations with LLM-driven agents.

  • Develop knowledge discovery and reasoning capabilities for dialogue systems, including but not limited to disambiguation, clarification, and anticipation.

  • Analyze the end-to-end accuracy and limitations of RAG and conversational AI agents, and recommend next steps and improvements.

  • Characterize performance and quality metrics across platforms for various AI and system components.

  • Collaborate with various teams on new product features and improvements to existing products. Customize and integrate the conversational AI framework with other NVIDIA products.

  • Participate in developing and reviewing code, design documents, use cases, and test plans; help innovate, identify problems, recommend solutions, and perform triage in a collaborative team environment.

What we need to see:

  • Bachelor's degree or Master’s degree (or equivalent experience) in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math

  • 5+ years of experience

  • Excellent programming skills in Python

  • Hands-on experience with Retrieval Augmented Generation-based applications

  • Working knowledge of Large Language Model applications, agentic workflows, and LLM guardrails

  • Understanding of scalable deployment of LLM-driven RAG and agent applications in production environments

  • Familiarity with microservices, Docker, Helm, Kubernetes, etc.

  • Experience with the end-to-end software lifecycle, release packaging, and CI/CD pipelines

  • Hands-on experience with conversational AI technologies such as Large Language Models (LLMs), LLM function calling, information retrieval, vector databases, embedding and rerank models, autonomous agents, etc.

  • General background in version control and code review tools such as Git, Gerrit, and GitLab

  • Strong collaborative and interpersonal skills, specifically a proven ability to guide and influence effectively within a dynamic environment

Ways to stand out from the crowd:

  • Strong fundamentals in programming, optimization, and software design

  • Experience with open source frameworks such as LangChain and LlamaIndex for building LLM-driven applications

  • Strong knowledge of ML/DL techniques, algorithms, and tools, with exposure to language models

  • Familiarity with GPU-based technologies such as CUDA, cuDNN, and TensorRT

  • Background in deploying machine learning models in data center, cloud, and embedded environments

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Python
The Company
HQ: Santa Clara, CA
21,960 Employees
On-site Workplace
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”
