Coupa

Senior AI Engineer, NLP & Training Data - 11316

Posted 4 Hours Ago
In-Office or Remote
Hiring Remotely in Bangalore, Bengaluru Urban, Karnataka
Senior level
The Senior AI Engineer will design training data generation pipelines, build data labeling workflows, and analyze model evaluation results to improve dataset quality for NLP models.
The summary above was generated by AI
Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.

Why join Coupa?

🔹 Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.
🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.
🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other. 

Learn more on the Life at Coupa blog and hear from our employees about their experiences working at Coupa.

The Impact of a Senior AI Engineer, NLP & Training Data at Coupa: 

Coupa's AI platform uses a range of models, from classical ML classifiers to frontier LLM integrations, to power features across spend management. The Senior AI Engineer, NLP & Training Data will build the data factory that produces high-quality training datasets for our model development efforts. You will design pipelines that generate, label, validate, and curate the datasets used to improve model accuracy across Coupa's product suite.

What You’ll Do

  • Design and implement training data generation pipelines, including synthetic data generation.
  • Build data labeling and annotation workflows with quality validation loops.
  • Convert enterprise data into formats suitable for model training (instruction-tuning pairs, embeddings).
  • Implement active learning strategies to identify high-value training examples.
  • Collaborate with domain experts to validate training data quality and relevance.
  • Build automated data quality checks: coverage, balance, consistency.
  • Design training data versioning and lineage tracking.
  • Analyze model evaluation results to identify training data gaps.

What You Will Bring to Coupa

  • 5+ years of software engineering experience, with 2+ years in NLP, data science, or ML data engineering.
  • Experience with text processing, tokenization, and NLP pipelines.
  • Hands-on experience with data labeling tools and annotation workflows.
  • Experience generating synthetic training data using language model APIs.
  • Understanding of instruction-tuning and training data quality metrics.
  • Proficiency in Python (pandas, PySpark).
  • Experience with data versioning tools is a plus.
  • BS/MS in Computer Science, NLP, or equivalent experience.

Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees. 

Please be advised that inquiries or resumes from recruiters will not be accepted.

By submitting your application, you acknowledge that you have read Coupa’s Privacy Policy and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.

Top Skills

Data Labeling Tools
ML
NLP
pandas
PySpark
Python


