Tech Holding Logo

Tech Holding

ML / AI Data Engineer (Contract)

Posted Yesterday
Be an Early Applicant
Remote
Hiring Remotely in India
Senior level
Remote
Hiring Remotely in India
Senior level
The role involves designing and optimizing large-scale ML data pipelines, ensuring high-throughput data ingestion and processing, collaborating with teams on data workflows, and architecting GPU-based environments.
The summary above was generated by AI

About us:

Working at Tech Holding isn't just a job, it's an opportunity to be a part of something bigger. We are a full-service consulting firm that was founded on the premise of delivering predictable outcomes and high-quality solutions to our clients.  Our founders and team members have industry experience and have held senior positions in a wide variety of companies – from emerging startups to large Fortune 50 firms – and we have taken our combined experiences and developed a unique approach that is supported by the principles of deep expertise, integrity, transparency, and dependability.

We are looking for a highly skilled Senior ML / Data Pipeline Engineer who can translate complex machine learning and multimodal concepts into scalable, production-ready pipelines and workflows.
This role focuses on building and optimising large-scale video and multimodal data systems, enabling high-throughput ingestion, processing, and model training across distributed cloud environments.
Key Responsibilities
  • Design, deploy, and scale large-scale ML and data processing pipelines across cloud infrastructure.
  • Build systems to ingest, process, and serve 250,000+ hours of multimodal data (video, audio, metadata).
  • Architect and optimize GPU-based compute environments (e.g., NVIDIA Tesla clusters) for distributed training and inference.
  • Develop high-throughput backend systems for video ingestion from desktop and mobile platforms.
  • Implement distributed processing workflows, including job scheduling, fault tolerance, and resource allocation.
  • Design and build human-in-the-loop and automated annotation systems to ensure data quality and scalability.
  • Translate ML and multimodal research into scalable, production-grade cloud architectures.
  • Optimize pipelines for performance, reliability, and cost efficiency across compute, storage, and networking layers.
  • Collaborate with ML, data, and engineering teams to deliver end-to-end data workflows.
Requirements
  • 5+ years of experience in data engineering, ML pipelines, or distributed systems.
  • Strong experience building scalable data pipelines for large datasets (video/audio preferred).
  • Hands-on experience with cloud platforms (AWS, Azure, or GCP).
  • Experience working with GPU-based environments and distributed computing.
  • Strong programming skills in Python, Scala, or similar languages.
  • Experience with data processing frameworks (Spark, Ray, Kafka, Airflow, or similar).
  • Understanding of ML workflows, training pipelines, and inference systems.
  • Experience designing fault-tolerant, high-availability systems.
  • Strong knowledge of data storage systems (data lakes, object storage, distributed file systems).
  • Ability to handle high-throughput, large-scale data ingestion and processing.
Good to Have
  • Experience with multimodal AI (video, audio, NLP) systems.
  • Familiarity with annotation tools and data labeling workflows.
  • Experience with containerization and orchestration (Docker, Kubernetes).
  • Knowledge of cost optimization strategies for large-scale cloud workloads.

Tech Holding is proud to be an Equal Opportunity Employer and is committed to fostering a diverse and inclusive workplace. We welcome applicants from all backgrounds and experiences, and we consider qualified applicants without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, disability, veteran status, or any other legally protected characteristic. If you require accommodation in the application process, please contact our HR 

Similar Jobs

7 Hours Ago
Remote or Hybrid
Senior level
Senior level
Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
As a Lead Customer Success Manager, you will drive customer adoption of Dynatrace, build executive relationships, manage risks, and ensure long-term value for enterprise clients in India.
Top Skills: AWSAzureGCPKubernetes
8 Hours Ago
Easy Apply
Remote
India
Easy Apply
Senior level
Senior level
Artificial Intelligence • Enterprise Web • Information Technology • Productivity • Sales • Software • Database
As a Senior Backend Engineer, you'll design scalable backends, mentor team members, and lead software development lifecycle activities while improving quality and performance.
Top Skills: AnsibleDockerElasticsearchKubernetesMongoDBNode.jsReactRedisReduxRubyRuby On RailsTerraform
8 Hours Ago
Easy Apply
Remote
India
Easy Apply
Senior level
Senior level
Artificial Intelligence • Enterprise Web • Information Technology • Productivity • Sales • Software • Database
As a Senior Backend Engineer at Apollo.io, you'll design scalable backend solutions, mentor teammates, and work cross-functionally to enhance product quality and performance.
Top Skills: AIAnsibleDockerElasticsearchKubernetesMongoDBNode.jsReactRedisReduxRubyRuby On RailsTerraform

What you need to know about the Kolkata Tech Scene

When considering the industries shaping India's tech scene, gaming might not immediately come to mind. However, in the last decade, increased internet usage and greater access to mobile devices have catapulted the industry to new heights, with Kolkata-based companies like Virtualinfocom, Red Apple Technologies and Digitoonz, at the forefront, driving the design and animation of new gaming titles for players.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account