DeepSource Logo

DeepSource

L1 Data Engineer - Remote

Posted Yesterday
Remote
Hiring Remotely in India
Mid level
Remote
Hiring Remotely in India
Mid level
The Data Engineer is responsible for designing and maintaining data architecture and infrastructure, developing data solutions, and ensuring they meet organizational needs.
The summary above was generated by AI

We are looking for a motivated and technically solid L1 Data Engineer to join our growing Data & Analytics team. In this role, you will be responsible for designing, building, and maintaining the data architecture and infrastructure that supports our organization's data strategy. You will work hands-on to develop, test, and deploy reliable data solutions — ensuring pipelines are scalable, efficient, and aligned with business requirements.

This is an ideal opportunity for a data professional who is eager to deepen their expertise in cloud-native data platforms, particularly within the Microsoft Azure and Databricks ecosystem, and who thrives in a collaborative, fast-paced environment.

KEY RESPONSIBILITIES

• Design, develop, and maintain scalable data pipelines and ETL/ELT workflows to support business intelligence and analytics use cases.

• Build and optimize data ingestion processes using Azure Data Factory and Databricks, ensuring data quality and consistency across all layers of the data platform.

• Transform and process large datasets using PySpark and Python, applying best practices for performance and maintainability.

• Write and optimize complex SQL queries to support analytical reporting and data validation requirements.

• Collaborate with data architects and senior engineers to implement and maintain data models aligned with organizational standards.

• Monitor, troubleshoot, and resolve pipeline failures and data quality issues, applying root-cause analysis to prevent recurrence.

• Contribute to documentation of data pipelines, data dictionaries, and engineering standards.

• Support the team in exploring and evaluating new tools and approaches to continuously improve the data infrastructure.


Requirements
  • 2+ years of professional experience in a Data Engineering or closely related role.
  • Strong proficiency in Python for data processing, transformation, and automation tasks.
  • Hands-on experience with Pandas for data manipulation and PySpark for distributed data processing.
  • Practical experience with Databricks, including notebook development, clusters, and job orchestration.
  • Experience building and managing data pipelines with Azure Data Factory.
  • Working knowledge of Azure Synapse Analytics, particularly Spark pool integration.
  • Solid SQL skills, including query writing, optimization, and performance tuning.
  • Familiarity with data engineering principles including incremental loading, data lake architecture, and Delta Lake.
  • Understanding of data governance and security concepts within a cloud data platform.

NICE TO HAVE

  • Experience with SQL Server migration projects, including schema conversion and data movement.
  • Exposure to Terraform for Azure infrastructure provisioning and management.
  • Familiarity with CI/CD practices applied to data engineering workflows.
  • Experience with Delta Sharing or Lakehouse Federation concepts.

CERTIFICATION REQUIREMENT

  • Candidates are expected to hold or be actively working toward the Databricks Certified Data Engineer Associate certification. This certification validates foundational knowledge across the following domains:
  • Databricks Lakehouse Platform architecture and capabilities
  • ETL and ELT workflows using Spark SQL and PySpark
  • Incremental data processing and structured streaming
  • Production pipeline development and orchestration
  • Data governance and security within the Databricks environment

Similar Jobs

Yesterday
In-Office or Remote
Mid level
Mid level
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
This role involves managing IP-backbone security devices, troubleshooting issues, defining threats, and planning security access requests while collaborating with a telecom security team.
Top Skills: Authentication ServersDdos Mitigation SolutionsF5 AfmF5 AsmF5 GtmF5 LtmFortinetIpsJuniper SrxLoad BalancersMulti-Vendor FirewallsToken Server
3 Days Ago
In-Office or Remote
Internship
Internship
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
As a GenAI Developer Intern, you will develop generative AI applications on AWS, manage workflows, and ensure safety and observability in AI tools while collaborating across teams.
Top Skills: Api GatewayAutogenAWSBedrockCloudwatchCrewaiDockerEc2Ecs/FargateFastapiFlaskIamLambdaLangchainLanggraphLarge Language ModelsOpenai Agents SdkPythonS3SagemakerStep FunctionsTerraform
4 Days Ago
Remote or Hybrid
Internship
Internship
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
The Demand Planning Intern will develop Power BI dashboards, monitor daily shipments, validate production plans, track forecast accuracy, and support ad hoc analyses.
Top Skills: Power BI

What you need to know about the Kolkata Tech Scene

When considering the industries shaping India's tech scene, gaming might not immediately come to mind. However, in the last decade, increased internet usage and greater access to mobile devices have catapulted the industry to new heights, with Kolkata-based companies like Virtualinfocom, Red Apple Technologies and Digitoonz, at the forefront, driving the design and animation of new gaming titles for players.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account