YipitData Logo

YipitData

Data Engineer (Web Scraping)

Posted 2 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Mid level
Remote
Hiring Remotely in India
Mid level
As a Web Scraping Engineer, you'll design, build, and maintain web scrapers, implementing advanced techniques and collaborating with teams to ensure efficient data processing and quality.
The summary above was generated by AI

About Us:

YipitData is the leading market research and analytics firm for the disruptive economy and most recently raised $475M from The Carlyle Group at a valuation of over $1B. Every day, our proprietary technology analyzes billions of alternative data points to uncover actionable insights across sectors like software, AI, cloud, e-commerce, ridesharing, and payments.

Our data and research teams transform raw data into strategic intelligence, delivering accurate, timely, and deeply contextualized analysis that our customers—ranging from the world’s top investment funds to Fortune 500 companies—depend on to drive high-stakes decisions. From sourcing and licensing novel datasets to rigorous analysis and expert narrative framing, our teams ensure clients get not just data, but clarity and confidence.

We operate globally with offices in the US (NYC, Austin, Miami, Mountain View), APAC (Hong Kong, Shanghai, Beijing, Guangzhou, Singapore), and India. Our award-winning, people-centric culture—recognized by Inc. as a Best Workplace for three consecutive years—emphasizes transparency, ownership, and continuous mastery.

What It’s Like to Work at YipitData:

YipitData isn’t a place for coasting—it’s a launchpad for ambitious, impact-driven professionals.

From day one, you’ll take the lead on meaningful work, accelerate your growth, and gain exposure that shapes careers.

Why Top Talent Chooses YipitData:

  • Ownership That Matters: You’ll lead high-impact projects with real business outcomes
  • Rapid Growth: We compress years of learning into months
  • Merit Over Titles: Trust and responsibility are earned through execution, not tenure
  • Velocity with Purpose: We move fast, support each other, and aim high—always with purpose and intention

If your ambition is matched by your work ethic—and you're hungry for a place where growth, impact, and ownership are the norm—YipitData might be the opportunity you’ve been waiting for.

About The Role:

We are seeking a Web Scraping Engineer to join our growing engineering team. In this hands-on role, you’ll take ownership of designing, building, and maintaining robust web scrapers that power critical reports and customer experiences across our organization. You will work on complex, high-impact scraping challenges and collaborate closely with cross-functional teams to ensure our data ingestion processes are resilient, efficient, and scalable, while delivering high-quality data to our products and stakeholders.

As Our Web Scraping Engineer You Will:

Refactor and Maintain Web Scrapers

  • Overhaul existing scraping scripts to improve reliability, maintainability, and efficiency.
  • Implement best coding practices (clean code, modular architecture, code reviews, etc.) to ensure quality and sustainability.

Implement Advanced Scraping Techniques

  • Utilize sophisticated fingerprinting methods (cookies, headers, user-agent rotation, proxies) to avoid detection and blocking.
  • Handle dynamic content, navigate complex DOM structures, and manage session/cookie lifecycles effectively.

Collaborate with Cross-Functional Teams

  • Work closely with analysts and other stakeholders to gather requirements, align on targets, and ensure data quality.
  • Provide support, documentation, and best practices to internal stakeholders to ensure effective use of our web scraped data in critical reporting workflows.

Monitor and Troubleshoot

  • Develop robust monitoring solutions, alerting frameworks  to quickly identify and address failures.
  • Continuously evaluate scraper performance, proactively diagnosing bottlenecks and scaling issues.

Drive Continuous Improvement

  • Propose new tooling, methodologies, and technologies to enhance our scraping capabilities and processes.
  • Stay up to date with industry trends, evolving bot-detection tactics, and novel approaches to web data extraction.

This is a fully-remote opportunity based in India. Standard work hours are from 11am to 8pm IST, but there is flexibility here.

You Are Likely To Succeed If:

  • Effective communication in English with both technical and non-technical stakeholders.
  • You have a track record of mentoring engineers and managing performance in a fast-paced environment.
  • 3+ years of experience with web scraping frameworks (e.g., Selenium, Playwright, or Puppeteer).
  • Strong understanding of HTTP, RESTful APIs, HTML parsing, browser rendering, and TLS/SSL mechanics.
  • Expertise in advanced fingerprinting and evasion strategies (e.g., browser fingerprint spoofing, request signature manipulation).
  • Deep experience managing cookies, headers, session states, and proxy rotations, including the deployment of both residential and data center proxies.
  • Experience with logging, metrics, and alerting to ensure high availability.
  • Troubleshooting skills to optimize scraper performance for efficiency, reliability, and scalability.

What We Offer:

Our compensation package includes comprehensive benefits, perks, and a competitive salary: 

  • We care about your personal life, and we mean it. We offer flexible work hours, flexible vacation, a generous 401K match, parental leave, team events, wellness budget, learning reimbursement, and more!
  • Your growth at YipitData is determined by the impact that you are making, not by tenure, unnecessary facetime, or office politics. Everyone at YipitData is empowered to learn, self-improve, and master their skills in an environment focused on ownership, respect, and trust. See more on our high-impact, high-opportunity work environment above!

We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal-opportunity employer.

Job Applicant Privacy Notice

Top Skills

Cookies
Headers
HTML
HTTP
Playwright
Proxies
Puppeteer
Restful Apis
Selenium
Session States
Ssl
Tls
Web Scraping

Similar Jobs

41 Minutes Ago
Easy Apply
Remote
India
Easy Apply
Senior level
Senior level
Artificial Intelligence • Edtech • Mobile • Natural Language Processing • Productivity • Software
Lead AI writing product strategy and development while collaborating with cross-functional teams to enhance long-form writing experiences.
Top Skills: Ai Writing ToolsApple PagesConfluenceGoogle DocsMs WordNotionOverleafWordpress
2 Hours Ago
Remote or Hybrid
Hyderabad, Telangana, IND
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
As a Senior Production Infrastructure Engineer, you will automate project creation in Google Cloud, manage system performance, and enhance security protocols.
Top Skills: AnsibleBashElasticsearchGCPMongoDBMySQLNginxPython
2 Hours Ago
Remote or Hybrid
Hyderabad, Telangana, IND
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
This role involves designing and deploying machine learning solutions, especially in large language models, leading research and development, and collaborating with various teams.
Top Skills: DockerHuggingface TransformersKubernetesMlflowNumpyPythonPyTorchTensorFlow

What you need to know about the Kolkata Tech Scene

When considering the industries shaping India's tech scene, gaming might not immediately come to mind. However, in the last decade, increased internet usage and greater access to mobile devices have catapulted the industry to new heights, with Kolkata-based companies like Virtualinfocom, Red Apple Technologies and Digitoonz, at the forefront, driving the design and animation of new gaming titles for players.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account