The role involves building and maintaining automated pipelines for data collection and cleaning from public sources, integrating scrapers with backend systems, and ensuring data accuracy and compliance with platform rules.
Web Scraper / Data Engineer
Location: Remote
Job Type: Full-time
About STERRY
At STERRY, we’re not your average Growth Marketing Agency: we’re the rocket fuel behind crowdfunding and e-commerce success. Since day one, we’ve helped clients pull in over $100 million in trackable online revenue. We build strategies that go beyond brand and marketing, delivering measurable results rooted in online performance.
Role Overview
We’re looking for an experienced Web Scraper / Data Engineer to help us build and maintain automated pipelines that collect, clean, and enrich creator and campaign data from public sources. You’ll be responsible for designing reliable, scalable scrapers and integrating them with our backend.
Responsibilities
- Build scrapers and crawlers to collect creator profile data (followers, engagement, category, contact info, etc.) from social platforms (TikTok, Instagram, YouTube, etc.) and directories
- Parse and clean unstructured data into structured datasets (JSON, CSV, or direct to database)
- Integrate with APIs (YouTube, TikTok, Instagram, etc.) where possible
- Detect and handle rate limits, CAPTCHA, and anti-bot mechanisms
- Implement and monitor scraping tasks using proxy rotation and headless browsers (Puppeteer, Playwright, Selenium, etc.)
- Collaborate with the backend team to feed data into our AI recommendation engine
- Maintain high data accuracy, freshness, and compliance with platform TOS and privacy rules
Requirements
- 2+ years of experience building web scrapers, crawlers, or data extraction pipelines
- Strong Python or Node.js skills (BeautifulSoup, Playwright, Puppeteer, Scrapy, or similar)
- Experience with APIs, JSON, REST, and rate-limiting management
- Familiarity with databases (MongoDB, PostgreSQL, Firebase, etc.)
- Knowledge of proxies, headless browsers, and data scaling infrastructure
- Attention to detail and ability to deliver clean, well-documented code
- (Bonus) Experience with influencer data, social analytics, or SaaS platforms
What We Offer
- Flexible working hours (remote-first)
- Competitive pay (hourly or project-based)
- Long-term potential to transition into a data engineering role
- Opportunity to shape the foundation of a fast-growing AI SaaS startup
Top Skills
BeautifulSoup
Firebase
JSON
MongoDB
Node.js
Playwright
Postgres
Puppeteer
Python
REST
Scrapy