Web Data Crawling

Upwork | TR | Not specified | expert | Score: 65
Data Extraction, Web Scraping, Web Crawling, Selenium WebDriver, Data Scraping, Browser Automation, Reverse Engineering
We are seeking an experienced engineer to design and implement a resilient web data acquisition solution for collecting publicly accessible information from designated websites. The ideal candidate will have strong expertise in web architecture, browser automation, and scalable crawling systems.

The objective of this project is to build a stable, high-performance data collection framework capable of handling dynamic content, structural variability, and operational constraints across complex websites. All activities must be conducted strictly within applicable legal boundaries and in compliance with relevant terms of service and data usage policies.

Responsibilities
- Design and develop a robust, scalable crawling framework
- Extract structured data from dynamic, JavaScript-rendered websites
- Analyze network traffic to identify relevant data endpoints
- Implement intelligent retry, throttling, and session management mechanisms
- Ensure system stability, monitoring, and performance optimization
- Collaborate with internal teams to define data schemas and delivery requirements

Required Skills
- Strong proficiency in Python (or equivalent) and asynchronous programming
- Advanced understanding of HTTP/HTTPS, sessions, cookies, headers, and request lifecycles
- Experience with browser automation tools (Playwright or Selenium)
- Ability to analyze and replicate network requests (HAR files, API inspection, GraphQL, XHR)
- Experience handling session persistence, rate limits, and adaptive traffic management
- Knowledge of distributed crawling architectures and job orchestration systems
- Experience deploying scalable systems in cloud environments (AWS, GCP, or Azure)

Candidates with proven experience building high-availability, production-grade crawling systems are encouraged to apply.

Client

Spent: $200 | Rating: 0.0 | Verified