All jobs
MindriftData
Senior Python Data Scraping Engineer (Freelance)
Remoteup to $45 per hourPosted 2 days ago
Mindrift is seeking highly skilled Senior Python Data Scraping Engineers for the Tendem project to develop specialized data scraping workflows within a hybrid AI + human system. The role involves collaboration with Tendem Agents to ensure accurate data extraction and quality control, focusing on web scraping, data extraction, and processing.
Location: Remote
Salary: up to $45 per hour
Responsibilities
- Own end-to-end data extraction workflows across complex websites, ensuring complete coverage, accuracy, and reliable delivery of structured datasets.
- Leverage internal tools (Apify, OpenRouter) alongside custom workflows to accelerate data collection, validation, and task execution while meeting defined requirements.
- Ensure reliable extraction from dynamic and interactive web sources, adapting approaches as needed to handle JavaScript-rendered content and changing site behavior.
- Enforce data quality standards through validation checks, cross-source consistency controls, adherence to formatting specifications, and systematic verification prior to delivery.
- Scale scraping operations for large datasets using efficient batching or parallelization, monitor failures, and maintain stability against minor site structure changes.
Requirements
- At least 5+ years of relevant experience in data engineering, web scraping, automation, or software development.
- Bachelor’s or Master’s Degree in Engineering, Applied Mathematics, Computer Science, or related fields is a plus.
- Strong technical foundation and practical experience with scripting, automation, and AI-assisted workflows.
- Strong experience in Python web scraping (BeautifulSoup, Selenium or similar), including dynamic content (JS, AJAX, infinite scroll) and APIs via proxies.
- Proven ability to extract data from complex structures (hierarchies, archived pages, inconsistent HTML).
- Solid background in data cleaning, normalization, and validation, delivering structured datasets (CSV, JSON, Google Sheets).
- Experience handling anti-bot mechanisms and dynamic site structures at scale.
- Experience with cloud infrastructure (AWS or equivalent) and containerization (Docker).
- Hands-on experience with LLM frameworks (LangChain, OpenRouter, or similar).
- Strong attention to detail and commitment to data accuracy.
- Self-directed work ethic with troubleshooting skills.
- English proficiency: Upper-intermediate (B2) or above.
Benefits
- Competitive hourly compensation up to $45 per hour.
- Flexible part-time schedule estimated at 10–20 hours per week during active project phases.
Additional Information
- This is a freelance, part-time remote role for the Tendem project.
- The workload is estimated and not guaranteed, depending on project needs.
- Other projects on the platform may offer different compensation levels.
Similar remote jobs
Epidemiologist
Remote or onsite supporting Atlanta, GA-based Centers for Disease Control and Prevention (CDC)$95,000- $105,000
today