✨ AI Insights & Summary
Mindrift offers a unique opportunity to bridge the gap between human expertise and cutting-edge AI through its Tendem project. This part-time, remote role as an AI Pilot (Senior Python Data Scraping Engineer) is perfect for seasoned developers seeking to apply their critical thinking and specialized skills to real-world AI challenges. By collaborating with AI agents and contributing domain knowledge, you'll play a vital role in unlocking Generative AI's potential for major tech innovators, earning up to $25/hour equivalent while working flexibly.
About Mindrift
The Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.
About the Role: AI Pilot – Senior Python Data Scraping Engineer
As an AI Pilot, you will collaborate with Tendem Agents, leveraging your critical thinking, domain expertise, and quality control to ensure accurate and actionable results in specialized data scraping workflows. This is a freelance role for a Tendem project, ideal for technical professionals with hands-on experience in web scraping, data extraction, and processing.
Key Responsibilities
- Own end-to-end data extraction workflows across complex websites, ensuring complete coverage, accuracy, and reliable delivery of structured datasets.
- Leverage internal tools (Apify, OpenRouter) alongside custom workflows to accelerate data collection, validation, and task execution while meeting defined requirements.
- Ensure reliable extraction from dynamic and interactive web sources, adapting approaches as needed to handle JavaScript-rendered content and changing site behavior.
- Enforce data quality standards through validation checks, cross-source consistency controls, adherence to formatting specifications, and systematic verification prior to delivery.
- Scale scraping operations for large datasets using efficient batching or parallelization, monitor failures, and maintain stability against minor site structure changes.
Requirements
- Minimum 5+ years of relevant experience in data engineering, web scraping, automation, or software development (required).
- Bachelor’s or Master’s Degree in Engineering, Applied Mathematics, Computer Science, or related technical fields is a plus.
- Strong technical foundation and practical experience with scripting, automation, and AI-assisted workflows.
- Ability to solve non-trivial problems, work confidently with LLMs, and systematically collect, structure, and validate data from diverse sources.
- Methodical, detail-oriented approach and ability to work independently.
- Strong experience in Python web scraping (BeautifulSoup, Selenium or similar), including dynamic content (JS, AJAX, infinite scroll) and APIs via proxies.
- Proven ability to extract data from complex structures (hierarchies, archived pages, inconsistent HTML).
- Solid background in data cleaning, normalization, and validation, delivering structured datasets (CSV, JSON, Google Sheets).
- Demonstrated experience handling anti-bot mechanisms and dynamic site structures at scale.
- Experience with cloud infrastructure (AWS or equivalent) and containerization (Docker) as part of real workflows.
- Hands-on experience with LLM frameworks (LangChain, OpenRouter, or similar) applied to automation tasks.
- Strong attention to detail and commitment to data accuracy.
- Self-directed work ethic with the ability to troubleshoot independently.
- A link to GitHub is a plus.
- English proficiency: Upper-intermediate (B2) or above (required).
Project Time Expectations
Tasks are estimated to require approximately 10–20 hours per week during active phases, based on project requirements. This is an estimate and not a guaranteed workload, applying only while the project is active.
Compensation
Contributors can earn up to $25 per hour equivalent, depending on their level and pace of contribution. Compensation varies across projects based on scope, complexity, and required expertise.