
Completed
Posted
Paid on delivery
We are looking for an experienced Python engineer to develop a robust, production-grade web scraper designed to extract structured exam-style question data and deliver it to an existing Django backend via a secured API.

This is not a quick or hacky scraping task. The goal is to build a clean, maintainable, and resilient scraper that behaves like a human user, respects throttling limits, and can run unattended on a low-cost VPS without triggering blocks. The same developer will also handle the Django backend, so code clarity, architecture, and consistency are critical.

Core Responsibilities
- Build a Python-based scraper using Playwright (preferred) or a well-justified alternative.
- Extract structured question data, including:
  - Question text
  - Multiple-choice options (variable count)
  - Correct answer
  - Image URL (when available)
- Support multiple categories/modules via a configurable whitelist.
- Normalize and prepare data for backend ingestion (deduplication handled server-side).

Architecture & Integration
- The scraper will send data in batches (per test/session) to a Django REST API.
- Authentication via existing JWT endpoints.
- Additional security via IP whitelisting.
- One API call per completed batch (not per question).

API Behavior Expectations
- Backend responds with:
  - received / inserted / duplicates / failed counts
  - optional error summaries
  - optional next_backoff_seconds to dynamically slow down scraping
- On API failure:
  - Retry with backoff
  - If still failing: persist batch to disk (spool) and stop execution safely

Throttling & Human-Like Behavior (Configurable)
All delays must be fully configurable via YAML (no hardcoded values):
- Delay between interactions (seconds, support random ranges)
- Delay between batches/tests
- Long cooldown after N batches
- Respect backend-provided backoff signals
The scraper must be designed to avoid detection and blocking, prioritizing stability over speed.

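The throttling requirements above could be sketched roughly as follows. This is only an illustration, not part of the brief: it assumes the YAML file has already been parsed into a plain dict (for example with PyYAML's `yaml.safe_load`), and every key and function name (`throttle`, `interaction_delay`, `batch_delay`, `cooldown_every_n_batches`, `cooldown_seconds`, `next_batch_delay`) is hypothetical.

```python
from __future__ import annotations

import random
from dataclasses import dataclass

# Hypothetical YAML layout this sketch expects once parsed into a dict:
#   throttle:
#     interaction_delay: [1.0, 3.5]   # random range, seconds
#     batch_delay: [20, 45]
#     cooldown_every_n_batches: 10
#     cooldown_seconds: 600


@dataclass(frozen=True)
class ThrottleConfig:
    """Throttling settings; all field names are assumptions."""
    interaction_delay: tuple[float, float]  # (min, max) seconds between interactions
    batch_delay: tuple[float, float]        # (min, max) seconds between batches/tests
    cooldown_every_n_batches: int           # long cooldown after this many batches
    cooldown_seconds: float


def _as_range(value) -> tuple[float, float]:
    """Accept either a single number or a [min, max] pair."""
    if isinstance(value, (int, float)):
        return (float(value), float(value))
    low, high = value
    if low > high:
        raise ValueError(f"invalid delay range: {value!r}")
    return (float(low), float(high))


def load_throttle(cfg: dict) -> ThrottleConfig:
    """Build the throttle settings from the parsed YAML dict."""
    t = cfg["throttle"]
    return ThrottleConfig(
        interaction_delay=_as_range(t["interaction_delay"]),
        batch_delay=_as_range(t["batch_delay"]),
        cooldown_every_n_batches=int(t["cooldown_every_n_batches"]),
        cooldown_seconds=float(t["cooldown_seconds"]),
    )


def next_batch_delay(cfg: ThrottleConfig, batches_done: int,
                     backend_backoff: float | None = None,
                     rng: random.Random | None = None) -> float:
    """Seconds to wait before starting the next batch."""
    rng = rng or random
    # Random delay inside the configured range.
    delay = rng.uniform(*cfg.batch_delay)
    # Long cooldown after every N completed batches.
    n = cfg.cooldown_every_n_batches
    if n > 0 and batches_done > 0 and batches_done % n == 0:
        delay = max(delay, cfg.cooldown_seconds)
    # A backend-provided next_backoff_seconds can only lengthen the wait.
    if backend_backoff is not None:
        delay = max(delay, float(backend_backoff))
    return delay
```

Taking the maximum of the local delay and the backend signal means a backoff hint never shortens the wait, which matches the stated priority of stability over speed.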
Error Handling & Observability
- Clear structured logging (INFO / WARN / ERROR).
- On scraping failure:
  - Capture screenshot + HTML dump
  - Abort execution safely
  - Upload artifacts to a shared storage (e.g. Slack / Google Drive / Mega or equivalent)
- API failures must generate:
  - logs
  - persisted spool files (one JSON file per failed batch)

Configuration
- YAML-only configuration (no CLI overrides required).
- Configurable items include:
  - enabled modules (whitelist)
  - language selection
  - throttling / cooldown parameters
  - API endpoint & credentials
  - spool directory
  - runtime options (headless / headed)

Deliverables
1.- Clean, well-structured Python project:
  - Modular codebase
  - Clear separation of concerns
2.- Configuration template ([login to view URL])
3.- Logging & error-handling implementation
4.- API integration with retry + spool logic
5.- Screenshot & HTML capture on critical failures
6.- Documentation ([login to view URL]) covering:
  - setup
  - configuration
  - execution
  - failure recovery
7.- Dependency management:
  - [login to view URL]
  - [login to view URL]
  - Dockerfile

Quality Expectations
- No brittle hardcoded XPaths
- Robust selectors with fallbacks
- Clean, readable, maintainable code
- No scraping shortcuts that would cause instability
- Designed for long-running unattended execution

Acceptance Criteria
- Successfully handles all whitelisted modules
- Completes at least 10 test runs per module without failure
- Sends data to backend reliably in batch mode
- Correctly handles API backoff and failures
- Produces usable logs and diagnostics on error

Required Skills
- Strong Python experience
- Playwright or Selenium automation
- REST API integration (JWT-based auth)
- Experience with long-running scrapers
- Familiarity with rate-limiting and anti-blocking strategies
- Linux/VPS execution experience

Project Type
- Fixed-price project
- Milestones will be clearly defined and agreed upon before starting

Notes
- Target platform UI is not in English, so selector robustness is mandatory.
- This project values engineering quality and reliability over raw speed.
- Strong communication and clean delivery are expected.
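
As a rough illustration of the "retry with backoff, then spool and stop" behavior the brief asks for, here is a minimal sketch. It is not the client's implementation: the HTTP call is abstracted behind a caller-supplied `send` function (in a real project it would wrap the JWT-authenticated POST), and the names `deliver_batch`, `BatchDeliveryError`, `max_attempts`, and `base_delay`, as well as the spool file naming, are assumptions.

```python
from __future__ import annotations

import json
import time
import uuid
from pathlib import Path
from typing import Callable


class BatchDeliveryError(Exception):
    """Raised after a batch has been spooled, so the caller can stop safely."""


def deliver_batch(
    batch: dict,
    send: Callable[[dict], dict],
    spool_dir: Path,
    max_attempts: int = 3,
    base_delay: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Send one batch; retry with exponential backoff, then spool to disk."""
    last_error: Exception | None = None
    for attempt in range(1, max_attempts + 1):
        try:
            # Expected to return the backend summary
            # (received / inserted / duplicates / failed counts).
            return send(batch)
        except Exception as exc:  # a real client would catch narrower errors
            last_error = exc
            if attempt < max_attempts:
                # Waits base_delay, 2 * base_delay, 4 * base_delay, ...
                sleep(base_delay * 2 ** (attempt - 1))

    # All attempts failed: persist exactly one JSON file for this batch.
    spool_dir.mkdir(parents=True, exist_ok=True)
    spool_file = spool_dir / f"batch-{uuid.uuid4().hex}.json"
    spool_file.write_text(json.dumps(batch, ensure_ascii=False), encoding="utf-8")
    raise BatchDeliveryError(f"batch spooled to {spool_file}") from last_error
```

The caller would catch `BatchDeliveryError` at the top level, log it, and exit, leaving the spooled file for a later replay once the API is reachable again.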
Project ID: 40125551
114 bids
Remote project
Active 2 months ago

Hi Adrian, I’ve carefully analyzed your requirements for a production-grade Python scraper integrated with an existing Django backend, with a strong focus on reliability, configurability, and long-running unattended execution. I’ve built similar Playwright-based scraping systems that extract structured exam and content data, normalize it, and deliver it in secure batches to REST APIs with JWT auth, dynamic backoff handling, and disk spooling for failure recovery. The key aspects here are human-like scraping behavior, YAML-driven throttling, resilient selectors, robust error handling, and clean architecture shared between the scraper and Django backend. I’d implement a modular Playwright scraper with configurable delays, batch-based API delivery, structured logging, screenshot/HTML capture on failures, and safe shutdown with spool persistence, all designed to run stably on a low-cost VPS without triggering blocks. I’m available to begin work immediately and fully committed to delivering a clean, maintainable, and production-ready solution that prioritizes stability, observability, and long-term reliability over speed. Best regards, Elenilson
€500 EUR in 7 days
5.3
114 freelancers are bidding an average of €470 EUR for this job

⭐⭐⭐⭐⭐ Build a Reliable Web Scraper with Python and Django Integration ❇️ Hi My Friend, I hope you are doing well. I've reviewed your project requirements and noticed you're looking for an experienced Python engineer to develop a robust web scraper. Look no further; Zohaib is here to help you! My team has successfully completed 50+ similar projects for web scraping and backend integration. I will create a clean, maintainable, and resilient scraper using Playwright, following best practices to ensure it runs smoothly and consistently. ➡️ Why Me? I can easily build your web scraper as I have 5 years of experience in Python development, web scraping, REST API integration, and data normalization. My expertise includes Playwright, error handling, and logging. Additionally, I have a strong grip on Django, ensuring seamless backend integration and code clarity. ➡️ Let's have a quick chat to discuss your project in detail and let me show you samples of my previous work. I look forward to talking with you soon! ➡️ Skills & Experience: ✅ Python Programming ✅ Playwright Automation ✅ Django Development ✅ REST API Integration ✅ Data Normalization ✅ Error Handling ✅ Logging Implementation ✅ YAML Configuration ✅ VPS Execution ✅ Scraping Strategies ✅ Rate Limiting ✅ Modular Code Structure Waiting for your response! Best Regards, Zohaib
€350 EUR in 2 days
7.9

Hello, With over 10 years of experience in both Computer Science and Engineering, my team at Live Experts has developed a deep-rooted understanding in using robust Python Web Scrapers. We align with your requirement for a Rock-solid Playwright Scraper to gather structured data archives for your Django backend employing a secure API. Our expertise and adaptability with various technologies like Selenium would be advantageous for the project. Our comprehensive understanding of API integration along with long-running scrapers empowers us to exactly meet your needs, ensuring authentic and smooth operations. We specialize in developing clean, maintainable and efficient codes that prioritize reliability over speed—this is reinforced by our previous work which has consistently maintained high efficacy even on limited servers, avoiding detection and blocks. Utilizing our vast Linux/VPS execution experience, we will build an exceptionally stable scraper that behaves like a human user, respecting all restrictions and rate limits. Additionally, we'll set up performance logs/alerts ensuring visibility of any issues. Our skills and expertise in REST API integration (including JWT authentication) balanced with a powerful scraping engine resting heavily on Playwright would deliver irreversible precision to your project. Trust us to provide you not only an impeccable product but also a full network of documentation simplifying any future modificat Thanks!
€750 EUR in 2 days
7.6

Hello, I came across your project and found it truly interesting. With over eight years of hands-on experience in this field, I have successfully delivered high-quality solutions to clients worldwide. My dedication to excellence is reflected in the 180+ positive reviews from satisfied clients. I’d love to bring this expertise to your project and ensure outstanding results. However, I do have a few important points I’d like to clarify to align perfectly with your vision. Let’s connect via chat, where I can also share relevant examples of my past work. I'm looking forward to hearing back from you! Best Regards, Divu.
€750 EUR in 8 days
6.8

With over a decade of software development and API integration experience, I am confident in my ability to build your Python scraper to the specifications you've outlined. Though my background is more oriented towards embedded systems and IoT, my proficiency in Python and strong problem-solving skills align with the project requirements. I have the detail-oriented and meticulous approach needed to ensure that this scraper is a reliable, dependable, and safe system that respects throttling limits. Having been responsible for creating long-running scrapers in the past, I understand that stability should be prioritized over speed to avoid detection and blocking. This aligns with your vision for the project as well. My familiarity with rate-limiting and anti-blocking strategies will be an asset in delivering this project successfully on a low-cost VPS. Additionally, since I will also handle the Django backend, you can be assured of not just clean, maintainable, and resilient scraping code but also a well-integrated, modular architecture that keeps the code clear and easy to maintain.
€750 EUR in 15 days
6.4

Hi there, I’m excited about the opportunity to develop a robust Python web scraper for your project. As a top freelancer based in California, I have extensive experience crafting high-quality scrapers using Playwright, ensuring that they are both maintainable and resilient while adhering to best practices in API integration, particularly with Django backends. Your need for a scraper that behaves like a human user and respects throttling limits is precisely the kind of challenge I thrive on. I will implement a configurable YAML structure for the scraper’s operation, ensuring easy adjustments while maintaining consistent communication with your Django API. My approach will also include rigorous error handling and observability to guarantee reliability. Let’s discuss your specific requirements in more detail. I look forward to your message right away! How do you envision the structure of the YAML configuration file for the scraper?
€610 EUR in 12 days
6.2

With over a decade of experience in Python, REST API integration and web scraping, I am confident that my team and I at Web Crest can deliver the robust, production-grade solution you need for your project. We have extensive experience with long-running scrapers, rate-limiting strategies and anti-blocking mechanisms which will be vital in building a scraper that behaves like a human user and respects throttling limits. As you've mentioned stability is prioritized over speed, we'll ensure there are no brittle hardcoded XPaths or scraping shortcuts, for a clean and stable codebase. Not only are we proficient with the Playwright automation tool that you prefer, but we are also experienced with Selenium automation which can serve as a well-justified alternative for your project. Additionally, our expertise extends to providing crisp API integration solutions with JWT-based auth which aligns perfectly with your project's needs.
€500 EUR in 3 days
6.3

Hi there, I’m offering a 30% discount on my first project to develop a robust Python scraper using Playwright with API integration at an affordable rate. I specialize in creating reliable, fast, and scalable scraping solutions designed to extract data accurately and deliver it through structured APIs for seamless integration. Each project will be tailored to your requirements, including website automation, dynamic content handling, data formatting, error handling, and API endpoints for smooth consumption of the scraped data. I understand how important it is for scraping projects to balance efficiency, reliability, and compliance while maintaining clean and maintainable code. I can help with Playwright scripting, data extraction logic, scheduling, API development, and testing to ensure the scraper runs smoothly and delivers consistent results. My focus is always on providing Python scraping solutions that are robust, accurate, and fully integrated with your workflow. If you’re looking for a reliable professional who can build a Playwright-based Python scraper with API integration effectively, I’d be delighted to collaborate and deliver a solution that meets your requirements. Kind regards, Sohail Jamil
€250 EUR in 1 day
6.0

I can build a production-grade Python scraper integrated with your Django backend that handles exam-style question data reliably and safely. Using Playwright for automation, the scraper will behave like a human, respect configurable throttling, and run unattended on a VPS. The project will feature: • Structured, modular Python code with clean separation of concerns. • Batch-based API integration using JWT authentication, with backoff, retry, and spool-to-disk on failures. • Human-like delays & cooldowns, all configurable via YAML. • Robust selectors with fallbacks, avoiding brittle hardcoded XPaths. • Error handling & observability, including structured logs, screenshots, and HTML dumps on failure. • Full deliverables: config template, logging, Dockerfile, requirements, README, and automated batch submission logic. • Backend support for deduplication, batch acknowledgment, and error reporting. I have extensive experience building long-running scrapers that integrate with Django REST APIs and handle dynamic UIs in multiple languages, with strong logging and recovery mechanisms. Technical question: How do you want the scraper to handle partial failures within a batch—should it continue sending remaining valid questions, or halt the batch entirely until manual intervention?
€300 EUR in 5 days
4.7

As a seasoned Python engineer with a deep knowledge of web scraping, REST API integration, and proficiency in Playwright, I believe I am uniquely positioned to execute this project successfully. Throughout my 8+ years in the field, I have consistently demonstrated a commitment to delivering clean, maintainable code that aligns seamlessly with diverse systems, which is precisely what you require for this project that involves working on a Python-based scraper and contributing to your Django backend. My extensive expertise in rate-limiting and anti-blocking strategies will play a significant role in preventing blockages and ensuring that the scraper behaves like a human user. Understandably, parts of the project hold security-sensitive aspects - from API authentication using existing JWT endpoints to IP whitelisting - my commitment to delivering secure solutions aligns well with these requirements. Through enforcing error handling procedures such as capturing screenshots and HTML dumps and persisting batch data on failure, I can ensure transparency and effective failure recovery strategies are upheld throughout. Working with me guarantees not just an efficient orchestration of tasks but also quality deliverables that meet the specific requirements outlined in your pitch.
€500 EUR in 7 days
5.0

Got it. - robust code - clean code - easy to use - fast - secure. And the last one is careful. These are the attributes you need; let's talk more.
€800 EUR in 12 days
4.6

Hello there, I reviewed your project Python Scraper – Robust Playwright Scraper + API Integration (Fixed Project) and understood the requirements at a high level. I focus on delivering clear, stable, and maintainable solutions aligned with the actual scope. I can work with Python, web scraping, and Django, and I follow a clean development process with proper structure and error handling. If this aligns with what you’re looking for, please message me in chat to discuss further. Best regards
€250 EUR in 7 days
4.3

Hi there, I'm Kristopher Kramer from McKinney, Texas. I’ve worked on similar projects before, and as a senior full-stack and AI engineer, I have the proven experience needed to deliver this successfully, so I have strong experience in Python, Django, API Integration, Automation, Web Scraping, Data Extraction, REST API and VPS. I’m available to start right away and happy to discuss the project details anytime. Looking forward to speaking with you soon. Best regards, Kristopher Kramer
€500 EUR in 7 days
4.6

Hi, I am an eCommerce web developer with 8 years of experience building conversion-focused online stores. I am familiar with Shopify, Shopify development, eCommerce website design, HTML, SEO, dropshipping workflows, product research, payment gateway setup, and mobile-first web design. For this project, I would build a clean and trustworthy baby-themed Shopify store from scratch, customize a suitable theme to match the soft, modern brand style, import and optimize curated baby products, configure payments and shipping, and prepare the store for launch with SEO basics, tracking integrations, and clear setup documentation. I'm an individual freelancer and can work in any time zone you prefer. Please contact me with the best time for you to have a quick chat. Looking forward to discussing more details. Thanks. Emile.
€250 EUR in 7 days
3.9

As an experienced Python engineer, my skills are perfectly aligned with the needs of this project. With a deep understanding of web scraping, automation, and data extraction, I am confident in my ability to build a robust, production-grade scraper using Playwright that aligns perfectly with your requirements. My experience in handling long-running scrapers and familiarity with rate-limiting and anti-blocking strategies will ensure that the scraper not only retrieves the required data but does so seamlessly without triggering any blocks or causing instability. My vast experience in API integration will come in handy when developing the backend with Django for your project. I have worked extensively with REST APIs and JWT-based authentication ensuring all security aspects are handled proficiently. Additionally, my experience working on front-end as well as back-end will ensure that the code clarity, architecture, and consistency - crucial aspects of this project - are maintained throughout.
€400 EUR in 7 days
3.9

Hi! I can build a production-grade Python scraper using Playwright that’s stable, human-like, and designed for long-running unattended execution. I’ll focus on clean architecture, robust selectors, YAML-based configuration, structured logging, and reliable batch delivery to your Django REST API with JWT auth, including backoff handling and disk spooling on failure. I have strong experience with Playwright, API integrations, throttling strategies, and VPS deployments, and I’m comfortable delivering well-documented, maintainable code for fixed-price milestones.
€250 EUR in 1 day
4.0

Hi, how are you doing? I’ve read your brief carefully and see you’re looking for a production-grade, human-like scraper built with Playwright that reliably extracts structured exam questions and delivers them to a Django backend in controlled batches. The real priority here is long-term stability, clean architecture, and safe unattended execution rather than raw scraping speed. I’d structure this as a modular Python project with Playwright handling interaction, YAML-driven throttling and module control, and a robust API client layer with JWT auth, backoff handling, and disk spooling on failure. I’ve built long-running scrapers and backend-integrated pipelines like this before, focusing on selector resilience, observability, and safe recovery paths on VPS environments. Do you already have a preferred format for the batch payload schema, or should I align it strictly to your existing Django serializers? Best regards, Alesha
€500 EUR in 7 days
4.0

Hello! I can build this as a production-grade, long-running scraper, not a brittle one-off. The focus would be stability, observability, and clean integration with your Django backend exactly as outlined. How I’d approach this • Use Playwright (Python) with human-like interaction patterns and resilient selectors (no fragile XPaths) • Modular architecture: scraper core, site adapters, throttling engine, API client, and spool manager clearly separated • YAML-driven configuration for all behavior (modules, delays, cooldowns, auth, runtime flags) • Batch-based ingestion into your Django REST API with JWT auth + IP whitelisting • Respect backend feedback (next_backoff_seconds) dynamically • Robust retry logic with exponential backoff; automatic disk spooling on persistent API failure • Structured logging (INFO/WARN/ERROR) suitable for unattended VPS execution Reliability & observability • Screenshot + HTML capture on scrape failures • Safe aborts with clear diagnostics • One JSON spool file per failed batch for clean recovery • Designed to run for days/weeks on a low-cost VPS without triggering blocks Why this works I design scrapers like backend services: defensive, observable, and predictable under failure. Speed is secondary to long-term stability. If you’d like, we can first align on milestones (scraper core → API integration → hardening & soak tests) and lock scope before starting. Best regards, Jasmin
€500 EUR in 7 days
4.1

Hi, I build robust, long-running Python scrapers with Playwright, designed to behave like a human and handle throttling, retries, and backoff. I’ll deliver structured exam-style question data to your Django REST API securely in batches, with full logging, error handling, and spool persistence. Configuration is entirely YAML-based for modules, delays, and runtime options. Project includes Docker support, modular code, and clear documentation for setup and failure recovery. I focus on stability, maintainability, and reliability, ensuring unattended VPS execution and seamless backend integration. Best
€450 EUR in 6 days
4.0

Hi Adrián, I am excited about the opportunity to work on the Python Scraper project you have outlined. With my extensive experience in Python development and web scraping, I am confident in my ability to deliver a robust and reliable solution that meets your requirements. In my portfolio, you will find examples of previous projects where I have successfully implemented web scraping solutions using Playwright and Selenium. I have also integrated REST APIs securely, ensuring data transmission is efficient and reliable. Two technical questions I have regarding the project are: 1. How will the scraper handle dynamic content loading and pagination on the target platform? 2. Can you provide more details on the expected structure of the data to be extracted for backend ingestion? I am eager to collaborate on this project and deliver a high-quality solution that exceeds your expectations. Please find my portfolio for your review: https://www.freelancer.com/u/Mirza25092003 Best regards, Abdul
€250 EUR in 7 days
3.3

Hi, we would like to take this opportunity and will work until you are 100% satisfied with our work. We are an expert team with many years of experience in Python, Web Scraping, Django, VPS, Data Extraction, Automation, API Integration, and REST API. Please message us in chat to discuss your requirements in detail. Thank you
€500 EUR in 7 days
3.4

Cartagena, Spain
Payment method verified
Member since Dec 23, 2025