
Awarded
Posted
Paid on delivery
I have roughly ten thousand ISBN-13 codes and I need a production-ready Python pipeline that can take those codes, pull the corresponding book details from [login to view URL], [login to view URL], and a small set of external APIs, then push the cleaned results straight into a Google Sheets workbook. The pipeline must • survive Amazon’s throttling, bot checks, and page format changes without manual babysitting, • finish a full run on 10 k titles in a single session without crashing or silently skipping rows, and • give me fields that are already matched and normalised so downstream staff can link them to our catalogue instantly. Architecture is up to you: Scrapy, Playwright, headless Chrome, rotating residential proxies, Selenium, or a custom HTTP solution—whichever mix keeps the request footprint human-like and maximises up-time. What matters is that the codebase is clean, well-documented, and easy for an internal engineer to extend later. Deliverables 1. Fully annotated Python source (PEP 8 compliant) packaged so I can run it with one command. 2. A Google Sheets connector that inserts or updates rows atomically, preserving formulas already in place. 3. README with environment setup, proxy configuration, and step-by-step deployment instructions for macOS and Ubuntu. 4. Brief test report showing a run on at least 300 sample ISBNs, including elapsed time, success rate, and any retries triggered. Acceptance will be based on: • ≥ 98 % scrape success on the 300-item test set, • no Amazon “bot detected” blocks during that run, and • correctly populated Google Sheets in the agreed format. If you have proven experience scraping Amazon at scale and piping results into Google Sheets, I’m ready to review your plan and timeline.
Project ID: 40425219
78 proposals
Remote project
Active 11 hours ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
78 freelancers are bidding on average $85 USD for this job

Hello there, I am experienced in web scraping and building scripts or a Windows desktop application using Python. I am also experienced in large data scraping from a given website, bypassing IP, Captcha, and anti-bot or cloud flair protection. Please message me to discuss this project in detail. Best Regards Enamul
$30 USD in 2 days
8.2
8.2

Hello!!==>> I can build a production-ready Python pipeline that processes 10,000 ISBN-13 codes, extracts book data from Amazon (Japan & US) and external APIs, and directly syncs clean, normalized results into Google Sheets. The system will be designed to handle throttling, bot protection, retries, and large-scale execution reliably in a single run without data loss. I will also deliver a clean, documented codebase with setup guide, proxy configuration, and a tested sample run meeting your success criteria.
$150 USD in 2 days
6.1
6.1

https://www.freelancer.com/projects/data-scraping/Automated-Counterfeit-Detection/reviews Dear. Nice to meet you. I am very pleasure to submit my proposal on your scrapping and automation project. I have many experiences in these field using python. Recently, I developed Automated Counterfeit Detection and Reporting System on Amazon. You can check this in my portfolio. I am sure and I can start immediately. I will wait for your good news. Thank you.
$20 USD in 1 day
5.8
5.8

Hi, I have strong experience building large-scale Python scraping pipelines with Playwright/Scrapy, proxy rotation, anti-bot handling, and Google Sheets integrations. I can build a production-ready ISBN pipeline that processes 10k+ ISBNs reliably, normalizes the book metadata, and updates Google Sheets safely without breaking formulas. ✔ Amazon scraping with retry handling, throttling protection & session management ✔ Google Sheets API integration with atomic updates ✔ Clean, modular, documented Python code (PEP8 compliant) ✔ Tested workflows with reporting, logging, and recovery support ✔ macOS & Ubuntu deployment instructions included I can start immediately and provide an initial 300-ISBN test run quickly. Looking forward to discussing the architecture and timeline.
$20 USD in 1 day
5.5
5.5

Scraping 10k ISBNs from Amazon in a single session triggers bot checks almost immediately without a rotating proxy layer. I've architected pipelines handling 150+ external integrations where reliability mattered more than speed. The key here isn't just fetching data. It's handling the failures gracefully so your sheet stays consistent. I recommend Playwright with stealth plugins alongside residential proxies to mimic human behavior. For the Google Sheets connector, I'll implement batch updates to avoid rate limits on their end too. Clean code is a priority. I will deliver fully annotated Python source packaged so you can run it with one command as requested. Production-ready scraping usually incurs proxy costs beyond the development fee. Are you already set up with a proxy provider, or do you need recommendations on that front?
$25 USD in 7 days
5.5
5.5

I’ll build a resilient Python pipeline (Playwright/Scrapy + API fallbacks) that processes 10k ISBNs, normalises book metadata from Amazon + external APIs, and syncs atomically into Google Sheets with retry queues, logging, and proxy-aware throttling handling. Clean PEP8 code, deployment docs, and tested sample-run report included for stable large-scale execution
$30 USD in 1 day
5.4
5.4

As a seasoned Full-Stack & Mobile App & CRM & Artificial Intelligence Expert, I have gained immense experience delivering high-performance applications for both web and mobile platforms over the last 8 years. In line with your requirements, I specialize in architecting scalable and efficient digital pipelines that can successfully handle the scale of scraping operations you need. In terms of technical skills, my expertise in Python as a backend language is complimented by a solid foundation in cloud computing, specifically AWS. This expertise coupled with my knowledge of containerization (Docker and Kubernetes) and DevOps (CI/CD) will ensure a robust, scalable, and easy-to-integrate solution that will remain functional amidst Amazon's frequent throttling measures or page format changes.
$20 USD in 7 days
4.9
4.9

Hi, this fits well with my Python scraping and data pipeline work. The real challenge here isn’t just fetching book pages — it’s making the run reliable enough that 10k ISBNs don’t create missing rows, duplicate writes, or messy catalogue data. I’d build this as a resumable Python pipeline with retry tracking, source priority rules, normalisation, and a safe Google Sheets upsert layer that preserves existing formulas. For a similar workflow, I built a scraper that pulled messy supplier data from multiple sources, cleaned inconsistent fields, and pushed only validated rows into a reporting sheet so staff didn’t have to fix records manually afterward. I’d start with a 300-ISBN test batch to confirm Amazon/API coverage, field mapping, throttling behaviour, and Sheets update logic. The main risk is Amazon blocking or changing markup, so I’d avoid brittle selectors where possible, use API fallbacks, structured retries, checkpoints, and clear failure logs instead of silently skipping anything. Thanks!
$20 USD in 7 days
3.8
3.8

Hi there, I am a Python Programmer since last 16 years and developed around 500+ scrappers and applications for my clients. I have understood your requirements and can create this pipeline for you within short span of time. Lets connect.
$100 USD in 1 day
3.8
3.8

Hey , I just finished reading the job description and I see you are looking for someone experienced in Selenium, API Integration, Software Architecture, Google Sheets, Data Mining, Web Scraping, Scrapy and Python. This is something I can do. Please review my profile to confirm that I have great experience working with these tech stacks. While I have few questions: 1. These are all the requirements? If not, Please share more detailed requirements. 2. Do you currently have anything done for the job or it has to be done from scratch? 3. What is the timeline to get this done? Why Choose Me? 1. I have done more than 250 major projects. 2. I have not received a single bad feedback since the last 5-6 years. 3. You will find 5 star feedback on the last 100+ major projects which shows my clients are happy with my work. Timings: 9am- 9pm Eastern Time (I work as a full time freelancer) I will share with you my recent work in the private chat due to privacy concerns! Please start the chat to discuss it further. Regards, Abdul Haseeb Siddiqui
$10 USD in 3 days
3.8
3.8

Hi, how are you doing? I’ve built robust Python pipelines for large-scale data extraction and clean delivery to Google Sheets, with resilient crawling that handles throttling and schema changes, plus clean, annotated code and a ready-to-run package. I can deliver a fully documented setup, a Google Sheets integration that updates atomically, and a 300-item test run with a concise report. Let me know further if interested
$30 USD in 5 days
3.4
3.4

Hi There , Good morning! I am skilled mobile engineer with skills including Selenium, API Integration, Python, Data Mining, Web Scraping, Scrapy, Software Architecture and Google Sheets. Please send a message to discuss more about this project. Hope to hear from you soon
$10 USD in 2 days
2.6
2.6

Hello, there, I’ll approach this as a production-ready, maintainable Python pipeline that reliably handles 10k ISBNs and yields ready-to-use, normalized book data for Google Sheets. The plan emphasizes resilience to Amazon throttling and format shifts, with clear, documented boundaries for extension by an internal engineer. A key risk is Amazon’s bot checks and throttling, which can silently skip rows if not handled thoughtfully. My approach uses a combination of headless browser tactics and API fallbacks with controlled pacing, robust error handling, and explicit retries. I’ll ensure a clean, well-documented codebase and an atomic Google Sheets updater that preserves existing formulas while upserting rows, so downstream staff can link records instantly. From prior experience, I’ve built modular scrapers that respect rate limits, isolate transient failures, and normalize fields early in the pipeline. A practical improvement is an idempotent upsert layer with per-ISBN state tracking and a small in-process queue to manage retries without thrashing the target sites or Google Sheets. Thanks, Jim.
$20 USD in 1 day
2.8
2.8

Hello, I'm Cindy Viorina, an experienced Python developer with a proven track record in building robust web scraping pipelines. I understand you require a production-ready solution to scrape book details from multiple sources using a set of ISBN-13 codes, and I'm confident I can deliver exactly what you need. To tackle the challenges posed by Amazon's throttling and bot checks, I propose using a combination of Scrapy and headless Chrome with rotating proxies. This architecture will ensure that we maintain a human-like request footprint while achieving high uptime and reliability. I will ensure the codebase is clean and well-documented, adhering to PEP 8 standards, making it easy for your internal team to extend and maintain. I can communicate in real-time according to your time zone and can provide a simple demo of the solution within 12 hours of commencement. Q1: What specific fields do you want included in the Google Sheets output? Q2: Are there any specific APIs you prefer for additional book details? Q3: What is your preferred deadline for this project? I look forward to discussing this further! Can you clarify the preferred date for project completion?
$10 USD in 7 days
2.8
2.8

Yes! You are on the right bid. I have read all project details and descriptions regarding Python Book Data Scraping Pipeline I will save your time by letting my work speak for you. If I am lucky enough to get your attention, please feel free to reach me so we can spend 10-15 minutes and discuss everything ;) You can check my portfolio and reviews regarding your Project: https://www.freelancer.pk/u/Q@d33rM3hdi Best regards! Qadeer Mehdi!
$15 USD in 1 day
2.4
2.4

Hi, I can build a production-ready Python scraping pipeline for your ISBN workflow using a scalable custom architecture with proxy rotation, retry handling, session persistence, and Google Sheets synchronization designed for long uninterrupted runs. I can structure it to handle Amazon rate limits safely while keeping the scraped data normalized, validated, and easy for your internal team to extend later. Do you already have preferred proxy providers or should I recommend a stable residential setup? Also, which exact book fields need to be normalized into Google Sheets besides title, author, and pricing? And should failed ISBNs retry automatically in later passes or be logged separately for manual review? Best regards, Muzammil
$20 USD in 7 days
2.4
2.4

Hello, To be honest, this would be my first data scraping project. I haven’t worked on large-scale scraping before, but I’m genuinely interested in learning and giving my full effort to this work. If you can give me one opportunity and a little guidance in the beginning, I’ll do my best to complete the project properly and responsibly. I’m ready to spend extra time and effort to make sure the work is done correctly. If you want, you can first give me a small demo or test task. Once you are satisfied with the results, then you can approve the full project. I may be new in this particular field, but I’m serious about the work and willing to learn whatever is needed to complete it successfully. Regards Himanshu bisht
$12 USD in 7 days
1.9
1.9

Hi, ⭐15+ Yrs Sr Developer here⭐ I can build a production-ready Python pipeline that processes your ISBN-13 list, enriches book data from approved APIs and Amazon sources, normalizes the fields, and updates Google Sheets safely. I’d structure it with retries, rate limiting, logging, checkpointing, duplicate handling, and row-level status tracking so a 10k-title run can resume cleanly instead of silently losing records. For reliability, I’d prioritize official/external book APIs where possible, then use Scrapy/Playwright only where needed with a careful request strategy that respects site limits. The Google Sheets connector can insert or update rows atomically while preserving existing formulas and keeping downstream catalog matching clean. You’ll receive documented Python code, setup instructions for macOS/Ubuntu, proxy/API configuration notes, and a 300-ISBN test report with success rate and retry details. If you think I am a good fit, feel free to ping me anytime. — GAZMIR
$20 USD in 1 day
1.6
1.6

I'm a certified AI, Python Automation & Data Analyst specialist with hands-on experience in web scraping, Selenium, Playwright, Flask, n8n workflow automation, and data analysis using Python, R, Pandas, and NumPy. I don't just deliver code — I deliver working solutions that save your time and reduce manual effort. I hold certifications in AI Development (IBM) and Python Automation & Data Science (Coursera & Packt), so you can trust that my work is professional and up to standard. I'm available to start immediately, communicate regularly, and will not close the contract until you are 100% satisfied. Let's discuss your project — feel free to send me a message!
$30 USD in 7 days
1.1
1.1

I understand the importance of building a robust and efficient Python data scraping pipeline for your project. With a proven track record in web scraping, particularly from Amazon and integrating data into Google Sheets, I am well-equipped to deliver a solution that meets your requirements. My approach will involve using a combination of Scrapy and Selenium to ensure we can navigate Amazon's defenses while maintaining a human-like request footprint. I will implement rotating proxies and thorough error handling to ensure that the pipeline runs smoothly and can handle all 10,000 ISBNs in a single session without issues. You will receive fully annotated, PEP 8 compliant code that is easy for your team to extend, along with a Google Sheets connector that updates data atomically. A detailed README will guide you through the setup, and I will provide a test report demonstrating the pipeline's effectiveness on a sample set. I aim for a ≥ 98% success rate with no detection issues during scraping. Let's work together to turn your vision into reality.
$20 USD in 14 days
0.6
0.6

United States
Payment method verified
Member since Feb 11, 2026
$10-30 USD
$30-250 USD
$10-30 USD
$30-250 USD
$20-40 USD
₹12500-37500 INR
$250-750 USD
₹12500-37500 INR
₹600-1500 INR
₹750-1250 INR / hour
€30-250 EUR
$10-200 USD / hour
€30-250 EUR
$250-750 USD
$30-250 USD
$30-250 AUD
$10-50 USD
$10-30 USD
€30-250 EUR
₹1500-12500 INR
$30-250 USD
₹750-1250 INR / hour
$10-30 AUD
₹12500-37500 INR