
Closed
Posted
Paid on delivery
I need a repeatable Python-based scraping pipeline that pulls historical data from [login to view URL] and then continues to run weekly. Scrapy is strongly preferred because I want to spin the whole thing up inside a GitHub Codespaces Docker container and keep deployment friction to a minimum.

Scope of data
• Players page first: full player-level stats are the initial priority.
• Draw page next: I specifically need match dates, kick-off times and the team & player stats embedded in each fixture.
• Ladder and any other public stats pages can follow once the core player and draw feeds are solid.

Data model & output
All raw HTML should be parsed into tidy, flat tables (CSV or Parquet). Please create sensible surrogate keys so that tables for players, matches, teams and ladder positions can be joined cleanly in a downstream warehouse.

Deliverables
• Scrapy project with clearly named spiders for Players, Draw, Ladder and Stats
• Dockerfile / [login to view URL] so the whole thing launches in a Codespace with one click
• README that shows me the command to run a full historical scrape and the command for an incremental weekly scrape
• Sample output files proving the schema and join keys work

Acceptance
I’ll run the spiders in a fresh Codespace: if they complete without errors, produce the stated tables as files and the keys line up across tables, the job is done.
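The surrogate-key and "keys line up across tables" requirements above could be checked along these lines. This is only a minimal sketch using the Python standard library: the helper names `surrogate_key` and `orphan_keys`, the column names, and the sample rows are all hypothetical, and which natural-key fields actually identify a player or team on the site is an assumption that would need confirming against the real pages.

```python
import hashlib


def surrogate_key(*parts: str) -> str:
    """Build a deterministic key from the natural-key parts of a row."""
    # Normalise case and whitespace so " Team A " and "team a" map to one key.
    normalised = "|".join(" ".join(p.split()).lower() for p in parts)
    return hashlib.sha1(normalised.encode("utf-8")).hexdigest()[:16]


def orphan_keys(child_rows, fk, parent_rows, pk):
    """Return foreign-key values in child_rows that have no parent row."""
    parent_ids = {row[pk] for row in parent_rows}
    return sorted({row[fk] for row in child_rows} - parent_ids)


# Hypothetical rows standing in for parsed spider output.
teams = [{"team_name": "Team A"}, {"team_name": "Team B"}]
for team in teams:
    team["team_id"] = surrogate_key(team["team_name"])

players = [
    {"player_name": "Example Player", "team_name": " team a "},
    {"player_name": "Another Player", "team_name": "Team B"},
]
for player in players:
    # Assumption: name plus team is enough to identify a player in this sample.
    player["team_id"] = surrogate_key(player["team_name"])
    player["player_id"] = surrogate_key(player["player_name"], player["team_name"])

# An empty list means every player row joins to a team row.
print(orphan_keys(players, "team_id", teams, "team_id"))
```

Hashing normalised natural keys keeps the identifiers stable between the historical run and later weekly runs, so re-scraped rows join to existing ones without a central ID counter.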
Project ID: 40185111
66 proposals
Remote project
Active 1 month ago
66 freelancers are bidding an average of $159 AUD for this job

⭐>>-- Scraping Boss is here--<<⭐ I totally understand what you want. I have S_T_R_O_N_G experience in web/data scraping and crawling. I can scrape any website and overcome all anti-bot policies like reCAPTCHA, IP detection, etc. Just click the "Chat" button for further discussion. Thank you.
$150 AUD in 1 day
6.9

Hello! As a seasoned data engineer, I specialize in building robust, containerized Scrapy pipelines for sports analytics, with over 9 years of Python experience. Your vision for a one-click GitHub Codespace solution is exactly the kind of low-friction, repeatable system I deliver. Here's how I can help:
- Build a Scrapy project with dedicated spiders for Players and Draw data first, outputting to tidy Parquet/CSV with clean surrogate keys.
- Provide a complete Docker/Devcontainer setup for instant Codespaces launch, plus clear commands for full/incremental scrapes.
- Deliver a normalized data model so your player, match, and team tables join seamlessly downstream.
$140 AUD in 4 days
6.6

Hey there, Glane here, hope you're doing well. I can help you scrape the desired data from the given website via headless Selenium and will push the resulting files to GitHub with a Docker container.
$250 AUD in 7 days
6.2

As a Top 0.03% ranked freelancer, I have dedicated my career to creating efficient and reliable solutions through data mining and web scraping, which makes me an excellent fit for your NRL Data Scraping Pipeline project. Not only do I specialize in Python-based scrapers like Scrapy, but I also have a firm grasp on full player-level stats, match-based data, and public stats from various sports websites. Moreover, my proficiency with Docker containers and GitHub Codespaces will help ensure that your scraping pipeline is not only robust but easily deployable. I ensure that all raw HTML will be parsed into tidy, flat tables while maintaining sensible surrogate keys for clean data joins. For your downstream warehouse needs, consider it done! Lastly, my focus when crafting READMEs revolves around simplicity and clarity, providing you with comprehensive instructions to run historical and incremental scrapes with ease. My passion for clean and effective data-driven solutions ensures that the end product meets the highest standards of quality as requested. Let's rock this project together and deliver exactly what you envisioned!
$140 AUD in 7 days
6.4

Hi, We’ve built similar data pipelines for sports data, including scraping player stats and match results from multiple sources. We also developed a dedicated web app to manage and visualize this data, which was later integrated with a mobile app. For your project, I suggest using a dedicated web app instead of a GitHub Codespace. This way, we can set up a fully managed environment with CI/CD, scheduled tasks, and more, while keeping costs low. We can also use a single codebase for both web and mobile apps, saving you time and money. Let’s schedule a 10-minute call to discuss your project in detail and see if I’m the right fit. I usually respond within 10 minutes. I’m eager to learn more about your exciting project. Best regards, Adil
$154.73 AUD in 7 days
6.0

Hello, I hope you are doing great. I am an expert in web scraping and can easily scrape all the target data from the website using Python or any other script, so you don't have to spend any time or effort doing it manually. Plus, I provide quality results quickly and efficiently within your budget. Let's connect through chat for a more detailed discussion; I can start the work right after it. Thank you, Gaurav D.
$250 AUD in 7 days
6.4

Hi there, Python Hub here. I have read the project description and am willing to start work. Please come to the chat box so we can easily discuss the details. Thank you.
$180 AUD in 1 day
5.6

I’ve built Scrapy-based pipelines with clean data models, joinable keys, and Docker/Codespaces setups for weekly and historical sports data scrapes. I can deliver modular spiders for players, draw, and ladder with tidy CSV/Parquet outputs and a frictionless one-click Codespace run.
$100 AUD in 1 day
5.1

Hi, there! My name is Ian Brown, and I’d be happy to help with your project. I can provide a clean, reliable solution tailored to your needs, keeping everything simple, efficient, and easy to use. My goal is to streamline your workflow, save time, and deliver results that fit smoothly into your existing process. I’m ready to jump in and help make your project run as smoothly as possible!
$140 AUD in 7 days
4.7

With my extensive 8+ years of experience in data analytics and science, I can confidently harness everything Python, Docker, and Web Scraping to build an efficient, highly repeatable NRL Data Scraping Pipeline specifically tailored to fulfill your requirements for a Scrapy-based solution. My familiarity with Scrapy is a key advantage, as it ensures a quicker and frictionless deployment using your desired GitHub Codespaces Docker container setup. To appreciate the importance of organizing data in a structured manner, I’ve delved deep into data modeling and manipulation using Python libraries such as Pandas and SQLAlchemy. Ensuring all raw HTML is parsed into tidy, flat tables (CSV or Parquet) with appropriate surrogate keys for easy joining in your downstream warehouse is part of this package. My in-depth knowledge of ETL processes and various analytics tools such as SQL Server, BigQuery (GCP), and Snowflake can also be invaluable. Moreover, given my prior experience executing effective ML-driven solutions to optimize operations and forecast outcomes across several industries including finance and e-commerce, you gain additional insights leveraging my skills with TensorFlow and PyTorch. Finally, my commitment to a clear understanding of client expectations is reflected in my ability to create accurate project documentation as well as thorough testing for successful project acceptance. Don't hesitate to let me help you unlock the full power of your data!
$140 AUD in 7 days
4.3

Hi, I have over two years of experience building repeatable scraping pipelines in Python using Scrapy and Docker. I can create a clean Scrapy project with structured spiders, joinable schemas, and a Codespaces-ready setup for both historical and weekly runs. You will get a ready-to-run Docker setup, documented commands, and sample outputs for validation. Happy to start and review the target pages with you.
$100 AUD in 6 days
4.4

⭐ Hello there, My availability is immediate. I read your project post on Python Developer for NRL Data Scraping Pipeline. I am an experienced full-stack Python developer with skill sets in:
- Python, Django, Flask, FastAPI, Jupyter Notebook, Selenium, Data Visualization, ETL
- React, JavaScript, jQuery, TypeScript, NextJS, React Native
- NodeJS, ExpressJS
- Web App Development, Data Science, Web/API Scraping
- API Development, Authentication, Authorization
- SQLAlchemy, PostgresDB, MySQL, SQLite, SQLServer, Datasets
- Web hosting, Docker, Azure, AWS, GCP, Digital Ocean, GoDaddy
- Python Libraries: NumPy, pandas, scikit-learn, tensorflow, etc.
Please send a message so we can quickly discuss your project and proceed further. I am looking forward to hearing from you. Thanks
$230 AUD in 3 days
4.3

Hi there, I’m Robert, a Senior Full-Stack & AI Engineer with over 10 years of experience architecting and delivering SaaS platforms, automation systems, and intelligent applications, specializing in web scraping, data modeling, and Python. I have developed a Python-based scraping pipeline using Scrapy for various clients, effectively extracting and processing complex datasets with high accuracy. My extensive background in data engineering aligns perfectly with your needs for a robust NRL data scraping pipeline. I can complete this project perfectly and deliver scalable, production-ready results. I’m committed to clean architecture, structured documentation, CI/CD automation, and OWASP-based security practices, ensuring that your data is handled efficiently. Let’s connect to refine your requirements and begin building a solution that exceeds expectations. What specific scheduling frequency do you envision for the pipeline after the initial setup?
$220 AUD in 10 days
2.7

I can build a robust Scrapy pipeline that scrapes historical and weekly NRL data, covering players, draws, teams, ladder, and stats. The project will include well-structured spiders, clean CSV/Parquet outputs with joinable surrogate keys, and a Dockerized GitHub Codespaces setup for one-click deployment. I’ll also provide a clear README with commands for full historical and incremental weekly runs, plus sample output files to verify schema and keys. Delivery will ensure error-free execution and ready-to-use tables for downstream analytics.
$90 AUD in 1 day
2.7

Hi, I can build a clean, repeatable Scrapy-based pipeline that runs smoothly inside GitHub Codespaces with Docker. I’ve delivered production-ready scraping systems with proper data models, surrogate keys, and incremental scheduling. I’ll structure clear spiders, tidy flat outputs (CSV/Parquet), and thorough documentation so you can run historical and weekly scrapes confidently with zero setup friction.
$100 AUD in 7 days
3.1

As an experienced web scraper and a seasoned Python developer, I come armed with the exact skills needed for your NRL Data Scraping pipeline. I've worked extensively with Scrapy, and deploying it within Docker containers, as you've mentioned, excites me for its efficiency. My proficiency in Data Mining, Processing and Scraping will be key to creating tidy and accurate tables - exactly what you need to analyze NRL data effectively. I have a strong emphasis on delivering clean code while also ensuring scalability and high performance. By using my expertise in designing efficient databases, I'll create a logical schema and well-thought-out join keys to enable you to gather valuable insights from player-level stats and the fixtures' embedded team & player stats, among others. Beyond crafting the pipeline, I’ll provide a detailed README that'll enable you to run both historical and weekly scrapes. Additionally, I’ll present sample output files for you to verify whether the schema and join keys align seamlessly as expected. For me, your satisfaction is paramount - this means producing error-free tables as files of all essential feeds from raw HTML so that the migration into any downstream warehouse is seamless. Let me handle this; I'll ensure the job's done successfully!
$150 AUD in 3 days
2.4

Hello there, I read "NRL Data Scraping Pipeline" description "I need a repeatable Python-based scraping pipeline that pulls historical data from nrl" thoroughly. Your project looks straightforward and achievable without over-engineering. We focus on practical, scalable web solutions, whether it’s a quick fix or full build. Happy to start immediately and keep things smooth. Thanks, Bravix
$100 AUD in 2 days
1.6

I appreciate the opportunity to work on your Python-based scraping pipeline for nrl.com. Your focus on a well-structured, seamless Scrapy setup within a GitHub Codespaces Docker container highlights the need for a professional, integrated solution that minimizes deployment friction. I may be new to Freelancer, but I bring solid experience to the table in building user-friendly scraping pipelines with Scrapy, Docker, and data modeling to deliver tidy, joinable datasets. I’m happy to offer a free call to go over the project if you would like. Regards, Blaze Nicholas
$100 AUD in 14 days
0.8

As an avid sports enthusiast and a seasoned data scraper, I'm the perfect fit for your NRL scraping pipeline project. I bring years of experience developing highly optimized bots that integrate well with platforms like Scrapy and Docker, ensuring efficient deployment and maintenance. My proficiency in algorithmic design and data analysis will contribute to the creation of robust table structures with accurate surrogate keys. This will enable you to easily join tables for players, matches, teams and ladder positions downstream without any complications. Moreover, I have worked under strenuous market conditions and produced consistent results, so you can rely on me to provide a reliable solution that adapts to dynamic needs, such as weekly incremental scraping.
$140 AUD in 7 days
0.5

Hello, I’m Ankur, a freelance developer with a dedicated team of professionals. I read all your requirements for Website and I assure you that I will provide high-quality work at the proper time. Additionally, we also provide you 3 months of support from our side. As a Full Stack Developer, I specialize in Web and App Development, boasting a portfolio of stunning projects with top-notch UI/UX design. My expertise spans Flutter (for both Android and iOS), PHP, and WordPress, and I bring over 7 years of experience to the table. Whether it’s websites, applications, or e-commerce platforms, I’ve got you covered. But I’m not limited to just coding. My skill set extends to graphic design and logo creation, offering you a one-stop solution for all your project needs. With a track record of over 500 completed projects, I am committed to delivering nothing short of excellence. My ultimate goal is your complete satisfaction. Thank you for considering me for your project. I’m ready to transform your vision into a reality that stands out in today’s competitive landscape. Best Regards, Ankur Hardiya
$140 AUD in 7 days
0.2

Sydney, Australia
Payment method verified
Member since Oct 11, 2022