
Closed
Posted
Paid on delivery
I need a Python-based solution that automatically gathers companies and shareholders data, pulls supplementary details via external APIs, and outputs a clean, unified dataset I can query at any time. Scope of the scrape • Sources: company websites, financial databases and relevant public records. • Website focus: company profiles, turnover figures and any available Demat / share-holding particulars. What the tool should do 1. Crawl or call the above sources, respecting [login to view URL] and rate limits. 2. Parse the required fields, normalise names and IDs, then enrich each record through one or more APIs (for example OpenCorporates, Clearbit or any better suggestion you have). 3. Store results in a structured format (CSV plus an SQLite or Postgres option). 4. Offer a simple command-line trigger as well as a callable function so I can integrate it into larger workflows later. 5. Log activity and errors clearly. Tech stack Python 3.x with common libraries such as Requests, BeautifulSoup or Scrapy, Pandas and an ORM (SQLAlchemy is fine). If Selenium or Playwright is unavoidable for dynamic pages, please factor that in. Acceptance criteria • Full source code with virtual-env requirements file. • Sample run that fetches at least 30 real company records, shows enrichment working and saves the combined dataset. • README explaining setup, usage and how to swap in new API keys or data sources. Let me know your approach, estimated timeline and any previous work scraping financial/company data so I can move forward quickly.
Project ID: 40293397
17 proposals
Remote project
Active 4 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
17 freelancers are bidding on average ₹19,941 INR for this job

One Stop Solution for Data Intensive Tasks: Being a seasoned Full-Stack Developer with a striking 7+ years of experience, I have proven expertise in churnings tons of data into meaningful insights. Data processing and scraping is one of my core skills, and I have hands-on experience with Python libraries such as BeautifulSoup, Scrapy, SQLAlchemy which aligns perfectly to what you need. Furthermore, my niche command over PHP can be utilized strategically if Selenium or Playwright is unavoidable for scraping dynamic pages. Quality & Value Driven Delivery: I understand the essence of clean code, optimized performance, long-term maintainability, and their far-reaching impacts on your project. Besides delivering the requisite source code for the solution, I'll provide detailed documentation explaining setup, usage instructions, and readying it for future expansion i.e., swapping new APIs or adding data sources. Reaping full value from your high-volume data isn't just about acquiring it; it's leveraging it optimally. Hire me as your Python Scraping Ninja to assure effortless acquisition with impeccable quality! Let's connect-*pitch does not meet the maximum 1000 character cap*
₹25,000 INR in 7 days
8.8
8.8

Have over 18 years of experience in data mining/ Web scrapping/ Scraping Bots/ Chrome/Opera Extensions I have done it all. Tell us your source and we will put it in excel for you, Or we can even give you filtered results as per your requirement, In the format you want. You can also ask for data into a particular format - Excel, Json, Mysql, Databases, XMLs, you name them. Further Can help you with integrating it with ur databases, Can create json outputs. We are not only good with scraping but also with the tools that u may need after that. We can help you build you softwares round the data we have 99% Data Accuracy. We have Duplicate finder. etc., We can help with Statistics on the data We can help with creating Api's front the data We can create Softwares to manage that data We can build Sites round the data
₹25,000 INR in 7 days
6.9
6.9

Hello there, ✸✸✸Python Expert is Here✸✸✸ I’ve checked your project – “Python Scraper & API Enrichment -- 2” And read the description carefully. As a professional Python Developer, I’m damn sure that I can “Python-based solution that automatically gathers companies and shareholders data, pulls supplementary details via external APIs, and outputs a clean, unified dataset I can query at any time.'' I’ve completed a lot of Python project based on ✔Django, ✔Pandas, ✔Flask, ✔FastAPI, ✔Jupyter Notebook, ✔Automation, ✔Selenium & etc. Libraries in various platform. Here is some of my recent completed Python Project: ✔️ https://www.freelancer.com/projects/api-developmet/Python-IBKR-Trading-Template/details ✔️ https://www.freelancer.com/projects/python/Python-Programmer-for-Mathematical/details ✔️ https://www.freelancer.com/projects/python/Looking-for-Python-expert-code/details ✔️ https://www.freelancer.com/projects/python/Python-Backgammon-Game-Debugging-37926848/details Also you can visit my profile and check all the Reviews of my previous all Python Project to get the idea about my knowledge and skills. I’m ready to be hired or ready to be awarded as I can start this task Right Now. So, I’m waiting for your response in chat box. Best Regards! Eng. Bablu Mondol
₹13,000 INR in 3 days
5.9
5.9

Hi, As per my understanding: You need a Python-based data pipeline that collects company and shareholder information from multiple sources such as company websites, financial databases, and public records. The system should extract key fields (profiles, turnover, shareholding details), normalize company identifiers, enrich the data through APIs like OpenCorporates or Clearbit, and store everything in a clean, queryable dataset. It should support both command-line execution and reusable function calls, while logging activity and errors clearly. Implementation approach: I will build a modular Python scraping and enrichment pipeline using Requests/Scrapy for crawling and BeautifulSoup for parsing structured fields. For dynamic pages, Selenium or Playwright can be added when necessary. Extracted data will be normalized and enriched via APIs (OpenCorporates, Clearbit or alternatives), then structured using Pandas and SQLAlchemy before exporting to CSV and storing in SQLite or PostgreSQL. The tool will include CLI commands for manual runs and callable modules for integration into larger workflows. Logging and rate-limit handling will ensure reliable scraping while respecting robots.txt. A few quick questions: Which countries or registries should the company data focus on initially? Do you already have API keys for enrichment services like OpenCorporates or Clearbit? Approximately how many records per run should the system handle?
₹12,500 INR in 7 days
5.5
5.5

Hi Mate , Good evening! I’ve carefully checked your requirements and really interested in this job. I’m full stack node.js developer working at large-scale apps as a lead developer with U.S. and European teams. I’m offering best quality and highest performance at lowest price. I can complete your project on time and your will experience great satisfaction with me. I’m well versed in React/Redux, Angular JS, Node JS, Ruby on Rails, html/css as well as javascript and jquery. I have rich experienced in Python, Data Scraping, Data Extraction, PHP, BeautifulSoup, Data Processing, Scrapy and Software Architecture. For more information about me, please refer to my portfolios. I’m ready to discuss your project and start immediately. Looking forward to hearing you back and discussing all details.. Many thanks
₹27,750 INR in 4 days
4.5
4.5

Hi,I’m a seasoned Applied ML Engineer in FinTech(6+ yoe) & can build this as a reliable company/ownership data pipeline with realistic expectations around what’s publicly available Relevant work I’ve delivered: >>Built KYC/AML-style entity enrichment pipelines: domain/company name -> canonical entity record (registry APIs,firmographic APIs),normalization & audit-friendly logs >>Developed market/issuer data aggregators pulling from public filings + APIs with robust rate limiting,caching,retries & schema versioning >>Implemented entity resolution(name variants,duplicate merging,identifier linking like LEI/registration IDs) & stored results in Postgres with clean query interfaces >>Production scraping experience using Requests/Scrapy + Playwright only where required + compliance controls Approach: 1. Define the target fields and jurisdiction + clarify shareholders level(named owners vs category % /beneficial ownership) 2. Build a modular source layer: company websites + selected public registries (OpenCorporates/Companies House/SEC EDGAR) + enrichment(Clearbit/LEI where applicable) 3. Normalize names/IDs,dedupe & produce a unified schema; persist to CSV + SQLite/Postgres via SQLAlchemy 4. Provide CLI + Python function + structured logs & a sample run(30+ companies) 5. Documentation Important note: "Demat/shareholder names" is often restricted or incomplete publicly; I’ll implement the best available public sources & clearly flag data confidence/coverage
₹15,000 INR in 7 days
4.2
4.2

This project requires a robust Python solution for automated data extraction and enrichment. My technical approach involves building a multi-stage pipeline: a core scraper using libraries like Beautiful Soup or Scrapy, followed by intelligent API integration to validate and enrich the raw data. This ensures the final dataset is comprehensive and actionable. The implementation plan focuses on delivering a working prototype for one primary data source, with structured output in JSON or CSV. My experience with Python and backend systems guarantees a maintainable solution, ready for future expansion. The budget for this initial milestone is 380.0 INR, covering development, integration, and delivery of the core scraper and primary API enrichment. What is the primary website or data source you intend to scrape first?
₹35,000 INR in 3 days
2.9
2.9

Hey, your project, Python Scraper & API Enrichment -- project looks like a great fit for my skills. I've worked on similar PHP projects and can deliver solid results. Let me know if you'd like to chat about the approach.
₹12,500 INR in 7 days
4.2
4.2

Hello Sir, I am a professional Python developer with over 7+ years of experience. I have read your requirements and am interested in working with you. I have hands-on experience in Python automation, web scraping, and data handling. My skills include Python (Scrapy, Selenium, BeautifulSoup) for efficient data extraction, and I can store and manage data in CSV files and database systems such as MongoDB. I focus on delivering reliable, clean, and well-structured solutions. I am ready to start immediately and look forward to your response. Best regards, SoftNexus Technologies
₹28,000 INR in 2 days
1.2
1.2

Your project requires a robust Python tool to aggregate and enrich company-shareholder data from multiple sources while ensuring compliance and flexibility. I’ll build a crawler using Requests and BeautifulSoup, integrating APIs like OpenCorporates and Clearbit for enrichment. Data normalization and storage will leverage Pandas and SQLAlchemy, supporting CSV and PostgreSQL outputs. The tool will offer CLI and callable function interfaces, with detailed logging and error handling. I’ll provide full source code, a virtual environment, a sample run with 30+ records, plus comprehensive documentation. Prior experience includes scraping financial databases with similar tech. Estimated timeline: 2–3 weeks. I would love to chat more about your project! Regards, Adriaan Potgieter.
₹18,750 INR in 30 days
0.0
0.0

Hi, I've built this exact type of system before — Python scrapers that pull company/shareholder data from financial databases, enrich it via external APIs, and output clean unified datasets. I can adapt my existing pipeline for you rather than starting from scratch. My stack: BeautifulSoup/Scrapy for scraping, requests for API calls, Pandas for data unification and deduplication, SQLite/CSV for queryable output. ₹12,500 — 1 day. Want me to walk you through how I'd adapt it for you? Saif
₹12,500 INR in 1 day
0.0
0.0

Lucknow, India
Member since Mar 10, 2026
₹12500-37500 INR
₹1500-12500 INR
₹1500-12500 INR
₹12500-37500 INR
₹1500-12500 INR
$30-250 USD
₹1500-12500 INR
₹37500-75000 INR
₹600-1500 INR
$30-250 USD
$10-40 USD
₹1500-12500 INR
$750-1500 USD
₹750-1250 INR / hour
$10-30 USD
£250-750 GBP
$30-250 USD
₹1500-12500 INR
$10-30 USD
$30-250 USD
$8-15 USD / hour
₹1500-12500 INR
$250-750 AUD
€30-250 EUR
$15-25 USD / hour