
Concluído
Publicado
Pago na entrega
I need a Python system that runs nonstop, scours the web for fashion, clothing, and footwear brands or sellers only for india, extracts every piece of publicly available business contact data it can find, pinpoints decision-makers when their names or roles appear, then verifies each email address with DNS-level checks only—no third-party validation services. All captured information must immediately land in clean, well-structured Excel workbooks, so I can open the file and start outreach without touching a database. Reliability is critical: the workflow has to restart itself after a crash, pick up where it left off, and scale out smoothly if I decide to add more crawling threads or containers later. Package everything inside a Linux-compatible Docker image; a single docker-compose up command should spin up the full pipeline. Deliverables • Full, readable Python source code organized into clear modules • Dockerfile and [login to view URL] configured for Linux hosts • An example Excel file that proves the schema and shows sample scraped contacts • Setup & run instructions (markdown or plain text) • Brief note on how the system self-heals and the exact DNS checks performed (syntax, MX, Catch-All detection, etc.) Acceptance criteria 1. Continuous discovery and scraping run successfully for 24 hours in a test environment without manual intervention. 2. At least 95 % of exported email addresses pass the specified DNS validation steps. 3. A fresh environment can be provisioned with only Docker installed by following the supplied instructions, producing the same results.
ID do Projeto: 40349438
7 propostas
Projeto remoto
Ativo há 15 dias
Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos

I can build a robust Python scraping system with DNS-level email validation, self-restarting workflows, and clean Excel outputs—fully Dockerized for one-command deployment and scalable crawling. Reliable, modular, and ready for 24/7 operation with clear docs and sample data.
₹7.000 INR em 1 dia
5,4
5,4
7 freelancers estão ofertando em média ₹7.821 INR for esse trabalho

Hello. I can build a nonstop Python scraping system packaged in Docker that continuously discovers Indian fashion brands and extracts structured contact data into Excel. The crawler will be modular, scalable, and fault-tolerant with automatic restart and checkpointing so it resumes without data loss. Email validation will be done using DNS-level checks only including MX lookup, syntax validation, and catch-all detection without any third-party services. Data will be written directly into clean Excel files using a consistent schema ready for outreach. The system will run via docker-compose with a simple command and include clear setup instructions and sample output. I’ll also document the self-healing logic and validation flow so you can scale or extend it easily later.
₹6.000 INR em 5 dias
4,6
4,6

Having spent considerable years in the realm of automation, data management, and web scraping, I am more than equipped to take on your project. My impressive proficiency in Python complements the requirements of your task. To assure smooth functionality, I can efficiently design a system that not only withstands continuous use without manual intervention but also self-recovers from a crash if needed. Besides, my expertise in creating Linux-compatible Docker images would ensure hassle-free provisioning and scalable performance for your future needs. The essence of your project lies in capturing fashion, clothing, and footwear brands or sellers' business data for India, including relevant decision-makers. My extensive range of skill sets is well-suited to curate this data with utmost precision and present them to you in a structured Excel format. Whether it is extracting contact details or verifying emails with DNS-level checks utilizing syntax, MX and Catch-All detection methods (as required), I have mastered these techniques and can do so accordingly. With detailed instructions and run guide, I will ensure that any fresh environment set up by you would generate identical results. Offering 24/7 availability, professional support, and unlimited revisions until 100% satisfaction is achieved; your project's success is inevitable with my assistance. Let's team up and make your automated fashion lead generation endeavor for India a streamlined reality!
₹6.000 INR em 1 dia
4,1
4,1

The challenge of continuously and reliably extracting verified business contacts from India’s fashion sector demands a robust, scalable solution tailored for uninterrupted operation and precise data validation. Capturing decision-makers’ information while ensuring DNS-level email verification without third-party dependencies requires a system designed with fault tolerance and modularity at its core. The need for immediate export into well-structured Excel workbooks further emphasizes the importance of seamless data flow and accessibility for direct outreach. This project will leverage Python’s powerful web scraping libraries combined with asynchronous processing to efficiently crawl multiple sources simultaneously, while intelligently managing rate limits and data deduplication. DNS validation will be implemented using native Python libraries to perform syntax checks, MX record lookups, and catch-all detection, guaranteeing over 95% accuracy of email addresses. The architecture will incorporate persistent state management to enable automatic recovery and resume from the last checkpoint after any interruption. Docker and docker-compose configurations will encapsulate the entire stack, enabling straightforward deployment on any Linux environment with a single command. Commitment to quality will be demonstrated through comprehensive modular code, clear documentation, and rigorous testing to ensure 24-hour continuous operation under real conditions. The deliverables will include a sample Excel file illustrating the data schema, detailed setup instructions, and a technical note explaining the self-healing mechanisms and DNS validation process. Let’s discuss the next steps to tailor the system precisely to your needs and ensure a seamless lead generation experience.
₹11.250 INR em 7 dias
3,1
3,1

Mumbai, India
Método de pagamento verificado
Membro desde abr. 4, 2026
₹1500-12500 INR
$8-15 USD / hora
$250-750 USD
₹1500-12500 INR
$30-250 USD
£5000-10000 GBP
€6-12 EUR / hora
£10-15 GBP / hora
$2-8 AUD / hora
€250 EUR
£20-250 GBP
$250-750 USD
$250-750 AUD
₹12500-37500 INR
$30-250 USD
$10-30 AUD / hora
₹12500-37500 INR
$5000-10000 USD
£10-15 GBP / hora
$30-250 USD
₹12500-37500 INR