The Website: [url removed, login to view]
- The first drop-down is the discipline (in French: "Disciplines exercées"), e.g. "MEDECINE GENERALE" (the largest one). There are 97 of them in total.
- Additionally, you will have to select at least one of the following options:
- Please select one of the regions (there are 28 of them in total, significantly fewer than for the other options). As a consequence, you will have to search all disciplines for every single region.
- To find out the sex, you have to break down your search by sex, because it is not part of the results.
So you have to search all disciplines for both male and female and for every region: 97 x 28 x 2 = 5432 search queries.
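The query count above can be checked with a short sketch. The labels below are placeholders (an assumption); the real values would be read from the site's drop-downs, and `build_queries` is a hypothetical helper name.

```python
from itertools import product

# Placeholder labels: the real values come from the site's drop-downs.
DISCIPLINES = [f"discipline_{i}" for i in range(1, 98)]  # 97 disciplines
REGIONS = [f"region_{i}" for i in range(1, 29)]          # 28 regions
SEXES = ["M", "F"]                                       # searched separately


def build_queries(disciplines, regions, sexes):
    """Return one dict per (discipline, region, sex) combination."""
    return [
        {"discipline": d, "region": r, "sex": s}
        for d, r, s in product(disciplines, regions, sexes)
    ]


queries = build_queries(DISCIPLINES, REGIONS, SEXES)
print(len(queries))  # 97 * 28 * 2 = 5432
```

Because the sex is a search parameter, it can be carried from each query dict into every result row of that query.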
Disciplines exercées: MEDECINE GENERALE
Results: 1286 in total
The scraping is further complicated by two elements.
1. After your first search query you will have to agree to the terms of service:
[url removed, login to view]
2. You will have to solve a captcha, which may be tricky:
2.1 You will have to answer the question of whether you are a robot or not.
2.2 You will have to select some pictures according to the question asked, e.g. "select all the pictures with a cat".
- Example of an output in the ideal format:
<< Be advised: in French the first space usually separates the last name from the first name, but this rule has a lot of exceptions. So if you cannot be sure which part is the last name and which is the first name, please take the whole name. >>
Name: TALAEE-OBRINGER GOLNAZ
Identifiant RPPS: 10002461910
Discipline exercée: MEDECINE GENERALE
Disciplines complementaires d'exercice: ANGEIOLOGIE
street: 9 RUE CHARLES DE GAULLE
Tel: 03 89 07 31 74
Other titles 1:
other titles n:
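The name rule above could be applied conservatively, as in the sketch below. The two-part test is an assumption about what counts as "sure", not a requirement of the posting, and `split_name` is a hypothetical helper.

```python
def split_name(full_name):
    """Split 'LASTNAME FIRSTNAME' at the space only when unambiguous.

    Assumption: a name counts as unambiguous only when it has exactly
    two space-separated parts. Otherwise only the whole name is kept.
    """
    parts = full_name.split()
    whole = " ".join(parts)  # normalise repeated or stray whitespace
    if len(parts) == 2:
        return {"last_name": parts[0], "first_name": parts[1], "full_name": whole}
    return {"last_name": None, "first_name": None, "full_name": whole}


print(split_name("TALAEE-OBRINGER GOLNAZ"))
```

Names with three or more parts (compound last names, several first names) fall through to the whole-name case, which matches the instruction to keep the full name when in doubt.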
Please note: a doctor can have more than one discipline. You will need extra tables listing all titles and disciplines and a mapping table that maps a contact to one or more of these titles and disciplines.
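One possible layout for these tables is sketched below with SQLite. All table and column names, the `kind` column and the `link_discipline` helper are assumptions made for illustration; the posting does not prescribe a schema.

```python
import sqlite3

# Hypothetical schema: contacts, lookup tables, and many-to-many mapping tables.
SCHEMA = """
CREATE TABLE contact (
    rpps   TEXT PRIMARY KEY,   -- Identifiant RPPS
    name   TEXT NOT NULL,
    sex    TEXT,               -- taken from the search parameter
    street TEXT,
    tel    TEXT
);
CREATE TABLE discipline (id INTEGER PRIMARY KEY, label TEXT NOT NULL UNIQUE);
CREATE TABLE title      (id INTEGER PRIMARY KEY, label TEXT NOT NULL UNIQUE);
CREATE TABLE contact_discipline (
    rpps          TEXT    NOT NULL REFERENCES contact(rpps),
    discipline_id INTEGER NOT NULL REFERENCES discipline(id),
    kind          TEXT    NOT NULL,   -- 'main' or 'complementary'
    PRIMARY KEY (rpps, discipline_id, kind)
);
CREATE TABLE contact_title (
    rpps     TEXT    NOT NULL REFERENCES contact(rpps),
    title_id INTEGER NOT NULL REFERENCES title(id),
    PRIMARY KEY (rpps, title_id)
);
"""


def link_discipline(conn, rpps, label, kind):
    """Create the discipline if needed, then map it to the contact."""
    conn.execute("INSERT OR IGNORE INTO discipline(label) VALUES (?)", (label,))
    (disc_id,) = conn.execute(
        "SELECT id FROM discipline WHERE label = ?", (label,)
    ).fetchone()
    conn.execute(
        "INSERT OR IGNORE INTO contact_discipline VALUES (?, ?, ?)",
        (rpps, disc_id, kind),
    )


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
conn.execute(
    "INSERT INTO contact(rpps, name, street, tel) VALUES (?, ?, ?, ?)",
    ("10002461910", "TALAEE-OBRINGER GOLNAZ", "9 RUE CHARLES DE GAULLE", "03 89 07 31 74"),
)
link_discipline(conn, "10002461910", "MEDECINE GENERALE", "main")
link_discipline(conn, "10002461910", "ANGEIOLOGIE", "complementary")

rows = conn.execute(
    "SELECT d.label, cd.kind FROM contact_discipline cd "
    "JOIN discipline d ON d.id = cd.discipline_id "
    "WHERE cd.rpps = ? ORDER BY d.label",
    ("10002461910",),
).fetchall()
print(rows)  # [('ANGEIOLOGIE', 'complementary'), ('MEDECINE GENERALE', 'main')]
```

Using the RPPS identifier as the contact key also lets the same doctor, found again under another discipline or region, be merged instead of duplicated.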
Unfortunately, we had a very bad experience with a similar project, so before assigning the project we will have to ask you to provide us with a data scraping sample from this website.
Thank you for your understanding and your help!
1. IPs: We have very clean lists of proxies that are constantly checked for stability and speed.
2. Captchas: We have access to a captcha-solving API on https://de-captcher.com. They charge $2 for 1000 solved captchas. We will cover these costs.
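As a rough cost check at the quoted rate, the sketch below assumes exactly one captcha per search query; the real number may be higher if result pages or retries also trigger captchas. `captcha_cost` is a hypothetical helper.

```python
PRICE_PER_1000_USD = 2.0  # rate quoted above: $2 per 1000 solved captchas


def captcha_cost(num_captchas, price_per_1000=PRICE_PER_1000_USD):
    """Return the cost in USD for the given number of solved captchas."""
    return num_captchas * price_per_1000 / 1000


# Assumption: one captcha for each of the 5432 search queries.
print(round(captcha_cost(5432), 2))  # 10.86
```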
DeCaptcher makes it possible to communicate with all types of client systems through any of the following interfaces:
API for programming languages
The DeCaptcher API is available for the following languages:
C (both *nix and Windows)
PHP (all platforms)
Java (all platforms)
Visual Basic 6.0 (Windows)
Perl (both *nix and Windows)
The API is also encapsulated within two libraries that can be used as well:
33 freelancers are bidding on average €156 for this job
Hello, I am very interested in your project. I am a good C++/C#, Java, scraping and algorithm expert, and I am well experienced in these jobs. Let's go ahead together; I would like to work for you on a continuing basis. Thanks.
Hi, I am ready to start the work right [url removed, login to view] can do this work easily and [url removed, login to view] I have a big team of 7 people, so if you are interested, message me. Thanks.
Hello sir, we have an 8-member team and we can start the work right [url removed, login to view] work will be done at my office. Just award me the project so we can [url removed, login to view]
Hi, I have read your job posting carefully. I am looking for a job like this. I can do your job perfectly, and I assure you that you will be 100% satisfied with my work.
I'm a web developer with a lot of experience in PHP, MySQL, web searching, web scraping, data entry, Excel, etc. I have worked with [url removed, login to view] and [url removed, login to view] scraping work for large amounts of data. I can assure you 1 Mais