In Progress

Fetching 1800 URLs, parsing data into CSV or database

I will deliver a set of 1800 URLs.

All HTML files are formatted the same way. It is a directory of people, and you have to parse fields such as name, street, ZIP, email, telephone, fax, etc.

The HTML is cleanly formatted and very straightforward to parse with a few regular expressions or grep commands.
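A minimal sketch of the regular-expression approach, in Python. The sample page is not reproduced in this posting, so the markup and the class names (`name`, `street`, `zip`, and so on) are hypothetical placeholders; the pattern would need to be adapted after inspecting a real page.

```python
import re

# Hypothetical markup: the real directory pages are not shown here, so these
# class names are placeholders to be replaced after inspecting a sample page.
SAMPLE_HTML = """
<div class="entry">
  <span class="name">Dr. bach</span>
  <span class="street">Kosef-Str. 18</span>
  <span class="zip">05116</span>
  <span class="city">Mainz</span>
  <span class="tel">07131 228290</span>
  <span class="fax">07131 232123</span>
</div>
"""

# One pattern captures both the field name (the class) and its text value.
FIELD_RE = re.compile(
    r'<span class="(?P<field>[\w-]+)">(?P<value>.*?)</span>', re.S
)

def parse_entry(html):
    """Return a dict mapping each field name to its stripped text value."""
    return {
        m.group("field"): m.group("value").strip()
        for m in FIELD_RE.finditer(html)
    }

record = parse_entry(SAMPLE_HTML)
```

Keeping the ZIP as a string matters here: values such as 05116 would lose their leading zero if converted to integers.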

You will have to fetch the HTML (each file is between 10 and 100 KB), so the total data size should be only around 100 MB.
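At 10 to 100 KB per file, 1800 files come to somewhere between 18 MB and 180 MB, so a plain sequential download is enough. The following is a rough sketch using only the Python standard library; the function name `fetch_all`, the hashed file names, and the `delay` parameter are illustrative assumptions, not requirements of the project.

```python
import hashlib
import time
import urllib.request
from pathlib import Path

def fetch_all(urls, out_dir, delay=0.0, timeout=30):
    """Download each URL once into out_dir; return {url: Path or None}."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    results = {}
    for url in urls:
        # Hash the URL to get a stable, filesystem-safe file name.
        name = hashlib.sha1(url.encode("utf-8")).hexdigest() + ".html"
        target = out / name
        if not target.exists():  # skip pages saved by an earlier run
            try:
                with urllib.request.urlopen(url, timeout=timeout) as resp:
                    target.write_bytes(resp.read())
            except OSError as exc:  # URLError and socket timeouts are OSError subclasses
                print(f"failed: {url}: {exc}")
                results[url] = None
                continue
            time.sleep(delay)  # optional pause between requests
        results[url] = target
    return results
```

Saving the raw HTML to disk before parsing means the parser can be rerun and adjusted without fetching the pages again.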

You will have to parse the HTML data and feed it into a CSV file, a MySQL table, or whatever database you like to work with.
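For the CSV option, a sketch along these lines would do, assuming each parsed entry is a Python dict keyed by the field labels used in the samples. The column list `FIELDS` and the helper `write_csv` are illustrative names only.

```python
import csv
import io

# Assumed column set, taken from the field labels in the sample listing.
FIELDS = ["Name", "Adresse", "ZIP", "City", "Tel", "Fax"]

def write_csv(records, fileobj, fields=FIELDS):
    """Write dict records as CSV rows; missing fields become empty cells."""
    writer = csv.DictWriter(
        fileobj, fieldnames=fields, restval="", extrasaction="ignore"
    )
    writer.writeheader()
    writer.writerows(records)

buf = io.StringIO()
write_csv(
    [{
        "Name": "Dr. bach",
        "Adresse": "Kosef-Str. 18",
        "ZIP": "05116",
        "City": "Mainz",
        "Tel": "07131 228290",
        "Fax": "07131 232123",
    }],
    buf,
)
```

When writing to a real file instead of `io.StringIO`, open it with `newline=""` as the `csv` module documentation recommends, and pick an encoding that preserves umlauts, such as UTF-8.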

Please see the sample HTML. You would have to extract the following information from it:

Sample 1st name:

Name: Dr. bach

Adresse: Kosef-Str. 18

ZIP: 05116

City: Mainz


Schwerpunkte1: prophylaxeorientierte

Schwerpunkte2: minimalinvasive

Schwerpunkte3: schmerzfreie

Schwerpunkte4: Aesthetikzahnheilkunde

Tel: 07131 228290

Fax: 07131 232123

Sample 2nd name:

Name: [url removed, login to view]

Adresse: Agippastrasse 5

ZIP: 05131

City: Mainz


Homepage: [url removed, login to view]

Praxisinformationen1: Haltestelle

Praxisinformationen2: Rollstuhlgerecht mit Einschrenkungen

Praxisinformationen3: Abdensprechstunde

Praxisinformationen4: Samstagsprechstunde

Praxisinformationen5: klimatisierte Praxisräume

Schwerpunkte1: ästhetische Zahnheilkunde

Schwerpunkte2: Endodontie

Schwerpunkte3: Parodontologie

Schwerpunkte4: Cerec

Tel: 07131 234300
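The two samples differ in which fields appear (only the first has a fax number, only the second has a homepage) and in how many Praxisinformationen and Schwerpunkte items are listed. One possible way to get a fixed column layout is to number the repeated items and collect the union of all columns, sketched below. The helper names `flatten` and `column_order` are hypothetical, and the input is assumed to hold repeated items as Python lists.

```python
def flatten(record, list_fields=("Praxisinformationen", "Schwerpunkte")):
    """Expand list-valued fields into numbered columns (Schwerpunkte1, ...)."""
    flat = {}
    for key, value in record.items():
        if key in list_fields:
            for i, item in enumerate(value, start=1):
                flat[f"{key}{i}"] = item
        else:
            flat[key] = value
    return flat

def column_order(flat_records):
    """Collect every column seen across all records, in first-seen order."""
    columns = []
    for rec in flat_records:
        for key in rec:
            if key not in columns:
                columns.append(key)
    return columns

first = flatten({
    "Name": "Dr. bach",
    "Schwerpunkte": ["prophylaxeorientierte", "minimalinvasive"],
    "Fax": "07131 232123",
})
second = flatten({
    "Adresse": "Agippastrasse 5",
    "Praxisinformationen": ["Haltestelle"],
    "Schwerpunkte": ["Endodontie"],
})
columns = column_order([first, second])
```

Records that lack a column can then be written with an empty cell, so every row has the same width.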

Skills: C Programming, Linux, PHP, Script Install, Web Scraping

About the Employer:
(1 review) Berlin, Germany

Project ID: #544046

Awarded to:


Hello, I'm a programmer from Bulgaria. Can do this job for you easily.

$30 USD in 0 days
(3 Reviews)

14 freelancers are bidding an average of $43 for this job


I'm interested in your job.

$30 USD in 1 day
(8 Reviews)

I can do the regex task. Please see my PM.

$30 USD in 1 day
(4 Reviews)

Hi, I have already prepared a script to do this... Please check PM.

$30 USD in 2 days
(1 Review)

Hi, I recently created a script which can do almost the same things, and I think I can do it. However, should you choose my bid, I would require you to tell me exactly which fields need to be captured and also the location

$30 USD in 2 days
(0 Reviews)

I have just recently finished a very similar project which involved much more complicated parsing of an online game, and automatically playing the game. I can easily complete this project for you very quickly. Thanks

$30 USD in 1 day
(0 Reviews)

Hello. I can help you parse this data. Please contact me. Thank you.

$40 USD in 2 days
(0 Reviews)

Hi, I can write a C++ subroutine to accomplish this task. Sourabh

$30 USD in 2 days
(0 Reviews)

Hi, please check PM. Thanks, Tamrakar

$30 USD in 1 day
(0 Reviews)

Hi, I have parsed the info for you... please check PM. Thanks, Ted

$30 USD in 1 day
(0 Reviews)

$10 Save to CSV

$30 USD in 2 days
(0 Reviews)

I'm interested in working on your project... please give me one chance. charlie

$30 USD in 2 days
(0 Reviews)

I am an expert in web scraping. I can give you my code in a few hours. Please see my PM. Thanks.

$30 USD in 0 days
(0 Reviews)

I have 2 years of experience in this field

$200 USD in 4 days
(0 Reviews)