In Progress

Data mining and extraction from websites

I need someone to help me extract some data from 4 websites. The data I need is the same, but the structure of the sites is different.

For each of the insurance companies listed below, I want to extract a listing of all agents in the state of Indiana: agency name, address, and zip code (the zip must be in a separate field). The 4 companies and their agent locator URLs are below.

Cincinnati Insurance

[url removed, login to view]

Indiana Insurance

[url removed, login to view];pagename=ramInternet%2FPage%2FramApplication&c=Page

Travelers (Select "Small Business" under Type of Business)

[url removed, login to view]

Auto-owners

[url removed, login to view]

I have a list of all zip codes in Indiana (~1000) (attached). One possible approach is for someone to write a script that feeds the zip codes one by one into each site with the maximum search distance and appends the results to a single file. I could then eliminate the duplicates manually.
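To illustrate the zip-by-zip approach, here is a minimal sketch, not a finished scraper. The `search` argument is a hypothetical placeholder for whatever site-specific lookup ends up being written (each of the 4 sites would need its own), and the row layout of (agency name, address, zip) is an assumption based on the fields requested above.

```python
import csv
from typing import Callable, Iterable, List, Tuple

# Assumed row shape: (agency name, street address, zip)
Row = Tuple[str, str, str]


def collect_agents(zip_codes: Iterable[str],
                   search: Callable[[str], List[Row]],
                   out_path: str) -> int:
    """Run `search` once per zip code and append every row to one CSV file.

    `search` is a hypothetical, site-specific function that takes a zip
    code and returns the agents found for it. Duplicates are NOT removed
    here; they are left in the file for later cleanup.
    """
    written = 0
    # Mode "a" appends, so results from several sites can share one file.
    with open(out_path, "a", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        for zip_code in zip_codes:
            for row in search(zip_code):
                writer.writerow(row)
                written += 1
    return written
```

Because nearby zip codes will return overlapping agents at maximum search distance, the output file is expected to contain many repeated rows.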

I need the Name and Address of each result.
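Since the zip has to end up in its own field, a small helper along these lines could split it off the end of an address string. This is a sketch under the assumption that the sites print the zip last, as a 5-digit code with an optional +4 suffix; the function name `split_zip` and the sample address are made up for illustration.

```python
import re

# Matches a 5-digit US zip at the very end of the string,
# optionally followed by a +4 suffix (which is discarded).
ZIP_RE = re.compile(r"\b(\d{5})(?:-\d{4})?\s*$")


def split_zip(address: str):
    """Return (address_without_zip, zip5); zip5 is '' when no zip is found."""
    match = ZIP_RE.search(address)
    if not match:
        return address.strip(), ""
    # Drop the zip and any trailing spaces or commas left behind.
    return address[:match.start()].rstrip(" ,"), match.group(1)


print(split_zip("100 Example St, Anytown, IN 46000"))
```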

The last (Auto-owners) is a bit different, as the site only returns one agent at a time, but seems to cycle through all available agents in that area. A script would probably need to be customized for that.
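For a site that hands back one agent per request, one possible approach is to keep requesting until an agent already seen comes back. The sketch below assumes the site really does cycle in a fixed order, so that the first repeat means the list is complete; that behavior is unverified. `fetch_next` is a hypothetical stand-in for the actual request, and `max_requests` is an arbitrary safety cap.

```python
from typing import Callable, Hashable, List


def collect_until_repeat(fetch_next: Callable[[], Hashable],
                         max_requests: int = 500) -> List[Hashable]:
    """Call `fetch_next` until it returns an agent that was already seen.

    Assumes the site cycles through agents in a fixed order. The
    `max_requests` cap stops the loop if that assumption is wrong.
    """
    seen = set()
    ordered = []
    for _ in range(max_requests):
        agent = fetch_next()
        if agent in seen:
            break  # first repeat: assume the cycle has wrapped around
        seen.add(agent)
        ordered.append(agent)
    return ordered
```

If the site returns agents in random order instead, stopping at the first repeat would miss some, and a stricter rule (for example, stopping only after many consecutive repeats) would be needed.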

Timing is ASAP. If you could finish by tomorrow, that would be ideal, but any time up until Thursday night is OK.

Skills: Data Processing


About the Employer:
( 27 reviews ) Glendale, United States

Project ID: #148707