Em Andamento

Data mining and extraction from website

I need someone to help me extract some data from 4 websites. The data I need is the same, but the structure of the sites is different.

Basically, for the following insurance companies listed below, I want to extract a listing of all agents (agency name, address, and zip (zip must be in a separate field)) in the state of Indiana. The 4 companies and the agent locator URL is below.

Cincinnati Insurance

[url removed, login to view]

Indiana Insurance

[url removed, login to view];pagename=ramInternet%2FPage%2FramApplication&c=Page

Travelers (Select "Small Business" under Type of Buiness)

[url removed, login to view]


[url removed, login to view]

I have a list of all zip codes in Indiana (~1000) (attached). What would be possible is for someone to write a script that feeds the zip codes one by one into the sites with maximum search distance, and then appends the results into a single file. I could then eliminate all duplicates manually.

I need the Name and Address of each result.

The last (Auto-owners) is a bit different, as the site only returns one at a time, but seems to cycle through all available agents in that area. A script would probably need to be customzied for that.

Timing is ASAP. If you could do by tomororow, that'd be ideal. But up until Thursday night is ok.

Habilidades: Processamento de Dados, Processamento de dados

Ver mais: websites companies sites, type data structure, travelers insurance, state auto insurance, state auto, search data structure, need data structure, list data structure, want someone data entry, ideal websites, html codes website, different type data structure, data structure type, data structure list, data processing business, data bit, codes websites html, cincinnati insurance, bit data, auto owners insurance, auto owners, auto codes, amp agency, html codes websites, data processing companies

Acerca do Empregador:
( 27 comentários ) Glendale, United States

ID do Projeto: #148707