I would like you to build a scraper that can successfully scrape websites that we give you, and assign the appropriate values to the different contact fields.
I am giving you the URL of the HOME PAGE.
Contact information is sometimes on the home page, but most of the time the contact information is on a page called "contact", "contact us", "about us", and in some cases, "locations".
So, if your scraper cannot find contact information on the following URL's, it must look for links that say "contact", "contact us", or "about us", and scrape those pages.
We would like you to write a scraper that will work for 1000 URL's that we will send over the next two days (300 are here, 700 coming tomorrow).
In the end, we would like the code, so that we can add it to our system and execute the scraper ourself.
We would like it to return a hash, and we are looking for the correctly assigned fields for:
- Address - number/street
- Address - city
- Address - State
- Address - Zip Code
- Telephone number
- an email address located somewhere near the contact information