I am looking for the development of a web spider that meets these requirements:
Given a list of URLs it will visit each website.
It will find and extract bricks and mortar address, telephone and email address for each website (if they are published on the site). These may be anywhere on the site and in any format.
The spider will also discover, and store, URLs on each website that display specific information. There may be any number of URLs to store per web site.
The spider also needs the ability to find this information on HTTPS pages and also framed pages.
All code is to PHP 4+ and forward compatible with PHP 5. We use MySQL version [url removed, login to view]
This is the first phase of the project. If you do a great job then you will be awarded the second and third phases as seperate projects.
I will only consider developers who have specific experience developing web spiders so please PM me with why I should select you and examples of spider development you've done.