I need a program, preferably in Python, that will scrape a list of URLs and provide the mailing address as output. The URLs will be company websites in the United States. The approach of how to do this may vary (one approach would be to target the contact page, then target the 5 digit zip code and scrape 200 characters before and after, save this out put in file and then run it against another commercially available address extracter program). The program would have to accept 10,000+ URLs and return the mailing addresses with a high degree of accuracy. I would want to review the approach with the selected programmer before the begin to program. Programmers need to have extensive experience writing scraping programs.
I need this program in a beta version by Close of Business Friday October 5th, USA Pacific Time. I will test in a few hours and return it later on the 5th with comments. A tested, finalized version of the program needs to be completed by Sunday Evening, October 7th. If the programmer completes the project in this time I will award an additional $100 beyond the agreed upon bid amount.