I need 3-4 pages visited on each of approximately 10,000 sites, with some data noted from the HTML forms on those pages.
Only apply for this project if you have good knowledge of HTML. You may need to look at the source code of websites.
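As a rough illustration of what "looking at the source code" for form data can involve, here is a minimal Python sketch that lists each form's action, method, and field names. This is not the attached semi-automatic tool, and the exact data to be noted is defined only in the attached instructions; the names FormFieldCollector and collect_forms, and the sample HTML, are made up for this example.

```python
from html.parser import HTMLParser


class FormFieldCollector(HTMLParser):
    """Collects every <form> in a page together with the fields inside it."""

    def __init__(self):
        super().__init__()
        self.forms = []       # one dict per <form> found
        self._current = None  # the form currently being parsed, or None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form":
            self._current = {
                "action": attrs.get("action") or "",
                # HTML defaults to GET when no method is given
                "method": (attrs.get("method") or "get").lower(),
                "fields": [],
            }
            self.forms.append(self._current)
        elif tag in ("input", "select", "textarea") and self._current is not None:
            self._current["fields"].append({
                "tag": tag,
                "name": attrs.get("name") or "",
                "type": attrs.get("type") or "",
            })

    def handle_endtag(self, tag):
        if tag == "form":
            self._current = None


def collect_forms(html_source):
    """Return a list of form descriptions found in the given HTML text."""
    parser = FormFieldCollector()
    parser.feed(html_source)
    parser.close()
    return parser.forms


# Hypothetical page source, standing in for a real site's HTML.
sample = """
<form action="/signup" method="POST">
  <input type="text" name="email">
  <input type="password" name="pass">
  <select name="country"></select>
</form>
"""

for form in collect_forms(sample):
    names = [field["name"] for field in form["fields"]]
    print(form["action"], form["method"], names)
```

A sketch like this only reads forms present in the static HTML; forms built by JavaScript after the page loads would still need to be checked by hand in the browser.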
I have created a tool that makes the process semi-automatic. Please look at the attached file, which contains instructions on what data is to be noted (refer to [url removed, login to view]). To see how the semi-automatic tool works, refer to Collect_data_for_3_sites_using_semi_automatic_tool.txt.
Some of the sites will be in German, Romanian, or other languages; for these you will have to use Google Toolbar -> Translate. (Google Toolbar is available for Internet Explorer and Firefox.)
If anything is confusing, please get back to me with your questions and I will try to resolve them.
1) All deliverables will be considered "work made for hire" under U.S. Copyright law. Employer will receive exclusive and complete copyrights to all work purchased. (No 3rd party components unless all copyright ramifications are explained AND AGREED TO by the employer on the site per the worker's Worker Legal Agreement).
Platform: Windows, Linux, or Mac