I need data collected from 3 pages on each of 8,000 sites (24,000 pages in total), with some details noted about each page.
I have a data collection tool for this purpose. On each page where data has to be collected, you use the tool and click on the captcha image of that page. You will then see some rows of prefilled data; you just have to fill in the name of each parameter.
E.g. a user login page will typically have 2 parameters: user name and password. So there will be one record for the user name, one for the password, and 5 fixed records per page: "Form URL", "form Method" (GET or POST), "Form_captcha", "Form_Captcha_src", and "Form_Action".
Note that since you will be using a semi-automated tool, these 5 records will be created automatically by the tool most of the time. In rare cases you may have to look into the HTML source code to find these values (this will be the case for very few sites).
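For the rare sites where the values have to be read from the HTML source, the lookup can be sketched in Python as below. This is only an illustration and is not part of the data collection tool. The names FormRecordParser and build_records, the "Parameter" record label, and the example.com page are hypothetical. Two things are assumptions: that the captcha image can be recognised by the word "captcha" in its src, and that "Form_captcha" is a simple yes/no flag (please check the attached sample data for the actual meaning).

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class FormRecordParser(HTMLParser):
    """Collects method, action, field names and captcha image of the first <form>."""

    def __init__(self):
        super().__init__()
        self.in_form = False
        self.done = False
        self.method = "GET"  # HTML default when the method attribute is missing
        self.action = ""
        self.param_names = []
        self.captcha_src = ""

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "form" and not self.done:
            self.in_form = True
            self.method = (a.get("method") or "GET").upper()
            self.action = a.get("action") or ""
        elif self.in_form and tag in ("input", "select", "textarea"):
            field_type = (a.get("type") or "text").lower()
            # Buttons are not parameters the user fills in, so skip them.
            if a.get("name") and field_type not in ("submit", "button", "image", "reset"):
                self.param_names.append(a["name"])
        elif self.in_form and tag == "img":
            src = a.get("src") or ""
            # Assumption: the captcha image has "captcha" somewhere in its src.
            if "captcha" in src.lower() and not self.captcha_src:
                self.captcha_src = src

    def handle_endtag(self, tag):
        if tag == "form" and self.in_form:
            self.in_form = False
            self.done = True  # only the first form on the page is recorded


def build_records(page_url, html):
    """Return (record name, value) pairs: one per parameter plus the 5 fixed records."""
    parser = FormRecordParser()
    parser.feed(html)
    records = [("Parameter", name) for name in parser.param_names]
    captcha_url = urljoin(page_url, parser.captcha_src) if parser.captcha_src else ""
    records += [
        ("Form URL", page_url),
        ("form Method", parser.method),
        ("Form_captcha", "yes" if parser.captcha_src else "no"),  # assumed meaning
        ("Form_Captcha_src", captcha_url),
        ("Form_Action", urljoin(page_url, parser.action)),  # resolve relative actions
    ]
    return records


SAMPLE_HTML = """
<form method="post" action="/login/submit">
  <input type="text" name="username">
  <input type="password" name="password">
  <img src="captcha.php?id=1">
  <input type="submit" value="Log in">
</form>
"""

for name, value in build_records("http://example.com/account/login.php", SAMPLE_HTML):
    print(name, "=", value)
```

For the sample login form this gives two parameter records (username and password) followed by the 5 fixed records, with the relative action and captcha src turned into full URLs. A page with several forms would need the right form picked by hand, since the sketch only reads the first one.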
Please read through the instructions attached to the bid request. The attachment also contains sample data collected for one site.
In your bid, kindly estimate the number of days you require to collect this data.
Knowledge of HTML, especially HTML forms, is required to do this work.
There will be one record per site that you will have to add manually.
Also, some sites will be in German, Romanian, or another language; for such sites you have to use Google Toolbar -> Translate.
1) All deliverables will be considered "work made for hire" under U.S. Copyright law. Employer will receive exclusive and complete copyrights to all work purchased. (No 3rd party components unless all copyright ramifications are explained AND AGREED TO by the employer on the site per the worker's Worker Legal Agreement).
Windows, Linux, or Mac with Internet Explorer or Firefox