Java Web Site Screen-Scrape Webcrawler Application (759351)
$100-350 USD
Pago na entrega
**Java Web Site Screen-Scrape Webcrawler Application****
***Our firm seeks a prototype for a java application to log onto a web site, navigate to an appropriate location in that web site, capture and collect specific data, dump it to a text file, and store it in a mySQL database.
The utility you deliver must be launched from a command line as follows:
java -jar [login to view URL] siteloginURL username password inputfile
Input file must be CSV file format containing:
FirstName, LastInitial, customer number, date of birth, gender
Output must be a file for every query from input and its matching mySQL record.
Deliverable must be a working prototype that demonstrates the functionality, associated compilable source code, in line documentation, and instructions on recompilation.
Technical skills needed: Java, mySQL.
***Additional Clarification: ** Keep in mind that this is a prototype. It simply has to demonstrate the basic screen scrape technique (preferrably on more complex DHTML, Java Script, and Framed pages). For now we are not concerned with the actual site to scrape from (use any you like that requires login). Right now we are using mySQL 4.1.14 auto-configured by Joomla JSAS.... but feel free to recommend that we use any other mySQL instantiation (so long as it is not too difficult for us to set up). Also, we will need to be able to recompile your prototype when we get it... so you need to provide some environment setup and config instructions.
Also, there is a good chance that if we are happy with what you produce, in a follow-on contract, we will ask you to extend your prototype and eventually provide the actual production site (or sites) that require data collection.
More clarification in attached ZIP file.
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
## Platform
Java utility should run anywhere (Windows XP, MAC OS X, SUSe Linux).
ID do Projeto: #3261060