We have a project that requires data points to be collected from a number of different sources. We have broken the task down into three steps, all using regex strings:

- isolate the desired section from the page
- remove or replace any inconsistent or complex HTML
- loop over and extract each individual item

An example (though not the actual project) would be retrieving city temperatures from a page, assuming the temperatures are within a div with the id ‘temperatures’, that there are unneeded bold and italic tags, and that each item is enclosed in a table. See the attached file for an example.

The person we select must be fluent and practiced in writing regular expressions from scratch. RegexBuddy and similar tools will be useless to you here. Experience in scraping page content is an assumed prerequisite. If the above example is not immediately self-evident, you will be very overwhelmed by this project.

This needs to be done right on the first try; we cannot afford delays. This could lead to ongoing and steady work if you are proficient, fast and efficient.
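
To make the temperature example above concrete, here is a minimal sketch of the three steps in Python. The posting does not name a language, and the sample markup, the function names and the two-cell table layout (city, then temperature) are all assumptions made for illustration. The real pages will differ, and the patterns assume the target div contains no nested divs.

```python
import re

# Hypothetical sample page; the actual markup is not shown in the posting.
SAMPLE_HTML = """
<html><body>
<div id="header">Weather</div>
<div id="temperatures">
<table><tr><td><b>Oslo</b></td><td><i>12</i></td></tr></table>
<table><tr><td>Cairo</td><td><b>31</b></td></tr></table>
</div>
<div id="footer">done</div>
</body></html>
"""


def isolate_section(html):
    """Step 1: return the inner HTML of the div with id 'temperatures'.

    The non-greedy match stops at the first </div>, so this assumes
    the target div has no nested divs.
    """
    match = re.search(
        r'<div[^>]*\bid="temperatures"[^>]*>(.*?)</div>',
        html,
        re.DOTALL | re.IGNORECASE,
    )
    return match.group(1) if match else ""


def strip_inline_tags(fragment):
    """Step 2: remove the unneeded bold and italic tags, keeping their text."""
    return re.sub(r"</?(?:b|i)\s*>", "", fragment, flags=re.IGNORECASE)


# Assumed layout: each table holds one row with a city cell and a numeric cell.
ITEM_PATTERN = re.compile(
    r"<table[^>]*>.*?"
    r"<td[^>]*>\s*([^<]+?)\s*</td>\s*"
    r"<td[^>]*>\s*(-?\d+(?:\.\d+)?)\s*</td>"
    r".*?</table>",
    re.DOTALL | re.IGNORECASE,
)


def extract_items(fragment):
    """Step 3: loop over each table and pull out (city, temperature)."""
    return [(city, float(temp)) for city, temp in ITEM_PATTERN.findall(fragment)]


def scrape_temperatures(html):
    """Run the three steps in order on a full page."""
    return extract_items(strip_inline_tags(isolate_section(html)))


if __name__ == "__main__":
    for city, temp in scrape_temperatures(SAMPLE_HTML):
        print(city, temp)
```

Keeping each step as its own function means a pattern can be swapped per source without touching the other two steps, which is likely to matter when the same pipeline is pointed at pages with different markup.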
1) All deliverables will be considered "work made for hire" under U.S. Copyright law. Employer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the employer on the site per the worker's Worker Legal Agreement).
2) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
3) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Employer's environment--Deliverables must be installed by the Worker in ready-to-run condition in the Employer's environment.
b) For all others including desktop software or software the employer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this project.