Regex / Content Scraper Expert Needed

We have a project which requires data points to be collected from a number of different sources. We have broken down the task into three steps ??" all using regex strings - isolate the desired section from the page - remove or replace any inconsistent or complex html - loop over and extract each individual item An example ??" though not the actual project ??" would be retrieving city temperatures from a page ??" assuming the temperatures are within a div with the id ‘temperatures’, that there are unneeded bold and italic tags, and that each game is enclosed in a table. See [url removed, login to view] attached for an example. The person we select must be fluent and practiced in writing regular expressions from scratch. RegexBuddy and similar tools will be useless to you here. Experience in scraping page content will be an assumed prerequisite. If the above example is not immediately self-evident, you will be very overwhelmed by this project. This needs to be done right on the first try, we cannot afford delays. This could lead to ongoing and steady work if you are proficient, fast and efficient

## Deliverables

1) All deliverables will be considered "work made for hire" under U.S. Copyright law. Employer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the employer on the site per the worker's Worker Legal Agreement).

2) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

3) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Employer's environment--Deliverables must be installed by the Worker in ready-to-run condition in the Employer's environment.

b) For all others including desktop software or software the employer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this project.

## Platform


Habilidades: PHP, Arquitetura de software, Teste de Software, Hospedagem Web, Gestão de Site , Teste de Website

Ver mais: writing expressions, using regular expressions, using expressions, table content project writing, steps writing well, sources lead, regular expressions example, regex is, regex explained, regex example, regex c, prerequisite software, item writing hire, italic writing, hire web page expert, gnu regex, first person writing example, example regular expressions, content writing data, self proficient, fluent software, regex, php regex, legal content, game content

Acerca do Empregador:
( 13 comentários ) Canton, United States

ID do Projeto: #2982498

2 freelancers estão ofertando em média $94 para este trabalho


See private message.

$85 USD in 14 dias
(17 Comentários)

See private message.

$102 USD in 14 dias
(0 Comentários)