Regex / Content Scraper Expert Needed

We have a project which requires data points to be collected from a number of different sources. We have broken down the task into three steps ??" all using regex strings - isolate the desired section from the page - remove or replace any inconsistent or complex html - loop over and extract each individual item An example ??" though not the actual project ??" would be retrieving city temperatures from a page ??" assuming the temperatures are within a div with the id ‘temperatures’, that there are unneeded bold and italic tags, and that each game is enclosed in a table. See [url removed, login to view] attached for an example. The person we select must be fluent and practiced in writing regular expressions from scratch. RegexBuddy and similar tools will be useless to you here. Experience in scraping page content will be an assumed prerequisite. If the above example is not immediately self-evident, you will be very overwhelmed by this project. This needs to be done right on the first try, we cannot afford delays. This could lead to ongoing and steady work if you are proficient, fast and efficient

## Deliverables

1) All deliverables will be considered "work made for hire" under U.S. Copyright law. Employer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the employer on the site per the worker's Worker Legal Agreement).

2) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

3) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Employer's environment--Deliverables must be installed by the Worker in ready-to-run condition in the Employer's environment.

b) For all others including desktop software or software the employer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this project.

## Platform


Habilidades: PHP, Arquitetura de software, Teste de Software, Hospedagem Web, Gestão de Site , Teste de Website

Veja mais: regex scraper, writing expressions, web scraping tools php, using regular expressions, using regex in c, using expressions, table of content in project writing, steps to writing well, sources lead, scraping web content, scraping tools web, regular expressions in c, regular expressions example, regular expressions c, regex is, regex in c, regex explained, regex example, regex c, prerequisite software, lead scraping software, item writing hire, italic writing, hire a web page expert, gnu regex c

Acerca do Empregador:
( 13 comentários ) Canton, United States

ID do Projeto: #2982498

2 freelancers are bidding on average $94 for this job


See private message.

$85 USD in 14 dias
(17 Comentários)

See private message.

$102 USD in 14 dias
(0 Comentários)