Encerrado

Username scraper2

We need a script to scrape a list of usernames from a site that match/fulfil a certain pattern for their username and their profile. This pattern is made up from a string from one or more lookup lists plus some regular expressions added to it. The script should populate a database table containining the username plus other specifics such as source where it was found and optional fields like registration date, birthdate, hobbies, profile-url from source, e-mail address if present and other details of profiles found to be specified on a per source basis. Technically the script should be prepared to later pass all HTTP gets thru our proprietary multiget/proxy class but the bidder can provide a slower, serialized version as long as the http gets are serialized. The script should store it's processing state and already processed usernames as it goes, so that it doesn't lose all data if it crashes unexpected. We expect the script to have some intelligent batching logic so that it can be restarted after a crash without having to redo everything over and over again.

## Deliverables

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition, as follows? (depending on the nature? of the deliverables):

a)? For web sites or? other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.

b) For all others including desktop software or software the buyer intends to distribute: A software? installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

php, mysql

Habilidades: Engenharia, MySQL, PHP, Arquitetura de software, Teste de Software, Hospedagem Web, Gestão de Site , Teste de Website

Ver mais: string processing in c, string pattern match, string pattern, string match, string c plus plus, script php proxy web, regular expressions list, regular expressions in c, regular expressions c, pattern string, match string, hire logic, c string pattern, c regular expressions, c plus plus string, fulfil, mysql populate table, php class found, lookup table mysql, http proxy request, working scraper code, date scraper, http proxy server list, scrape usernames, date class program

Acerca do Empregador:
( 80 comentários ) Austria

ID do Projeto: #3040836

1 freelancer is bidding on average $383 for this job

bitworksltd

See private message.

$382.5 USD in 14 dias
(105 Comentários)
7.3