Closed

Username scraper2

We need a script that scrapes a list of usernames from a site, keeping only those whose username and profile match a certain pattern. The pattern is built from a string taken from one or more lookup lists, with some regular expressions added to it.

The script should populate a database table containing the username plus other specifics, such as the source where it was found, and optional fields such as registration date, birthdate, hobbies, profile URL at the source, e-mail address if present, and other profile details to be specified on a per-source basis.

Technically, the script should be prepared to later pass all HTTP GETs through our proprietary multiget/proxy class, but the bidder may instead deliver a slower version that issues its HTTP GETs serially. The script should store its processing state and the usernames it has already processed as it goes, so that it doesn't lose all data if it crashes unexpectedly. We expect the script to have some intelligent batching logic so that it can be restarted after a crash without redoing everything.
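The requirements above (lookup-list pattern, swappable HTTP layer, crash-safe restart) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the buyer's design: the `Fetcher` interface, `SerialFetcher`, `buildPattern`, `loadState`, `saveState`, and `runBatches` are hypothetical names, the checkpoint is kept in a JSON file rather than in MySQL, and each page URL is treated as one batch.

```php
<?php
declare(strict_types=1);

/** Hypothetical abstraction: a multiget/proxy class could implement this later. */
interface Fetcher
{
    public function get(string $url): string;
}

/** Slow fallback that issues one blocking GET at a time. */
final class SerialFetcher implements Fetcher
{
    public function get(string $url): string
    {
        $body = @file_get_contents($url);
        if ($body === false) {
            throw new RuntimeException("GET failed: $url");
        }
        return $body;
    }
}

/**
 * Build one anchored, case-insensitive regex from literal lookup strings
 * plus a regex suffix. Assumes the suffix contains no unescaped "/".
 */
function buildPattern(array $lookup, string $suffixRegex): string
{
    $alternatives = array_map(fn(string $s): string => preg_quote($s, '/'), $lookup);
    return '/^(?:' . implode('|', $alternatives) . ')' . $suffixRegex . '$/i';
}

/** Load the checkpoint, or start fresh if none exists or it is unreadable. */
function loadState(string $file): array
{
    $fresh = ['next' => 0, 'seen' => []];
    if (!is_file($file)) {
        return $fresh;
    }
    $state = json_decode((string) file_get_contents($file), true);
    return is_array($state) ? $state : $fresh;
}

/** Write to a temp file, then rename, so a crash never leaves a half-written checkpoint. */
function saveState(string $file, array $state): void
{
    $tmp = $file . '.tmp';
    file_put_contents($tmp, json_encode($state));
    rename($tmp, $file); // atomic on POSIX when both paths share a filesystem
}

/** Process pages in order, checkpointing after each page so a restart resumes there. */
function runBatches(array $urls, Fetcher $fetcher, callable $extract,
                    string $pattern, string $stateFile, callable $store): void
{
    $state = loadState($stateFile);
    for ($i = $state['next']; $i < count($urls); $i++) {
        foreach ($extract($fetcher->get($urls[$i])) as $name) {
            if (isset($state['seen'][$name]) || preg_match($pattern, $name) !== 1) {
                continue; // already handled, or does not match the pattern
            }
            $store($name, $urls[$i]); // e.g. insert a row with username and source
            $state['seen'][$name] = true;
        }
        $state['next'] = $i + 1;
        saveState($stateFile, $state);
    }
}
```

If the script crashes in the middle of a page, that page is fetched again on restart, so the `$store` callback should be idempotent. With MySQL, one option is a unique key on username and source combined with `INSERT IGNORE` or `INSERT ... ON DUPLICATE KEY UPDATE`.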

## Deliverables

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment: Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.

b) For all others, including desktop software or software the Buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

php, mysql

Skills: Engineering, MySQL, PHP, Software Architecture, Software Testing, Web Hosting, Website Management, Website Testing


About the Employer:
(80 reviews) Austria

Project ID: #3040836

1 freelancer is bidding an average of $383 for this job

bitworksltd

See private message.

$382.5 USD in 14 days
(105 reviews)
7.3