Em Andamento

Web Data Extraction Tool

I need a web spider that does the following:

1- User enters a url

2- Spider goes to that url and follows every link on it (1 deep only).

3- Spider searches the subpage source for a word

4- For each occurence of the word, the spider extracts the word + 5 preceding characters + 5 trailing characters. Ignore white space.

5- The extracted text is written to file, 1 entry per line.

6- Spider goes to and does same thing for each subpage.

7- Spider finds a link on the bottom of original url with text "Click for more" and follows it.

8- Process loops back to step 2

9- Process ends when the "Click for more" link is not found.

*A simple configuration file called "[url removed, login to view]" should store the following variables.

- The word to search for on subpages (step 2)

- The # of preceding characters to extract (step 4)

- The # of trailing characters to extract (step 4)

- The text of the link to be followed in Step 7 and 9.

This is for personal use - no need for fancy options. Configuration file can be manually opened and altered.

Please let me know your expected time to completion. I would prefer someone who is ready to work on this as soon as possible.

## Deliverables

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Installation package that will install the software (in ready-to-run condition) on the platform(s) specified in this bid request. 3) Complete ownership and distribution copyrights to all work purchased.

## Platform

XP

Habilidades: Programação C, Delphi, Engenharia, MySQL, PHP, Arquitetura de software, Teste de Software, Visual Basic

Ver mais: web search tool, web programming platform, web config php, tool for php programming, tool back, spider web data extraction, programming with data, programming loops, loops programming, loops in programming, link deep web programming, data enters, c++ programming web, c programming loops, web spider software, search web searches web, web extraction, web data, Web Data Extraction, spider store, tool store, source code tool, url tool, extract data text, web search process

Acerca do Empregador:
( 216 comentários ) San Francisco, United States

ID do Projeto: #2964031

Premiar a:

sixi

See private message.

$10 USD em 3 dias
(11 Avaliações)
2.7

20 freelancers estão ofertando em média $57 para este trabalho

softservicesvw

See private message.

$76.5 USD in 3 dias
(329 Comentários)
7.6
atufa

See private message.

$17.85 USD in 3 dias
(72 Comentários)
6.0
idleswell

See private message.

$58.65 USD in 3 dias
(171 Comentários)
5.9
baajhanvw

See private message.

$50.15 USD in 3 dias
(16 Comentários)
5.7
zbronek

See private message.

$17 USD in 3 dias
(18 Comentários)
5.2
michaeldweber

See private message.

$42.5 USD in 3 dias
(35 Comentários)
4.6
qadram

See private message.

$46.75 USD in 3 dias
(10 Comentários)
4.5
nooneyouknow

See private message.

$29.75 USD in 3 dias
(18 Comentários)
3.7
rolandanderson

See private message.

$80.75 USD in 3 dias
(5 Comentários)
3.2
xcodervw

See private message.

$68 USD in 3 dias
(6 Comentários)
3.0
dracx

See private message.

$55.25 USD in 3 dias
(3 Comentários)
2.5
robeddielee

See private message.

$42.5 USD in 3 dias
(7 Comentários)
1.9
vijayvvw

See private message.

$21.25 USD in 3 dias
(1 Comentário)
0.0
pulsesoftware

See private message.

$29.75 USD in 3 dias
(4 Comentários)
0.0
idyaresearch

See private message.

$85 USD in 3 dias
(0 Comentários)
0.0
farooqsl

See private message.

$80.75 USD in 3 dias
(1 Comentário)
0.0
pravetz8m

See private message.

$80.75 USD in 3 dias
(0 Comentários)
0.0
kumarnarayanan

See private message.

$212.5 USD in 3 dias
(0 Comentários)
0.0
et1031

See private message.

$42.5 USD in 3 dias
(1 Comentário)
0.0