Encerrado

eBay scraper & analyzer

I need a program, in whatever programming language you want, that:

1. fetches all the sold completed listings resulting from specified categories/keywords in eBay + all the completed listings found in the feedback page(s) of the relative sellers.

Note: it must be able to fetch all the found listings (even thousands), not only the first page...

2. groups identical/similar pages (between those fetched in point 1)

Note 1: to detect the identical/similar pages you must use one of the following three open source 'similar detection' algorithms (or others better than these): "Sif Fingerprint", "Substring Fingerprint", "Levenshtein distance", with user configurable similarity threshold. You must IMPLEMENT ALL THESE THREE ALGORITHMS: I will select in the GUI the one of my preference.

I suggest you to copy the first two of the above cited algorithms (Sif and Subst. Fing.) from this good open source Java program that uses them: [url removed, login to view]

Note 2: the detection of the duplicates/similars must be operated only over a filtered part of the pages (e.g. title and description of the eBay item): the program must allow me to specify this filter in the form of (one or more) regexps.

3. for each so formed group, displays the number of sold items, the total earned in dollars and the average earned dollars for day.

Here is a short example of how this final result should be: [url removed, login to view]

Habilidades: .NET, Programação C, Java, Perl, PHP

Ver mais: uses algorithms, use algorithms programming, substring c, program algorithms, open source programming language, number algorithms, java open source programming, java distance, use substring, gui number, good algorithms, first programming language, example algorithms, ebay groups, c substring, all algorithms, dump3 sif fingerprint, the fing, cgi group, similarity, open source programming projects, fingerprint, ebay copy, net form gui, geocities net

Acerca do Empregador:
( 69 comentários ) Milano, Italy

ID do Projeto: #126755

2 freelancers estão ofertando em média $160 para este trabalho

markusweb

I can do it.

$200 USD in 7 dias
(14 Comentários)
4.4
kvv20

It would be interesting taks for me

$120 USD in 10 dias
(5 Comentários)
4.0