Em Andamento

Downloading and parsing html documents

Query the United States Patent and Trademark Office website for all patents that reference a particular patent number that I’ll provide. (This process is very straightforward and takes just seconds; I can provide full instructions.) The resulting list includes 1,314 results with 50 hits per page. Each hit is linked to the full text document for a specific patent.

What I need:

1) Someone to download the html code from the full text document for each referencing patent (i.e., each of the 1,314).

2) Once these pages of plain text html code are in hand, someone to parse the results into fields in an Excel file. There will be about 15 fields. Four of these fields (inventor, inventor location, patents referenced, and other references) will have up to 30 individual entries. I can provide full details on the specific fields that I need for each patent and guidance on the unique text that can be used as markers for finding each field within the full text document.

The deliverable is:

1) An Excel file with each of the 1314 results in its own row. The columns would be the specific fields scraped and parsed from the full text documents.

2) The code you used to do this. It must be well commented.

Habilidades: Programação C, Java, Perl, PHP, Python

Ver mais: what is linked in, well referenced, download code html website, united states as, trademark, text parsing, plain, parsed, parse html, parse an html, no html, inventor, html, HTML%, html to excel, html on, html c, html /, HITS, excel to html, excel html, html code pages, text html php, php parsing text, html location

Acerca do Empregador:
( 6 comentários ) Eugene, United States

ID do Projeto: #29522

Premiar a:

PaulWalton

Hello, ajnelson. Please read the PM board for details. Paul

$50 USD em 2 dias
(6 Avaliações)
4.1

25 freelancers estão ofertando em média $71 para este trabalho

gaffapi

please PM me the actual links so I could make a demo for you.

$100 USD in 2 dias
(72 Comentários)
6.3
danguer

Hi, I can help you, I have a very good connection (1 MBs) and very handled to this

$90 USD in 2 dias
(10 Comentários)
6.0
CruzDelSur

Hi, I would like to write a little demo for you, I will do it in PHP, could you posible show me source link from you want to get content? Regards CruzDelSur

$100 USD in 3 dias
(27 Comentários)
5.6
Zuprem

i can help you with this.

$30 USD em 1 dia
(53 Comentários)
5.5
PSE

Hi, Please check PMB for details

$90 USD em 1 dia
(14 Comentários)
5.0
gogetter

Hi, I have implmented similar projects. Since you require data to be in excel, the code would have to run on Windows (or the code could generate CSV file that you can late import in Excel). I can provide the solution i Mais

$95 USD in 6 dias
(2 Comentários)
4.4
nadeem2005

Dear Sir, We have relevant experience. Please see the PMB for complete description about this project. Here is our place holding bid for this project. Best Regards, Nadeem

$30 USD em 1 dia
(19 Comentários)
4.3
inakiseri

Please contact me for a fast development

$100 USD em 1 dia
(3 Comentários)
4.0
neon

we have done something similar and we can help you with this work too

$100 USD in 7 dias
(7 Comentários)
3.7
varatare

Hello ajnelson This is what I will do. I will use PHP to parse the HTML code and covert it into cvs format. :)

$60 USD in 3 dias
(3 Comentários)
2.9
mohanprabha

Dear Sir, I have 6+ years experience in software development regards mohan

$100 USD in 3 dias
(2 Comentários)
2.6
ranosoft

SL Hi, We take this oppurtunity to introduce ourself as an ISO 9001:2000 companyand also we are the first Indian IT company to have ISO14001 certification. [url removed, login to view] and 4 curren Mais

$100 USD in 15 dias
(3 Comentários)
4.0
cks121

I have work on this type of project. I acn commit this task within 2-5 hours if you provide me $120.00 Thanks

$51 USD em 1 dia
(0 Comentários)
0.0
vniranjan1979

Hi sir, I have pretty good knowledge and experience in parsing and validating documents of [url removed, login to view] html will be very much easire and faster in [url removed, login to view] forward to hear from u to start up this project

$40 USD in 3 dias
(0 Comentários)
0.0
UTStudios

Hi, please check your pm

$50 USD in 3 dias
(0 Comentários)
1.6
vladag

Hi, I have done similar projects and can give you little demo working according your tasks

$30 USD in 2 dias
(0 Comentários)
2.0
sanju0011

Hi, This can be done. I m committed to provide u quality sol. Thanx

$90 USD in 5 dias
(0 Comentários)
0.0
superbrain

I can do this project for you. need escrow payment and good review.

$100 USD in 3 dias
(0 Comentários)
0.0
hashvin

I have done similar jobs with html parsing using PHP. I use good technique when writing code so u can be gaurenteed it will be commented well. Please let me know any time if you would like me to get started with the pr Mais

$65 USD in 2 dias
(0 Comentários)
2.4
DanielRomero

I can do the job

$50 USD in 2 dias
(0 Comentários)
0.0