Cancelado

PDF Parsing Script Creation

THE PROBLEM

We need various PDFs (that are published by the Army) to be converted to a specific format for a website accessed by iPhone users.

The problem is that these PDFs all seem to vary slightly in some way or another, and they end up breaking the PHP parsing script I wrote. I know all about exporting from Acrobat to XML/HTML/etc. - the problem here is consistency (and knowing what you're doing with regular expressions).

WHAT I AM LOOKING FOR

I need a script (perl/php/ruby/java or whatever you want really) to parse these PDF files (or the exported XML/HTML/whatever) into an HTML format that is specifically formatted to our needs.

Please do not bid unless you are the type that dreams of creating the perfect regular expression!

I will supply all the needed PDFs (or Acrobat exported PDFs) that I will need converted. Your job will be to supply a script (or multiple scripts) that can convert it into our HTML layout format.

DETAILS

The layout is nothing extreme, chapters have their own html file, and each sub-chapter has its own html file as well. Certain items need to be bold, some in a list, and determining where the opening and closing of div tags are. Images will also need to be stripped from the PDFs (automatically done in acrobat) in PNG format and placed back into the new layout.

EXAMPLE PDFs

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

I will supply examples that have already been converted by us.

Habilidades: Java, Perl, PHP, Ruby on Rails, XML

Ver mais: perl pdf parsing, ruby parse pdf, php parse pdf file, parsing pdf iphone example, xml to pdf php, xml to pdf in php, xml pdf php, what is a regular expression, what are regular expressions in java, well published, website opening problem, website creation in java, us army, regular expression with examples, regular expressions list, regular expressions in java examples, regular expressions in c, regular expressions examples in java, regular expressions examples, regular expressions example, regular expressions c, regular expression or example, regular expression no, regular expression java example, regular expression in java examples

Acerca do Empregador:
( 0 comentários ) GOLETA, United States

ID do Projeto: #597283

10 freelancers estão ofertando em média $385 para este trabalho

swwiz

Please see PM

$500 USD in 7 dias
(3 Comentários)
5.3
gelo76

Please check PM.

$400 USD in 7 dias
(11 Comentários)
4.8
easydoneus

Hello. This seems a nice job, especially that I am doing a master in Natural Language Processing, this means that hard and complex regular expressions is my breakfast,to say so. I hope you'll find my reviews and my pas Mais

$250 USD in 3 dias
(4 Comentários)
3.7
EnjoySoft

Hello! Please, check your pmb.

$300 USD in 10 dias
(3 Comentários)
3.8
ericnobl

Please see PMB.

$250 USD in 3 dias
(4 Comentários)
2.8
freelancerrediff

Hi I could use ruby with its internal gem modules and parse your PDF file using file handles to an intermediate file which we can parse using regex and format and the o/p will be in word as u wish 250/15 - budget Mais

$250 USD in 15 dias
(1 Comentário)
0.0
sharav

Please check PMB

$450 USD in 7 dias
(1 Comentário)
0.0
sgnkar

Hi I am java programmer. I am interested on this application. Thanks

$700 USD in 16 dias
(0 Comentários)
0.0
hdpatel

Hi..Hope you are doing great..I have been working on Ruby language since last 3 years..Lets make a single script for free if it will work for you then will go ahead..thnks..Looking forward for your kind response..:)

$500 USD in 10 dias
(0 Comentários)
0.0
toronto0036

Hi please See pm Thank you

$250 USD in 3 dias
(0 Comentários)
0.0