Em Andamento

Build Database from pdf files

I need an application program (.exe and source) to read a sequence of “pdf?? files and extract specific information from each file.? The program should: prompt user to select a file, extract specific information from this file to a worksheet, process all data in the file up to a certain page (the page heading of the ending page is known), and when finished prompt user for the next file, etc.?

I’ve included two input pdf files and an output example (xls) file to show what must be extracted.? The data are lists of universities and faculty, so the extraction requires you to associate the university information with each faculty in that university.? Also, each faculty member has specific information that must also be extracted. ? Please look carefully at the two input files as there is some variation in their formats. ? Note that the output example in Excel illustrates the most general description of these data, but not all fields will be present in every file and some fields will have information for certain faculty but not others.

Also, note that the university data must be “filled down?? in the output file.? See the [url removed, login to view] file for the proper presentation of this information.? You may use xls or xlsx formats.?

An initial DEMO that shows you can read pdf files increases the chance of being selected.

Habilidades: Engenharia, Excel, Microsoft Access, Microsoft Exchange, MySQL, PHP, Powerpoint, Gestão de projetos, Arquitetura de software, Teste de Software, Word

Ver mais: pdf project management, xlsx, university management, database extraction, data extraction input, php database project pdf, pdf excel extraction, extract excel files, excel pdf fields, excel project use database, extract excel pdf, extract fields data pdf, php extract pdf file, pdf extract information, select pdf file, presentation demo, excel output pdf, university management project, input information database, pdf extract data, application university, pdf data extract, pdf read, build database application, read xlsx file

Acerca do Empregador:
( 15 comentários ) United States

ID do Projeto: #3063267

Premiar a:

hanghuuhuy

See private message.

$255 USD em 5 dias
(167 Avaliações)
6.8

3 freelancers estão ofertando em média $213 para este trabalho

dpune

See private message.

$170 USD in 5 dias
(72 Comentários)
5.6
YourProgrammer

See private message.

$212.5 USD in 5 dias
(2 Comentários)
3.3