I need an application program (.exe and source) to read a sequence of “pdf” files and extract specific information from each file. The program should: prompt the user to select a file, extract specific information from this file to a worksheet, process all data in the file up to a certain page (the page heading of the ending page is known), and when finished prompt the user for the next file, and so on.
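To illustrate the stopping rule, here is a minimal sketch in Python. It assumes the text of each page has already been pulled out of the pdf by whatever library you choose, and that the known heading appears as the first line of the ending page. The function name `pages_to_process` and the choice to exclude the ending page itself are my assumptions for illustration, not fixed requirements.

```python
def pages_to_process(page_texts, end_heading):
    """Return the pages that come before the page headed by end_heading.

    page_texts: list of strings, one per pdf page, in page order
                (assumed to be extracted already by a pdf library).
    end_heading: the known heading of the ending page.
    """
    selected = []
    for text in page_texts:
        lines = text.strip().splitlines()
        first_line = lines[0].strip() if lines else ""
        # Stop as soon as the ending page is reached (assumption: the
        # ending page itself holds no data to extract).
        if first_line == end_heading:
            break
        selected.append(text)
    return selected


pages = ["University A\nSmith", "University B\nJones", "Index\nA-Z", "Appendix"]
data_pages = pages_to_process(pages, "Index")
```

If the ending page should also be extracted, the `append` would simply move above the check.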
I’ve included two input pdf files and an output example (xls) file to show what must be extracted. The data are lists of universities and faculty, so the extraction requires you to associate the university information with each faculty member in that university. Also, each faculty member has specific information that must also be extracted. Please look carefully at the two input files, as there is some variation in their formats. Note that the output example in Excel illustrates the most general description of these data, but not all fields will be present in every file, and some fields will have information for certain faculty but not others.
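One way to handle fields that are missing for some faculty is to map every extracted record onto the same fixed set of columns, leaving blanks where nothing was found. The sketch below is in Python; the column names in `COLUMNS` are hypothetical placeholders, and the real ones should be taken from the output example file.

```python
# Hypothetical column names for illustration only; use the headers
# from the output example file instead.
COLUMNS = ["university", "name", "title", "email"]


def normalize(record, columns=COLUMNS):
    """Return one worksheet row with a cell for every column.

    Fields that were not found for this faculty member become empty
    strings, so every row has the same layout.
    """
    return [record.get(col, "") for col in columns]


row = normalize({"name": "A. Smith", "university": "Example University"})
```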
Also, note that the university data must be “filled down” in the output file. See the output example file for the proper presentation of this information. You may use xls or xlsx formats.
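By “filled down” I mean that each faculty row repeats the university information from the most recent university entry above it. A minimal Python sketch of that idea follows, assuming rows are held as dictionaries before being written to the worksheet; the function name `fill_down` and the keys `university` and `name` are hypothetical.

```python
def fill_down(rows, columns):
    """Copy the last non-empty value of each listed column into later
    rows where that column is missing or blank."""
    last_seen = {}
    filled = []
    for row in rows:
        new_row = dict(row)  # leave the input rows unchanged
        for col in columns:
            value = new_row.get(col)
            if value:
                last_seen[col] = value
            elif col in last_seen:
                new_row[col] = last_seen[col]
        filled.append(new_row)
    return filled


rows = [
    {"university": "University A", "name": "Smith"},
    {"university": "", "name": "Jones"},
    {"university": "University B", "name": "Lee"},
    {"name": "Chen"},
]
filled = fill_down(rows, ["university"])
```

Here the rows for Jones and Chen pick up University A and University B respectively.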
An initial DEMO that shows you can read pdf files increases the chance of being selected.