This project is to develop low-level programming specifications to parse new files from the SEC EDGAR database ([url removed, login to view]) and pull off the attached Exhibits (when they are legal agreements) and store into a MySQL database. The table where the data should be stored should have the rtf formatted text of the document and the title of the document.
This will require some research into the EDGAR system and a determination of the best way to pull down new filings. The specifications should also take into account parsing older data as well. It is anticipated that the resulting application will run each evening and populate new data.
The deliverable from this stage is only detailed specs that can be passed on to a developer for implementation.