We need to post-process MS Word documents created by a vendor's PDF conversion software, to convert certain formatting - such as extra paragraph marks, frames, poor table layouts - to more reflow-friendly formatting, while maintaining document layout.
The PDF conversion software takes the text layer from searchable PDF files and attempts to render the text in formatted MS Word document while maintaining the layout of the PDF file as much as possible. Unfortunately, the conversion includes Word formatting that gets in the way of text and formatting reflow once the output document starts to be edited. We need to remove the extraneous formatting while maintaining the look and feel of the document, so that when the output document is edited, text and formatting reflow properly.
This project requires expert knowledge of the MS Office Object Model, as well as detailed knowledge of Word document structures.