Write a Python program that analyzes Microsoft Office documents (up to Office 2003) and extracts the following information:
- the special document properties (author, version, ...)
- the full text without formatting (for Word, FrontPage and PowerPoint), with non-content information (index, ...) removed
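To illustrate what "full text without formatting" could mean for FrontPage, here is a minimal sketch using only the Python standard library. It assumes the FrontPage documents are ordinary HTML pages. The names `TextExtractor` and `html_to_text` are made up for this example, and the binary Word and PowerPoint formats would need a separate parser that is not shown here.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and skips the content of <script> and <style>."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a skipped element.
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self):
        # Join the pieces with a space, then collapse runs of whitespace.
        # Note: a word split by inline markup ("wor<b>ld</b>") gets a space.
        return " ".join(" ".join(self._chunks).split())


def html_to_text(html):
    """Return the plain text of an HTML string (hypothetical helper)."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

For example, `html_to_text("<p>Hello <b>world</b></p>")` returns `"Hello world"`. Removing "nonsense information" such as an index would need additional, document-specific rules on top of this.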
We need a Python function that does this; its argument is the filename of the document to be processed.
The output should be a text file for the full text and an XML file for the properties.
Documents we want to do this for:
- Word
- Excel
- PowerPoint
- FrontPage
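The requested interface (one function that takes a filename and produces a text file plus an XML file) might be sketched as follows. Everything named here is an assumption made for illustration: the function names `process_document` and `write_results`, the `extract` callable that stands in for the real per-format parser, the output naming scheme, and the XML layout (`<properties>` with `<property name="...">` children), none of which is specified in the brief.

```python
import os
import xml.etree.ElementTree as ET


def write_results(filename, full_text, properties):
    """Write <filename>.txt (full text) and <filename>.xml (properties)."""
    # Appending to the full name keeps report.doc and report.xls apart.
    text_path = filename + ".txt"
    xml_path = filename + ".xml"

    with open(text_path, "w", encoding="utf-8") as handle:
        handle.write(full_text)

    # Assumed XML layout: one <property name="..."> element per property.
    root = ET.Element("properties", {"source": os.path.basename(filename)})
    for name, value in properties.items():
        prop = ET.SubElement(root, "property", {"name": name})
        prop.text = str(value)
    ET.ElementTree(root).write(xml_path, encoding="utf-8", xml_declaration=True)

    return text_path, xml_path


def process_document(filename, extract):
    """Process one document.

    `extract` is a hypothetical callable that takes the filename and
    returns (full_text, properties_dict); the real parsing lives there.
    """
    full_text, properties = extract(filename)
    return write_results(filename, full_text, properties)
```

A call such as `process_document("report.doc", my_extractor)` would then create `report.doc.txt` and `report.doc.xml` next to the input file, where `my_extractor` is whatever parser is chosen for that document type.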
1) Complete and fully-functional working program(s)
2) Complete ownership and distribution copyrights to all work purchased.