Get properties & text out of Office documents (Python)

Write a Python program that analyzes Office documents (up to Office 2003) and extracts the following information:

- the special document properties (author, version, ...)

- the full text without formatting (for Word, FrontPage & PowerPoint), with nonsense information (index, ...) removed
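To illustrate the two items above for the FrontPage case only, here is a minimal sketch using Python's standard `html.parser`. It assumes that FrontPage documents are ordinary HTML pages and that their properties live in the `<title>` and `<meta name=... content=...>` tags; the names `PageTextExtractor` and `extract_html` are hypothetical and not part of any requirement.

```python
from html.parser import HTMLParser


class PageTextExtractor(HTMLParser):
    """Collect visible text plus <title> and <meta name=...> values (assumed property source)."""

    SKIPPED_TAGS = {"script", "style"}  # content that is never visible text

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.properties = {}
        self.text_parts = []
        self._skip_depth = 0
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1
        elif tag == "meta":
            attr = dict(attrs)
            name, content = attr.get("name"), attr.get("content")
            if name and content is not None:
                self.properties[name.lower()] = content
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._skip_depth:
            return
        stripped = data.strip()
        if not stripped:
            return
        if self._in_title:
            self.properties["title"] = stripped
        else:
            self.text_parts.append(stripped)


def extract_html(markup):
    """Return (properties, full_text) for one HTML page given as a string."""
    parser = PageTextExtractor()
    parser.feed(markup)
    parser.close()
    return parser.properties, "\n".join(parser.text_parts)
```

Word, Excel and PowerPoint files are binary and would need a different extractor; this sketch does not cover them.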

We need a Python function that does this; its argument is the filename of the document to be processed.
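A first step for such a function could be deciding what kind of file it was given. The sketch below is only an assumption about how that might look: `classify_document` and the extension table are hypothetical, and the check relies on Word, Excel and PowerPoint files of this era typically being OLE2 compound files that start with the signature bytes `D0 CF 11 E0 A1 B1 1A E1`.

```python
import os

# Signature found at the start of OLE2 compound files.
OLE2_SIGNATURE = bytes.fromhex("D0CF11E0A1B11AE1")

# Assumed mapping from file extension to document type (illustrative only).
KNOWN_EXTENSIONS = {
    ".doc": "word",
    ".xls": "excel",
    ".ppt": "powerpoint",
    ".htm": "frontpage",
    ".html": "frontpage",
}


def classify_document(filename):
    """Return (kind, is_ole2); kind is None when the extension is unknown."""
    extension = os.path.splitext(filename)[1].lower()
    kind = KNOWN_EXTENSIONS.get(extension)
    with open(filename, "rb") as handle:
        header = handle.read(len(OLE2_SIGNATURE))
    return kind, header == OLE2_SIGNATURE
```

Checking the signature as well as the extension guards against renamed files, for example an HTML page saved with a `.doc` extension.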

The output should be a text file for the full text and an XML file for the properties. Documents we want to do this for:

- Word

- Excel

- PowerPoint

- FrontPage
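The requested output, one text file and one XML file per document, might be written as in the sketch below. The function name `write_outputs`, the output file naming and the XML layout (`<document>` with `<property name="...">` children) are assumptions made for illustration, since the posting does not specify a schema.

```python
import os
import xml.etree.ElementTree as ET


def write_outputs(filename, properties, full_text):
    """Write <base>.txt and <base>.properties.xml next to the source file."""
    base = os.path.splitext(filename)[0]
    text_path = base + ".txt"
    xml_path = base + ".properties.xml"

    # Full text goes into a plain UTF-8 text file.
    with open(text_path, "w", encoding="utf-8") as handle:
        handle.write(full_text)

    # Properties go into a small XML document (assumed layout).
    root = ET.Element("document", {"source": os.path.basename(filename)})
    for name, value in sorted(properties.items()):
        prop = ET.SubElement(root, "property", {"name": name})
        prop.text = str(value)
    ET.ElementTree(root).write(xml_path, encoding="utf-8", xml_declaration=True)

    return text_path, xml_path
```

For example, `write_outputs("report.doc", {"author": "Jane"}, "Hello")` would create `report.txt` and `report.properties.xml` in the same folder as `report.doc`.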

## Deliverables

1) Complete and fully-functional working program(s)

2) Complete ownership and distribution copyrights to all work purchased.

Skills: Engineering, Excel, Microsoft Access, Microsoft Exchange, MySQL, PHP, Powerpoint, Python, Software Architecture, Software Testing, Word

About the Employer:
(24 reviews) Belgium

Project ID: #2958658

Awarded to:

See private message.

$85 USD in 20 days
(45 reviews)