Finding PDF documents and extracting the document properties

Concluído Postado há 3 anos Pago na entrega
Concluído Pago na entrega

Just like most of the Word documents are made using Microsoft Word, most of the PDF documents are made using Adobe products. However these are mostly static documents like, books, brochures. Another example of a static document is that people sometimes convert their document from Word to PDF. In our study, we are NOT interested in these kind of documents.

The software used to create a PDF document is mentioned in the properties of the document (called Producer Line). What I want you to do in this task is to check the producer line of the PDF documents that you receive in your emails. I say emails, because nowadays companies send PDF documents via email and these are NOT static documents. They generate same looking document for every customer but with different data in it. These are the documents that I am interested in. Example document types might be, invoices, telephone bills, subscription documents, personal letters, quotes, certificates etc.

Note: I am not interested in any kind of personal data. Just to give an analogy here: you have your house and your house is made out of bricks. I ask you to check the brand of the bricks that the constructor used when building your house.

So, in short, you need to:

1. Check your emails containing PDF documents (in gmail you can search for "filename:pdf")

2. Download the PDF document to your PC.

3. Open the PDF document using Adobe Acrobat Reader (or whatever reader you use)

4. From the menu, select File -> Properties (or Ctrl + D)

5. Copy the full "PDF Producer" line

An example producer line can be:

"Adobe PDF Library 15.0"

"Adobe LiveCycle PDFG ES; modified using iText® 5.5.6 ©2000-2015 iText Group NV (AGPL-version)"

Your report should be a simple excel file that contains

- What kind of document is this? (for example credit card statement document, telephone bill, student score report, boarding pass etc.)

- The company that created the document (for example Wells Fargo Bank, or the name of the insurance company)

- The PDF Producer (as explained above)

You will be paid by the number of documents that are unique only. "Unique" means, different banks, different insurance companies, different airline companies, basically any company sending out PDF documents to people. Unique also means different document type in companies, for example bank statement and mortgage statement are two different document type, so these are counted as unique as well, even though they might be from the same bank. You will be paid for every unique PDF producer of document you present.

I have attached an example screenshot of the producer line of a document.

Java Design Gráfico Pesquisa de Mercado Python Programming

ID do Projeto: #29467419

Sobre o projeto

6 propostas Projeto remoto Ativo em há 3 anos

Concedido a:

BaibaOzola

I do most of my shopping online and I have too much free time on my hands, so I can finish this task ASAP.

$10 USD em 1 dia
(0 Comentários)
0.0

6 freelancers estão ofertando em média $20 nesse trabalho

sharmaalana1

HI I am experienced in Python Graphic Design Market Research etc I can start right now but i have few doubts and questions lets have a quick chat and get it started waiting for your reply

$20 USD in 7 dias
(2 Comentários)
2.1
djtalati

Hey Daksh this side. I read details and would love to work with as I have experience. More discussion will be on chat. I will be waiting for your response. Thanks in advance.

$20 USD em 1 dia
(1 Comentário)
1.7
MirikCoder

HI I am experienced in Java .i have few doubts and questions lets have a quick chat and get it started waiting for your reply

$20 USD in 15 dias
(0 Comentários)
0.0
imran98666

I can do anything related to PDF or word, Excel I will do this work to but I need more information. I Need Work

$22 USD em 1 dia
(0 Comentários)
0.0