Make Word List with Page Numbers from PDF

$30-5000 USD

Cancelled
Posted more than 13 years ago

Paid on delivery
I want a program that takes a PDF file, goes through it, makes a list of the words, and identifies which pages each word is on. This is like an index, but for every word.

It should accept a list of words to skip, which I will provide. For example, I would not want the words "I" or "and" indexed, so I would put those in the skip list. It should also combine continuous page sequences: if a word appears on pages 12, 13, 14, and 15, it should not list those individually but should list 12-15. It should pay attention to capitalization, so James and james are treated as two different words. The output file must be in alphabetical order. I will provide an example.

Note that these page numbers should be the page numbers printed in the document, which may differ from the PDF page numbers. A PDF starts numbering at 1 on its first page, but a document may have an unnumbered title page and a blank page, and then start numbering at 1 on PDF page 3. I believe the best way to handle this is to ask the user to enter an offset. The program should then try to determine each page number by looking for a number that is isolated and sequential, which might be at the top or bottom of the page or in the header. So if you find a 6 on one page and a 7 at the top of the next, you know that is the page number. If you cannot find an isolated number at the top or bottom of the page, use the offset and the PDF page number. For example, if the user enters -3 as the offset and the word JAMES appears on the seventh page of the PDF, that page is actually numbered 4 (7 - 3 = 4), so the final listing would be JAMES - 4, because the author of the document did not number the first 3 pages. Handling pages numbered in Roman numerals (i, iv, vii, ...) is good but not required; standard numbers are good enough.

The program is for Windows XP/Vista/7. Final deliverables must be a fully working program that includes an installer.
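To make the page-numbering rule above concrete, here is a minimal Python sketch of one way it could work. It assumes the text of each page has already been extracted into a list of strings by some PDF library (not shown), and the function names `detect_printed_number` and `resolve_page_labels` are hypothetical. It only recognizes Arabic numerals, and it treats "isolated" as meaning the number is alone on the first or last non-empty line of the page, which is one possible reading of the requirement.

```python
import re


def detect_printed_number(page_text):
    """Return a number that stands alone on the first or last non-empty line, else None."""
    lines = [ln.strip() for ln in page_text.splitlines() if ln.strip()]
    if not lines:
        return None
    for candidate in (lines[0], lines[-1]):
        if re.fullmatch(r"\d+", candidate):
            return int(candidate)
    return None


def is_sequential(detected, i):
    """True if page i's detected number continues the count of a neighbouring page."""
    num = detected[i]
    if num is None:
        return False
    before = i > 0 and detected[i - 1] == num - 1
    after = i + 1 < len(detected) and detected[i + 1] == num + 1
    return before or after


def resolve_page_labels(page_texts, offset):
    """Map each PDF page (1-based position) to the page number to report.

    A detected number is trusted only when it is sequential with a neighbour;
    otherwise the fallback is PDF page number + user-supplied offset.
    """
    detected = [detect_printed_number(text) for text in page_texts]
    labels = []
    for i in range(len(page_texts)):
        if is_sequential(detected, i):
            labels.append(detected[i])
        else:
            labels.append((i + 1) + offset)
    return labels


if __name__ == "__main__":
    pages = [
        "Title",
        "",
        "Contents",
        "1\nOnce upon a time",
        "more text\n2",
        "a page with no number",
        "JAMES arrives",
    ]
    # With an offset of -3, the seventh PDF page is reported as page 4.
    print(resolve_page_labels(pages, -3))
```

With this fallback, the unnumbered front pages receive zero or negative labels (for example -2, -1, 0); whether those pages should be indexed at all is left open by the description.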
I will provide a license agreement for the installer to display. Final deliverables must also include the source, the installer, and a second version of the installer: a shareware version that processes only the first 10 pages of a PDF. The program will be named Elite Concordance and Index Creator. Thanks.

## Deliverables

EXAMPLE

The output can be a text file, which might look like this:

alpha - 1, 17, 204
James - 12-18, 112
james - 12-13, 119
Yesterday - 26, 110

In the above example, the word "alpha" appears on pages 1, 17, and 204. The word James with a capital J appears on each of pages 12, 13, 14, 15, 16, 17, and 18, and then again on page 112.
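The indexing and output format described here could be sketched in Python roughly as follows. This is an illustration under stated assumptions, not the requested Windows program: the helper names (`collapse_pages`, `build_index`, `format_index`) are hypothetical, page text is assumed to be already extracted, a "word" is assumed to be a run of ASCII letters with optional internal apostrophes, and the skip list is matched case-sensitively. The sort key ignores case first and then puts capitals ahead of lowercase, because that is the ordering the example output appears to use (alpha, James, james, Yesterday).

```python
import re
from collections import defaultdict

# Assumption: a word is a run of ASCII letters, optionally with internal apostrophes.
WORD_RE = re.compile(r"[A-Za-z]+(?:'[A-Za-z]+)*")


def collapse_pages(pages):
    """Format page numbers as '1, 17, 204' or '12-18, 112', merging consecutive runs."""
    pages = sorted(set(pages))
    if not pages:
        return ""
    parts = []
    start = prev = pages[0]
    for page in pages[1:]:
        if page == prev + 1:
            prev = page
            continue
        parts.append(str(start) if start == prev else f"{start}-{prev}")
        start = prev = page
    parts.append(str(start) if start == prev else f"{start}-{prev}")
    return ", ".join(parts)


def build_index(page_texts, page_labels, skip_words):
    """Map each case-sensitive word to the set of page labels it appears on."""
    skip = set(skip_words)  # assumption: exact, case-sensitive match against the skip list
    index = defaultdict(set)
    for text, label in zip(page_texts, page_labels):
        for word in WORD_RE.findall(text):
            if word not in skip:
                index[word].add(label)
    return index


def format_index(index):
    """Return 'word - pages' lines, alphabetical ignoring case, capitals first on ties."""
    ordered = sorted(index, key=lambda w: (w.lower(), w))
    return [f"{word} - {collapse_pages(index[word])}" for word in ordered]


if __name__ == "__main__":
    texts = ["alpha and James", "James and james", "james I Yesterday"]
    labels = [12, 13, 14]
    for line in format_index(build_index(texts, labels, ["I", "and"])):
        print(line)
```

The shareware variant mentioned above could reuse the same functions and simply pass in only the first 10 pages.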
Project ID: 3613838

About the project

7 proposals
Remote project
Active 14 years ago

7 freelancers are bidding an average of $481 USD for this job
See private message.
$1,700 USD in 14 days
4.9 (26 reviews)
6.4

See private message.
$126.65 USD in 14 days
4.5 (13 reviews)
4.4

See private message.
$170 USD in 14 days
4.1 (28 reviews)
4.3

See private message.
$170 USD in 14 days
5.0 (4 reviews)
2.7

See private message.
$1,015.75 USD in 14 days
5.0 (2 reviews)
2.4

See private message.
$34 USD in 14 days
5.0 (2 reviews)
1.2

See private message.
$153 USD in 14 days
0.0 (0 reviews)
0.0

About the client

Camarillo, United States
5.0
167
Payment method verified
Member since Oct 29, 2008
