Description of this mini-project:
Search through many PDF books in a folder or elsewhere to find sentences in which all the letters of the alphabet appear in equal numbers.
For example, I need sentences (NOT single words) in which, taken together, the letter A appears 10 000 times, B appears 10 000 times, C appears 10 000 times, and so on to the end of the alphabet: X appears 10 000 times, Y appears 10 000 times and Z appears 10 000 times.
Each one must be a complete sentence, running from full stop (.) to full stop (.), taken from about 400 different PDF books.
What matters to me is that A appears in the text the same number of times as B, C and every other letter through X, Y and Z.
We will use the Albanian alphabet and books that I will provide.
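To make the counting rule concrete, here is a rough sketch in Python. It is an illustration only, not a required implementation, and the final app can be written in another language. It assumes that the Albanian digraphs (dh, gj, ll, nj, rr, sh, th, xh, zh) each count as one letter, and that a sentence is simply the text between two full stops. The names `ALBANIAN`, `split_sentences` and `count_letters` are made up for this example.

```python
import re
import unicodedata
from collections import Counter

# The 36 letters of the Albanian alphabet.
# Assumption: digraphs such as "dh" or "sh" are counted as ONE letter.
ALBANIAN = ["a", "b", "c", "ç", "d", "dh", "e", "ë", "f", "g", "gj", "h", "i",
            "j", "k", "l", "ll", "m", "n", "nj", "o", "p", "q", "r", "rr", "s",
            "sh", "t", "th", "u", "v", "x", "xh", "y", "z", "zh"]


def split_sentences(text):
    """Split body text into sentences that run from one full stop to the next."""
    return [s.strip() for s in text.split(".") if s.strip()]


def count_letters(sentence, alphabet=ALBANIAN):
    """Count how often each alphabet letter occurs in one sentence."""
    # Normalise so that "ë" typed as e + combining diaeresis matches "ë".
    text = unicodedata.normalize("NFC", sentence).lower()
    # Try longer letters first so that "sh" is matched before "s".
    letters = sorted(alphabet, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(letter) for letter in letters))
    return Counter(pattern.findall(text))


counts = count_letters("Shqipëria është e bukur")
print(counts["sh"], counts["ë"], counts["s"])
```

For the sample sentence this counts "sh" twice and "ë" three times, while a plain "s" is not counted at all because both occurrences belong to "sh". Treating every such pair as a digraph is a simplification. Abbreviations and decimal numbers that contain a full stop would also need extra handling.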
The script can be written either as a web application or in a cross-platform language such as Java.
Please contact me only if you understand what I am looking for and if you are a professional developer.
Criteria that the app needs to fulfill:
* Easy for the user to add the alphabet of any language, which the app then uses when searching the PDF books.
* Easy to enter, for each letter, how many times I want it to appear in the sentences, plus a button that applies the number I enter to all the letters.
* Searching and comparing different sentences until the app finds the best sentences, i.e. those containing the letters needed to reach the number given above for each letter.
* A word may NOT be repeated across different sentences.
* Easy to add all the PDF books/files in which the app will search for those sentences/letters.
* Generate a DB or Excel file with the result, which the user will use as his/her final product.
* The app has to work both on the web and on PC/Mac, with login credentials required to use it.
* The script MUST take only the body (text, sentences) from the ebook, NOT headers, footers or anything else that does not belong to the logical sentences.
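As a rough illustration of the selection criteria above (a target number per letter, and no word repeated across sentences), here is a minimal sketch in Python. It uses a simple greedy pass, which is my own simplification. It never goes over a letter's target, but it is not guaranteed to reach every target exactly, so the real app would need a smarter search. The function names and the tiny two-letter alphabet are made up for the example.

```python
import re
from collections import Counter


def count_letters(sentence, alphabet):
    """Count alphabet letters in a sentence, matching longer letters first."""
    letters = sorted(alphabet, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(letter) for letter in letters))
    return Counter(pattern.findall(sentence.lower()))


def select_sentences(sentences, alphabet, target):
    """Greedy pick: keep a sentence only if it shares no word with an already
    chosen sentence and does not push any letter above its target."""
    totals = Counter()
    used_words = set()
    chosen = []
    for sentence in sentences:
        words = set(re.findall(r"\w+", sentence.lower()))
        if words & used_words:
            continue  # a word already appears in a chosen sentence
        counts = count_letters(sentence, alphabet)
        if any(totals[l] + counts[l] > target.get(l, 0) for l in counts):
            continue  # would exceed the target for at least one letter
        chosen.append(sentence)
        totals.update(counts)
        used_words |= words
    return chosen, totals


# Toy example: the "apply to all letters" button sets one number everywhere.
alphabet = ["a", "b"]
target = {letter: 2 for letter in alphabet}
chosen, totals = select_sentences(["ab", "ab ba", "ba", "aab"], alphabet, target)
print(chosen, dict(totals))
```

In the toy run, "ab ba" is skipped because the word "ab" was already used, and "aab" is skipped because it would push A above its target. This leaves "ab" and "ba", with A and B each counted twice.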
Please read the description before you place a bid.