I need an application to input a Text file and then perform a word sort on the entire text file. After which, the words are to be counted and the final report should state how many of each word was found. No punctuation is needed so they can be discarded (. , ; : ' " ? ! ). Hyphens could be apart of a word so they will have to stay. The application should be flexible enough to only report words that are repeated 2 or more times. The 2 being flexible upwards to whatever. I guess there would be a limit but I do not know what that would be. In some of my test words have been repeated 300+ times. Also, I will provide a text file with what is called stop words. These are words that should be ignored. If the user wants to add or deleted words in this list then that option should be present. I am interested in seeing the actual number of repeats so if an option could exist for easily removing thos numbers that would be awesome. Lastly, the resulting word list should be shown in a text box so the user can edit and/or possibly save to a new Text file. Plus, the total counts of the words being repeated should be able to be toggled on/off. In other words; if "Bubba" is shown to be repeated 200 times then a toggle/option should be active where the actual 200 total is not shown. See attached file for examples of what I am looking for.
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
Windows 2000 +