This job is to create a tool written in [url removed, login to view] which parses data text files - Each conversion is a selection of text files which are read in turn split up and turned into xml. The application needs to separate out the mechanics of reading and writing the files from the parsing template so that a template can be created for each conversion type. The template is to be in xml and a tool written to allow the user to enter for each file type the start, end, data type, and label for each data field.
Once the files are converted to xml they are tested for data quality against data type and ranges this is to help identify if the template is correct or needs adjusting.
The next stage is for the xml files to be converted to another xml schema and a tool is needed to map from one schema to another.
Data cleaning is then done more rigorously and involves being able to add data checks and have these applied to the xml.
The steps are:
Conversion files read in
The selected xml template is used to convert the file into xml
Data checks are done against the xml and problems identified to allow opportunity for the user to change the xml template
A mapping is done between the xml produced above to the desired xml schema
Mapping is stored and can later be selected at this stage of the process
Data is checked for quality, this process should allow for rules to be added and run against the data to see if any non conformance
The application needs to be ergonomically designed and the interface appealing. The code needs to be of high quality and the application will evolve over time
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Installation package that will install the software (in ready-to-run condition) on the platform(s) specified in this bid request.
3) Exclusive and complete copyrights to all work purchased. (No GPL, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site).