Background: A key step in validating a proposed idea is evaluating it on a suitable dataset. At present I have to look through papers manually to find suitable datasets and their URLs, which is strenuous and inefficient. To support the data discovery process and to provide a better understanding of how and where datasets have been used, I propose a framework to effectively identify datasets within collections of research papers. The main technical challenges are the identification of datasets and the discovery of the connection between a dataset and the URLs where it can be retrieved.
Methods: I searched Google for the papers accepted at VLDB 2014 to identify the published research papers that developed datasets or used datasets from other published research papers. A total of 75 published research papers fell into this category, and from these I studied the datasets that could be found easily.
Result: I studied each research paper and, as a result, was able to distinguish the datasets used by each one. Most of the papers followed the same mechanism and used approximately the same dataset names in terms of the distance covered, but the algorithms varied from paper to paper.
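
The two challenges named in the Background, identifying datasets and connecting each dataset to the URLs where it can be retrieved, can be illustrated with a small Python sketch. This is not the proposed framework itself: the function names, the "Name dataset" pattern, and the same-sentence linking rule are all assumptions chosen for illustration, and a real system would need a far more robust extractor.

```python
import re

# Assumption: URLs are written out in full with an http(s) scheme.
URL_RE = re.compile(r"https?://[^\s)\]>,;]+")
# Assumption: a dataset is mentioned as "<Name> dataset" or "<Name> data set",
# where <Name> starts with a capital letter. This crude rule will also pick up
# sentence-initial words such as "The dataset".
DATASET_RE = re.compile(r"\b([A-Z][\w-]*)\s+data\s?sets?\b")


def split_sentences(text):
    """Naive splitter: break at whitespace that follows '.', '!' or '?'."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def link_datasets_to_urls(text):
    """Map each candidate dataset name to the URLs found in the same sentence."""
    links = {}
    for sentence in split_sentences(text):
        names = DATASET_RE.findall(sentence)
        # Drop a trailing period that belongs to the sentence, not the URL.
        urls = [u.rstrip(".") for u in URL_RE.findall(sentence)]
        for name in names:
            found = links.setdefault(name, [])
            for url in urls:
                if url not in found:
                    found.append(url)
    return links
```

Calling `link_datasets_to_urls` on the text of a paper returns a dictionary keyed by candidate dataset name. A name that is mentioned without any URL in the same sentence maps to an empty list, which marks it as a dataset whose source still has to be found by hand.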