Hello.

I am a researcher in the NLP-FII group (the natural language processing group) at the Faculty of Computer Science, Iasi, Romania. There I developed a crawler for retrieving linguistic documents, and the project has since grown into a heuristic crawler that finds its way to the target data using fuzzy logic, neural networks, or heuristics supplied by the user. It uses ontology mapping (the ontology describes the domain in which the required data exists) and other features that make it a reliable solution for hard crawling tasks. It can also be adapted to crawl almost anything, not just linguistic material.

Here is a piece of the code (it is written in Python, but I can convert it to Java):

def start_ontology_search(self):
    for line in [login to view URL]:
        req=[login to view URL](processWords,[argument])
        [login to view URL](req)
    [login to view URL]()
    [login to view URL]([login to view URL])
    ont=[login to view URL]([login to view URL])
    [login to view URL](word)

This is just one part of OntologyCrawlerTechnique, a class that implements an advanced crawling method.

I hope you will choose my bid. Good day.
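
To give a rough idea of what heuristic, ontology-guided crawling means in general, here is a minimal self-contained Python sketch. It is not the project's code: the tiny in-memory "site" (PAGES), the ontology term set (ONTOLOGY), and the functions ontology_score and heuristic_crawl are all made up for illustration, and a simple term-overlap score stands in for the fuzzy-logic or neural-network heuristics mentioned above.

```python
import heapq
from typing import Callable, Dict, List, Set, Tuple

# Hypothetical in-memory "web": URL -> (page text, [(anchor text, target URL)]).
PAGES: Dict[str, Tuple[str, List[Tuple[str, str]]]] = {
    "a": ("home page", [("sports news", "b"), ("linguistic corpus", "c")]),
    "b": ("sports news and weather", [("football scores", "d")]),
    "c": ("linguistic corpus with annotated treebank",
          [("treebank morphology syntax", "e")]),
    "d": ("football scores", []),
    "e": ("treebank download", []),
}

# Hypothetical ontology: terms describing the domain of the wanted data.
ONTOLOGY: Set[str] = {
    "linguistic", "corpus", "treebank", "morphology", "syntax", "annotation",
}


def ontology_score(text: str) -> float:
    """Fraction of ontology terms that appear in the given text."""
    words = set(text.lower().split())
    return len(words & ONTOLOGY) / len(ONTOLOGY)


def heuristic_crawl(start: str,
                    score: Callable[[str], float],
                    max_pages: int) -> List[str]:
    """Best-first crawl: always visit the most promising known link next.

    Links are ranked by applying the user-supplied `score` heuristic to
    their anchor text. Returns the URLs in the order they were visited.
    """
    # heapq is a min-heap, so scores are negated to pop the best link first.
    frontier: List[Tuple[float, str]] = [(0.0, start)]
    seen: Set[str] = {start}
    visited: List[str] = []
    while frontier and len(visited) < max_pages:
        _, url = heapq.heappop(frontier)
        visited.append(url)
        _text, links = PAGES[url]
        for anchor, target in links:
            if target not in seen:
                seen.add(target)
                heapq.heappush(frontier, (-score(anchor), target))
    return visited


if __name__ == "__main__":
    print(heuristic_crawl("a", ontology_score, max_pages=3))
```

With a budget of three pages the sketch follows the links whose anchor text matches the ontology and never visits the sports pages. Any other scoring function with the same signature can be passed in place of ontology_score.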