Hi, I need a small C# program that will extract some links from a html page. The page is a result page of blogsearch.google.com.
You C# class, called GoogleBlogSearchParser should have 2 static methods that return ArrayLists (please don't use List<>) . The methods are:
ArrayList GetPageLinks(string path, string body);
ArrayList GetResultLinks(string path, string body);
GetPageLinks should get all of the links from the page navigation. Ie, the links to page 1, 2, 3, etc.
GetResultLinks should get all of the links for the results on a give page.
Your class should use another class called PageResult. This class only has two members, string url, and string title.
When you are extracting the links for the results, you should also get the title, and set [url removed, login to view] to this title, [url removed, login to view] to the url. Then add the PageResult to the ArrayList and when you are done return.
Make sure you use Regex for parsing the links. I know if the html on the page changes that it might break the parser, that is okay. But if you are clever with your regex it shouldn't be too easy.
Also, you don't need to download the page. If you use a saved file that is find. I have a result downloader already. I just need the parser.
I need this fast so first bids win!
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
C# .net 2.0