I have written an application that extracts news or other data from HTML pages or RSS feeds. You configure the input and output formats with regular expressions and supply the URL and the output file. The program has a built-in HTTP client: it fetches the URL, retrieves the content, and extracts whatever you have configured. The application is written in C.
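
To give a rough idea of the extraction step, here is a minimal sketch and not the application's actual code. It assumes POSIX `<regex.h>` and that the content has already been downloaded into a string; the HTTP fetch is left out. The function name `extract_matches` and the convention of writing capture group 1 to the output are assumptions made for illustration only.

```c
#include <assert.h>
#include <regex.h>
#include <stdio.h>

/* Hypothetical helper (not from the original program): writes capture
 * group 1 of every match of `pattern` in `content` to `out`, one per line.
 * Returns the number of items written, or -1 if the pattern is invalid. */
int extract_matches(const char *content, const char *pattern, FILE *out)
{
    assert(content != NULL && pattern != NULL && out != NULL);

    regex_t re;
    if (regcomp(&re, pattern, REG_EXTENDED) != 0)
        return -1;

    int count = 0;
    int eflags = 0;            /* first search starts at a real line start */
    regmatch_t m[2];           /* m[0] = whole match, m[1] = first group */
    const char *p = content;

    while (regexec(&re, p, 2, m, eflags) == 0) {
        if (m[1].rm_so >= 0) { /* group 1 took part in the match */
            int len = (int)(m[1].rm_eo - m[1].rm_so);
            fprintf(out, "%.*s\n", len, p + m[1].rm_so);
            count++;
        }
        if (m[0].rm_eo == 0) { /* empty match at p: step one char forward */
            if (*p == '\0')
                break;
            p++;
        } else {
            p += m[0].rm_eo;   /* continue after the end of this match */
        }
        eflags = REG_NOTBOL;   /* later searches do not begin a line */
    }

    regfree(&re);
    return count;
}
```

Called with a pattern such as `<title>([^<]*)</title>` and an output file opened with `fopen`, this would write each title found in the content on its own line. A real configuration would also need an output format, which this sketch does not model.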