I need a server script that will chron fetch html pages from here:
<[url removed, login to view]>
See the id number at the end? They increment by one and have -1 at the end so the next would be 13695907-1
The data is structured the same on all of the results pages (thousands) but there are a few diff structures, a search of about 30 results pages would show the variations.
I want to parse the tournament results into a searchable database
I have an almost working script that chron fetches the latest results and crawls backwards for older ones. It was parsing perfectly on a diff php version and on diff structured pages. It does not work now but would help a coder probably.
I need? a fetching system, a parsing of the html data into a DB and then a way to do some searching of the data by username and date range.
You can see on the results page that each tourney has a title, buyin type and a list of players with their cash results. I need to be able to compile this data to be searchable.