I need a script/tool that can process the HTML/JavaScript of a website and build the website's DOM, so the tool needs a web browser control or a JavaScript engine.
## Deliverables
Hello!
First point: you are free to choose the operating system for the tool. It can run on Windows Server 2008 or on a Linux server.
What the tool should do:
The tool is called from a PHP script, and the PHP script passes two values to the tool:
- a browser resolution (e.g. 1280x1024)
- a website address (e.g. <[login to view URL]>)
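To make the two input values concrete, here is a minimal sketch of how the tool could check them before doing any work. The function name `parse_request` and the rule that only absolute http(s) addresses are accepted are assumptions for illustration, not part of the requirements.

```python
import re
from typing import Tuple
from urllib.parse import urlparse


def parse_request(resolution: str, url: str) -> Tuple[int, int, str]:
    """Validate the two values handed over by the PHP script.

    Returns (width, height, url) or raises ValueError.
    """
    # Accept strings such as "1280x1024" (a capital "X" is tolerated too).
    match = re.fullmatch(r"([0-9]+)x([0-9]+)", resolution.strip().lower())
    if match is None:
        raise ValueError(f"resolution must look like 1280x1024, got {resolution!r}")
    width, height = int(match.group(1)), int(match.group(2))
    if width == 0 or height == 0:
        raise ValueError("width and height must be greater than zero")

    # Assumption: only absolute http/https addresses are allowed.
    cleaned_url = url.strip()
    parsed = urlparse(cleaned_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"expected an absolute http(s) URL, got {url!r}")

    return width, height, cleaned_url
```

Rejecting bad input early means no browser instance is started for a request that cannot succeed.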
Now the tool should fetch the HTML source of that website, build the DOM, and return it to the calling PHP script. The tool should build the DOM the way a modern browser (IE 8, FF 3.x or Google Chrome) does. The web browser control should build the DOM for the browser resolution passed in by the PHP script. (Most pages do not depend on this value, but some do.) The tool/web browser control should also build the DOM as if all standard browser settings were in place and the most common add-ons, such as Flash, were installed.
After returning the DOM source to the PHP script, this tool instance should be closed.
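One possible way to get such a DOM, sketched here under assumptions: a Chromium-based browser with headless mode is installed on the server, and its executable is reachable under the hypothetical name passed as `browser_path`. The flags `--headless`, `--window-size` and `--dump-dom` are Chromium command-line switches; `--dump-dom` prints the serialized DOM to standard output, which the PHP script could then read. The function names are made up for this example.

```python
import subprocess
from typing import List


def build_dump_dom_command(browser_path: str, url: str, width: int, height: int) -> List[str]:
    """Build the command line for a single window-less browser run."""
    return [
        browser_path,                        # assumption: e.g. "chromium" on Linux
        "--headless",                        # no visible window
        f"--window-size={width},{height}",   # resolution requested by the PHP script
        "--dump-dom",                        # print the built DOM to stdout, then exit
        url,
    ]


def dump_dom(command: List[str]) -> str:
    """Run the command and return whatever it printed on stdout."""
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return result.stdout
```

Because the browser process exits after printing the DOM, each call is its own short-lived instance, which matches the requirement that the instance is closed after returning the DOM.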
Some additional requirements:
- If possible, the tool should not execute JavaScript commands such as opening pop-ups. It should only build the DOM, not do any further processing of the HTML or DOM.
- Each tool instance should be closed at the latest 60 seconds after it was started, so that connection problems with remote servers or similar issues cannot increase the number of active tool instances.
- The tool should be multi-instance-ready.
- The tool should work in window-less mode to reduce the server performance it needs.
- The tool should not be too slow. ;) Normally speed should not be a problem here, but if, for example, your web browser control computes a complex DOM rendering by default, this process may take 10 seconds or so. The tool should not need more than 3 seconds to build the DOM after receiving all the HTML source.
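The 60-second limit from the list above could be enforced by a small wrapper like the following sketch. The name `run_with_deadline` and the choice to return `None` on a timeout are assumptions; the wrapper works for any command, so the actual browser call is left out here.

```python
import subprocess
from typing import List, Optional


def run_with_deadline(command: List[str], limit_seconds: float = 60.0) -> Optional[str]:
    """Run one tool instance; return its stdout, or None if it ran too long."""
    try:
        result = subprocess.run(
            command,
            capture_output=True,
            text=True,
            timeout=limit_seconds,  # hard upper bound for this instance
        )
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child process before raising this exception,
        # so a hanging connection cannot leave the instance running.
        return None
    return result.stdout
```

Since every request runs as its own operating system process, several instances can run side by side. One caveat: the timeout only kills the process that was started directly, so helper processes spawned by a browser may need extra cleanup.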
If you have questions about this project, please ask me!
Marc