Monday - Interview with a Wiki Ninja
Tuesday - TNWiki Article Spotlight
Wednesday - Wiki Life
Thursday - Community Wins
Friday - International Update
Saturday - Top Contributors of the Week
Sunday - Surprise
This week I'd like to give you a sneek peak at one of the tools that I use in my Wiki Life.
I use it to generate the Top Contributors of the Week Awards.
It's a very simple web crawler, written in C# and WPF
This doesn't win any style awards you understand, it is a quick and dirty tool to get the stats I need, written mostly in one evening for the task I had been given.
Below is a short (6 minute) video that shows the tool in action, with a small (24 hours) date range.
Please forgive the lack of audio track commentary.
Here is an outline of what you are seeing:
If there are any fellow developers out there, you may ask why I physically load each page, instead of just processing raw html responses from the server.
The answer is because many of these kind of pages, like the revision compare pages generate their content from Javascript loaded in the page and is not available from raw html response, but retrieved once the page has loaded.
For this reason, I have to physically load the page, wait for the Javascript to pull the page content, THEN read the page.
This means for a slow 2 hour crawl for a whole week, but works fine as a background job.
There are still plenty of stats I plan to collect and present over the coming months.
If you have any ideas for other awards we could present from this data, please let us know and I will try to include it in future crawls.
Regards,
Peter Laker