Testing the Spider
It is now 4 in the morning, and I have spent the night testing the spider on various tutorial sites. I spidered all the tutorials on GreyCobra in a couple of minutes and found that they have exactly 229 tutorials. This automated process will make things easier for webmasters: when they write a new tutorial, they do not have to submit it to the database, because the spider will crawl the site and pick it up. The test went smoothly, and I am very happy with both the speed and the accuracy of the spider.
My next big opponent is 13Dots.com. I figured that starting off with a lot of tutorials in the database, at least 1000 right off the bat, would be great for the website, so I might as well test the spider on the larger tutorial communities.
While testing the spider, I made a lot of changes to its algorithm and how it operates. I changed some of the methods for parsing the tutorial URLs and for ensuring that we are grabbing the correct ones. We will also depend on the user community to keep the spider in tip-top shape by reporting bad tutorials, so that the bugs can be investigated and the fixes compiled into the spider.
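To give a rough idea of what "grabbing the correct URLs" can involve, here is a minimal Python sketch. It is not the spider's actual code. The `LinkCollector` class, the `extract_tutorial_urls` function, and the `/tutorials/` path marker are all made up for illustration, and a real site would likely need its own rules. The sketch resolves relative links, drops fragments and off-site links, and keeps only URLs whose path looks like a tutorial.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in a page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_tutorial_urls(html, base_url, path_marker="/tutorials/"):
    """Return unique, absolute, same-host URLs whose path contains path_marker.

    path_marker is an assumption; each site may organize tutorials differently.
    """
    collector = LinkCollector()
    collector.feed(html)

    base_host = urlparse(base_url).netloc
    seen = set()
    result = []
    for href in collector.hrefs:
        # Resolve relative links and strip any "#fragment" part.
        absolute, _fragment = urldefrag(urljoin(base_url, href))
        parts = urlparse(absolute)
        if parts.scheme not in ("http", "https"):
            continue  # skip mailto:, javascript:, and similar
        if parts.netloc != base_host:
            continue  # stay on the site being spidered
        if path_marker not in parts.path:
            continue  # does not look like a tutorial page
        if absolute not in seen:
            seen.add(absolute)
            result.append(absolute)
    return result
```

A check like this is only a first pass. Pages that slip through and turn out not to be tutorials are exactly the kind of thing user reports would help catch.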
