I find very usefull to have in the messagesthe date / time it was recorded.
It is giving a new twist on this tool: very detailed Performance monitoring on each page
and to be more visual I did a sample here:
1/4/2009 8:07:01 Pulling from queue:
http://testa.test.com/listing/1/4/2009 8:07:01------------------------------------------------------------------------------------------
1/4/2009 8:07:01 Attempting to retrieve webpage:
http://testa.test.com/listing/1/4/2009 8:07:01 HttpWebResponse determined.
1/4/2009 8:07:01 Extracting Last Modified Date.
1/4/2009 8:07:02 Extracting HTML.
1/4/2009 8:07:02 Response Status Code: OK, Status Description: OK
1/4/2009 8:07:02 URL found in HTML:
http://testa.test.com/index.php1/4/2009 8:07:02 Cleaned URL:
http://testa.test.com/index.php1/4/2009 8:07:02 WWW checked URL:
http://testa.test.com/index.php1/4/2009 8:07:02 URL valid to spider (no invalid text)
1/4/2009 8:07:03 Validating extension of URL
a vey nice idea that compliments the above is to add a line with the time it took to will be to process each page in a 3rd column such as:
1/4/2009 8:07:00 - 00 Pulling from queue:
http://testa.test.com/listing/1/4/2009 8:07:00 - 00 ------------------------------------------------------------------------------------------
1/4/2009 8:07:00 - 00 Attempting to retrieve webpage:
http://testa.test.com/listing/1/4/2009 8:07:01 - 01 HttpWebResponse determined.
1/4/2009 8:07:01 - 01 Extracting Last Modified Date.
1/4/2009 8:07:02 - 02 Extracting HTML.
1/4/2009 8:07:02 - 02 Response Status Code: OK, Status Description: OK
1/4/2009 8:07:02 - 02 URL found in HTML:
http://testa.test.com/index.php1/4/2009 8:07:02 - 02 Cleaned URL:
http://testa.test.com/index.php1/4/2009 8:07:02 - 02 WWW checked URL:
http://testa.test.com/index.php1/4/2009 8:07:02 - 02 URL valid to spider (no invalid text)
1/4/2009 8:07:03 - 02 Validating extension of URL
The next best idea will be instead of removing the messages when reach 5000 entries, save them in log files that are automatically named using a filename with a counter like:
export_0000.txt
export_0001.txt
export_0002.txt
So all the information can be recorded if needed in order to be checked later to find out what pages took more than certain time to download in big sites.
What you think?