<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://arachnode.net/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>General Questions</title><link>http://arachnode.net/forums/7.aspx</link><description /><dc:language>en</dc:language><generator>CommunityServer 2008.5 SP1 (Debug Build: 31106.3070)</generator><item><title>Re: Exceptions query,  do my results look typical</title><link>http://arachnode.net/forums/thread/12186.aspx</link><pubDate>Fri, 09 Apr 2010 15:23:15 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:12186</guid><dc:creator>arachnode.net</dc:creator><slash:comments>0</slash:comments><comments>http://arachnode.net/forums/thread/12186.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=12186</wfw:commentRss><description>&lt;p&gt;Also, these typically mean that your ISP is throttling you.&lt;/p&gt;
&lt;p&gt;8855&amp;nbsp;Unable to connect to the remote server.&lt;/p&gt;
&lt;p&gt;31604&amp;nbsp;The remote server returned an error: (404) Not Found.&amp;nbsp; &lt;strong&gt;(as discussed)&lt;/strong&gt;&lt;br /&gt;8855&amp;nbsp;Unable to connect to the remote server &lt;strong&gt;(ISP throttling)&lt;/strong&gt;&lt;br /&gt;8390&amp;nbsp;The remote name could not be resolved:&amp;nbsp;&amp;nbsp;&lt;strong&gt;(DNS errors - &lt;/strong&gt;&lt;a href="http://www.asdfsdgawsrvwesdvsdssdsfsdf.com"&gt;&lt;strong&gt;www.asdfsdgawsrvwesdvsdssdsfsdf.com&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt;, etc.)&lt;/strong&gt;&lt;br /&gt;1263&amp;nbsp;The remote server returned an error: (401) Unauthorized.&lt;br /&gt;766&amp;nbsp;The remote server returned an error: (403) Forbidden.&lt;br /&gt;727&amp;nbsp;The remote server returned an error: (500) Internal Server Error.&lt;br /&gt;545&amp;nbsp;The operation has timed out &lt;strong&gt;(default 60 second timeout to connect)&lt;/strong&gt;&lt;br /&gt;501&amp;nbsp;Invalid URI: The hostname could not be parsed. &lt;strong&gt;(chrome://)&lt;/strong&gt;&lt;br /&gt;494&amp;nbsp;&amp;#39;charset=iso-8859-1&amp;#39; is not a supported encoding name. Parameter name: name &lt;strong&gt;(custom meta tags, HttpRequestHeaders implemented improperly by site owners)&lt;/strong&gt;&lt;br /&gt;390&amp;nbsp;Too many automatic redirections were attempted.&amp;nbsp; &lt;strong&gt;Recenly impvoed cookie handing should take care of this one.&amp;nbsp; Check SVN / WebClient.cs&lt;/strong&gt;&lt;br /&gt;369&amp;nbsp;The value of the date string in the header is invalid.&amp;nbsp; &lt;strong&gt;(custom webserver headers - bugs on the part of the site owners)&lt;/strong&gt;&lt;br /&gt;292&amp;nbsp;The remote server returned an error: (400) Bad Request.&lt;br /&gt;248&amp;nbsp;The underlying connection was closed: An unexpected error occurred on a send.&amp;nbsp; &lt;strong&gt;(server reboots, bounces, load balancing)&lt;/strong&gt;&lt;br /&gt;215&amp;nbsp;The request was aborted: The request was canceled.&amp;nbsp; &lt;strong&gt;(server reboots, bounces, load balancing)&lt;/strong&gt;&lt;br /&gt;159&amp;nbsp;The remote server returned an error: (503) Server Unavailable.&lt;br /&gt;136&amp;nbsp;&amp;#39;ISO 8859-1&amp;#39; is not a supported encoding name. Parameter name: name&amp;nbsp; &lt;strong&gt;(custom webserver headers - bugs on the part of the site owners)&lt;/strong&gt;&lt;br /&gt;104&amp;nbsp;&amp;#39;ansi_x3.110-1983&amp;#39; is not a supported encoding name. Parameter name: name&amp;nbsp; &lt;strong&gt;(custom webserver headers - bugs on the part of the site owners)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The last two could likely be enhanced.&amp;nbsp; Give me an example?&lt;/strong&gt;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Re: Exceptions query,  do my results look typical</title><link>http://arachnode.net/forums/thread/12185.aspx</link><pubDate>Fri, 09 Apr 2010 15:14:54 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:12185</guid><dc:creator>arachnode.net</dc:creator><slash:comments>0</slash:comments><comments>http://arachnode.net/forums/thread/12185.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=12185</wfw:commentRss><description>&lt;p&gt;Looks normal to me.&lt;/p&gt;
&lt;p&gt;You can always spot check a few.&lt;/p&gt;
&lt;p&gt;Things to keep in mind.&lt;/p&gt;
&lt;p&gt;1.) Sites may only allow you to retreive X Discoveries in any given period.&lt;/p&gt;
&lt;p&gt;2.) This value varies widely from site to site, athough you may not hit their thresholds due to crawling speeds and attempted politeness.&amp;nbsp; (Round-robin requests, delays.)&lt;/p&gt;
&lt;p&gt;3.) Your ISP may restrict the number of connections you can make in any given period.&lt;/p&gt;
&lt;p&gt;4.) There are a ton of broken links on the internet.&amp;nbsp; :D&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Exceptions query,  do my results look typical</title><link>http://arachnode.net/forums/thread/12184.aspx</link><pubDate>Fri, 09 Apr 2010 14:51:33 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:12184</guid><dc:creator>DataMan</dc:creator><slash:comments>0</slash:comments><comments>http://arachnode.net/forums/thread/12184.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=12184</wfw:commentRss><description>&lt;p&gt;When I run the following query&lt;/p&gt;
&lt;p&gt;SELECT&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; LEFT(Message, 39) AS Expr1, COUNT(ID) AS Expr2&lt;br /&gt;FROM&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Exceptions AS Exceptions_1&lt;br /&gt;GROUP BY LEFT(Message, 39)&lt;br /&gt;HAVING&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (LEFT(Message, 39) LIKE &amp;#39;The remote name could not be resolved:%&amp;#39;)&lt;br /&gt;UNION&lt;br /&gt;SELECT&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Message AS Expr1, COUNT(ID) AS Expr2&lt;br /&gt;FROM&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Exceptions&lt;br /&gt;GROUP BY Message&lt;br /&gt;HAVING&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (NOT (Message LIKE &amp;#39;The remote name could not be resolved:%&amp;#39;))&lt;br /&gt;ORDER BY Expr2 DESC&lt;/p&gt;
&lt;p&gt;I get the following resultswhich i&amp;#39;ve truncated for just the highest numbers:&lt;/p&gt;
&lt;p&gt;31604&amp;nbsp;The remote server returned an error: (404) Not Found.&lt;br /&gt;8855&amp;nbsp;Unable to connect to the remote server&lt;br /&gt;8390&amp;nbsp;The remote name could not be resolved: &lt;br /&gt;1263&amp;nbsp;The remote server returned an error: (401) Unauthorized.&lt;br /&gt;766&amp;nbsp;The remote server returned an error: (403) Forbidden.&lt;br /&gt;727&amp;nbsp;The remote server returned an error: (500) Internal Server Error.&lt;br /&gt;545&amp;nbsp;The operation has timed out&lt;br /&gt;501&amp;nbsp;Invalid URI: The hostname could not be parsed.&lt;br /&gt;494&amp;nbsp;&amp;#39;charset=iso-8859-1&amp;#39; is not a supported encoding name. Parameter name: name&lt;br /&gt;390&amp;nbsp;Too many automatic redirections were attempted.&lt;br /&gt;369&amp;nbsp;The value of the date string in the header is invalid.&lt;br /&gt;292&amp;nbsp;The remote server returned an error: (400) Bad Request.&lt;br /&gt;248&amp;nbsp;The underlying connection was closed: An unexpected error occurred on a send.&lt;br /&gt;215&amp;nbsp;The request was aborted: The request was canceled.&lt;br /&gt;159&amp;nbsp;The remote server returned an error: (503) Server Unavailable.&lt;br /&gt;136&amp;nbsp;&amp;#39;ISO 8859-1&amp;#39; is not a supported encoding name. Parameter name: name&lt;br /&gt;104&amp;nbsp;&amp;#39;ansi_x3.110-1983&amp;#39; is not a supported encoding name. Parameter name: name&lt;/p&gt;
&lt;p&gt;Do the top 4 exception counts seem typical to anyone elses results?&amp;nbsp; Does it look like I should be concerned about a count of 31604 for the&amp;nbsp;404 error?&lt;/p&gt;
&lt;p&gt;My web page count is 296587 at the moment.&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item></channel></rss>