<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://arachnode.net/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>arachnode.net</title><link>http://arachnode.net/forums/</link><description>All Posts</description><dc:language>en-US</dc:language><generator>CommunityServer 2008.5 SP1 (Debug Build: 31106.3070)</generator><item><title>How to avoid creating discoveries ?</title><link>http://arachnode.net/forums/thread/43901.aspx</link><pubDate>Tue, 21 May 2013 13:48:16 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:43901</guid><dc:creator>Dinesh</dc:creator><slash:comments>5</slash:comments><comments>http://arachnode.net/forums/thread/43901.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=43&amp;PostID=43901</wfw:commentRss><description>&lt;p&gt;Hi All,&lt;/p&gt;
&lt;p&gt;I wrote a small Plugin (crawlaction) which will extract the specific content from website and save the content into database. Here is piece of code from my plugin. I am taking the&amp;nbsp;crawl requests (websites)&amp;nbsp;from &amp;nbsp;CrawlRequests.txt and this .txt file has only one website name such as&amp;nbsp;acquia.com. So, I want to execute the below PerformAction method only once but this method executing more than 100 times. I think the&amp;nbsp;acquia.com has 100+ discoveries, thats why the PerformAction method executing 100 + times. How to avoid creating discoveries or how to avoid multiple times execution ?&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;public override void PerformAction(CrawlRequest crawlRequest, ArachnodeDAO arachnodeDAO)&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; {&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; _crawlRequest = crawlRequest.Parent.Uri.AbsoluteUri.ToString();&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;// Get the URL specified&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;var webGet = new HtmlWeb();&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;var document = webGet.Load(_crawlRequest);&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;//hyperlink weblinks&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;var companyLnks = document.DocumentNode.SelectNodes(&amp;quot;//a&amp;quot;);&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;if (companyLnks != null)&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;{&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;foreach (var lnk in companyLnks)&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;{&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;if (lnk.Attributes[&amp;quot;href&amp;quot;] != null)&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;{&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;if (lnk.Attributes[&amp;quot;href&amp;quot;].Value.Contains(&amp;quot;https://www.facebook.com/&amp;quot;))&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;{&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;_fbLink = lnk.Attributes[&amp;quot;href&amp;quot;].Value.ToString();&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;} &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;}&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;}&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;} &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;_activityDateTime = DateTime.Now.ToString();&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;arachnodeDAO.InsertWebLinks(_crawlRequest, _fbLink, _activityDateTime); &amp;nbsp;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>How to run schedule crawler (program.cs) automatically</title><link>http://arachnode.net/forums/thread/43906.aspx</link><pubDate>Thu, 23 May 2013 09:07:03 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:43906</guid><dc:creator>Dinesh</dc:creator><slash:comments>1</slash:comments><comments>http://arachnode.net/forums/thread/43906.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=43906</wfw:commentRss><description>&lt;p&gt;Hi&lt;/p&gt;
&lt;p&gt;I would like run my crawler everyday at 9:00 PM automatically. How do I accomplish this task ?&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Getting IndexReader is closed error :(</title><link>http://arachnode.net/forums/thread/43874.aspx</link><pubDate>Thu, 11 Apr 2013 20:57:22 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:43874</guid><dc:creator>InvestisDev</dc:creator><slash:comments>3</slash:comments><comments>http://arachnode.net/forums/thread/43874.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=43874</wfw:commentRss><description>&lt;p&gt;Hello Mike,&lt;/p&gt;
&lt;p&gt;We are using the older service.asmx in which we have the Hits object and on making a loop off it getting field values.&lt;/p&gt;
&lt;p&gt;If we will hit this service.asmx multiple times frequently to hit this service at the same time (almost), it throws error that IndexReader is closed. The frequency of this error increased. Please suggest some way to fix this.&lt;/p&gt;
&lt;p&gt;Thanks,&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Indexing and data mining</title><link>http://arachnode.net/forums/thread/43886.aspx</link><pubDate>Thu, 18 Apr 2013 11:26:17 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:43886</guid><dc:creator>john12890</dc:creator><slash:comments>1</slash:comments><comments>http://arachnode.net/forums/thread/43886.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=58&amp;PostID=43886</wfw:commentRss><description>&lt;p&gt;Hi all,&lt;/p&gt;
&lt;p&gt;Just going through the Demo version of AN, I would like to know how this tool indexing the data? Is the tool do the Data Mining of the crawled data? I would like to discuss in detail in this regard.&lt;/p&gt;
&lt;p&gt;My issue is that I&amp;#39;ve to crawl specific data from a authenticated web page and do the analytics on that. Will this tool can help me out?&lt;/p&gt;
&lt;p&gt;Thanks.&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Is there any way to run crawler as web service?</title><link>http://arachnode.net/forums/thread/43884.aspx</link><pubDate>Tue, 16 Apr 2013 05:51:27 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:43884</guid><dc:creator>Abhishek Gahlout</dc:creator><slash:comments>0</slash:comments><comments>http://arachnode.net/forums/thread/43884.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=43884</wfw:commentRss><description>&lt;p&gt;Hi , I need to know that is there any mechanism to run crawler as web service ? If yes then how?&lt;/p&gt;
&lt;p&gt;Actually I want to run crawler for some website links in some particular cases on my website.&lt;/p&gt;
&lt;p&gt;So can I run crawler as webservice?&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Crawling of webpage where data comes with scrollbar</title><link>http://arachnode.net/forums/thread/43873.aspx</link><pubDate>Thu, 11 Apr 2013 09:23:02 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:43873</guid><dc:creator>Abhishek Gahlout</dc:creator><slash:comments>1</slash:comments><comments>http://arachnode.net/forums/thread/43873.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=43873</wfw:commentRss><description>&lt;p&gt;
&lt;p&gt;Hi , I need to know , how should I crawl a webpage where the information is loading when page scrolls. eg: please visit the link&lt;/p&gt;
&lt;p&gt;http://www.wikido.com/us/ny/ny&lt;/p&gt;
&lt;p&gt;Though there are thousand of records for this page but at a time only 60 records are shown. further records are shown when we scroll the vertical scroll bar.Crawler only fetches data foar 60 records at a time as when page first loads it only shows 60 records. So to crawl and get complete information , how should I crawl these kind of webpages.&lt;/p&gt;
&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Crawling of webpages where data comes after page load</title><link>http://arachnode.net/forums/thread/43872.aspx</link><pubDate>Wed, 10 Apr 2013 09:15:13 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:43872</guid><dc:creator>Manoj</dc:creator><slash:comments>1</slash:comments><comments>http://arachnode.net/forums/thread/43872.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=24&amp;PostID=43872</wfw:commentRss><description>&lt;p&gt;Hi , Please tell me that how can I crawl the webpages where the information comes after the page loading is completed. Eg:&amp;nbsp;&lt;a href="http://www.spotcrime.com/#new%20york"&gt;http://www.spotcrime.com/#new%20york&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;When I crawl this webpage there is no relevant &amp;nbsp;information in the view source which gets stored in the DB , though that information is available on the webpage. The data comes after the page gets loaded by some javascript function.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;What steps should I follow to crawl such webpages.?&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Crawl specific domains, take screenshot, run javascript, ID ads, write to DB ... any ideas?</title><link>http://arachnode.net/forums/thread/418.aspx</link><pubDate>Sat, 14 Feb 2009 13:53:24 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:418</guid><dc:creator>jpntol</dc:creator><slash:comments>7</slash:comments><comments>http://arachnode.net/forums/thread/418.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=418</wfw:commentRss><description>&lt;p&gt;We want to capture info about AD SPOTS;&lt;/p&gt;
&lt;p&gt;- on specific list of approx 10,000 domains &lt;/p&gt;
&lt;p&gt;- capture a screenshot png / jpg&lt;/p&gt;
&lt;p&gt;- run javascript on each page (browser specific?)&lt;/p&gt;
&lt;p&gt;- read js and identify ad spot SIZES&lt;/p&gt;
&lt;p&gt;- identify PLACE on page that the adspot is located&lt;/p&gt;
&lt;p&gt;- relate this location back to the screenshot (draw it on the screenshot)&lt;/p&gt;
&lt;p&gt;- write all to DB&lt;/p&gt;
&lt;p&gt;- repeat above for next level down (max 1)&lt;/p&gt;
&lt;p&gt;- repeat all weekly, compare and highlight differences&lt;/p&gt;
&lt;p&gt;Are we looking at right toolkit with arachnode? Anyone out there wants to have a crack at coding this?&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Can AN crawl the authenticated website content ?</title><link>http://arachnode.net/forums/thread/43866.aspx</link><pubDate>Fri, 05 Apr 2013 08:18:52 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:43866</guid><dc:creator>john12890</dc:creator><slash:comments>1</slash:comments><comments>http://arachnode.net/forums/thread/43866.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=42&amp;PostID=43866</wfw:commentRss><description>&lt;p&gt;Hello&amp;nbsp;&lt;/p&gt;
&lt;p&gt;I am new to use the AN crawler, We would like to crawl the content from authenticated web page. Is this possible ? and also would like crawl no of likes, no of&amp;nbsp;followers&amp;nbsp;from Facebook and Twitter every day . Is this possible ??&lt;/p&gt;
&lt;p&gt;Thanks&lt;/p&gt;
&lt;p&gt;John Adams&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Requested registry access is not allowed while installing</title><link>http://arachnode.net/forums/thread/43867.aspx</link><pubDate>Fri, 05 Apr 2013 14:50:56 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:43867</guid><dc:creator>john12890</dc:creator><slash:comments>1</slash:comments><comments>http://arachnode.net/forums/thread/43867.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=42&amp;PostID=43867</wfw:commentRss><description>&lt;p&gt;Hi&lt;/p&gt;
&lt;p&gt;We downloaded the&amp;nbsp;DEMO_3.0 - VS2010 - SQL2008.zip file and&amp;nbsp;followed&amp;nbsp;the installation steps provided by arachnode, but&amp;nbsp;suddenly&amp;nbsp;we got an error while installing like &amp;quot;Requested registry access is not allowed&amp;quot; and &amp;quot;securityexception was unhandled by user code&amp;quot;. Can anyone help me on &amp;nbsp;this ?&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Regards&lt;/p&gt;
&lt;p&gt;John Adams&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Crawling of webpage with HTML form</title><link>http://arachnode.net/forums/thread/43860.aspx</link><pubDate>Tue, 02 Apr 2013 05:15:14 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:43860</guid><dc:creator>Abhishek Gahlout</dc:creator><slash:comments>3</slash:comments><comments>http://arachnode.net/forums/thread/43860.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=43860</wfw:commentRss><description>&lt;p&gt;I have a question which is as follows:&lt;/p&gt;
&lt;p&gt;
If I have a webpage with a html form. In that html form I have a dropdown and a submit button with other controls. Now after clicking the submit button , depending on the combobox selected value , some result page is shown with some information.
Now If i crawl that website , will I get the result in DB for each possible pages which would have been formed by all the possible values of the dropdown and submitting them by cHi Mike, I have a question which is as follows:
If I have a webpage with a html form. In that html form I have a dropdown and a submit button with other controls. Now after clicking the submit button , depending on the combobox selected value , some result page is shown with some information.
Now If i crawl that website , will I get the result in DB for each possible pages which would have been formed by all the possible values of the dropdown and submitting them by clicking submit button.licking submit button.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;eg: go to the following link :&amp;nbsp;&lt;a href="http://nces.ed.gov/globallocator/"&gt;http://nces.ed.gov/globallocator/&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;In this webpage I can get the result including school details zipcode wise or state-city wise if I successfully submit the html form . So is our Crawler able to fetch the resultant data (by automatically filling the form and submit that form ). Please help in this regard.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>start another program while crawling</title><link>http://arachnode.net/forums/thread/40156.aspx</link><pubDate>Sat, 15 Dec 2012 15:28:40 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:40156</guid><dc:creator>samarhtc</dc:creator><slash:comments>19</slash:comments><comments>http://arachnode.net/forums/thread/40156.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=40156</wfw:commentRss><description>&lt;p&gt;
&lt;p&gt;Hi I have a question regarding the crawling. I have made a console program that I want to start running right after the crawling has started. I have noticed that the crawling is first started in the section mentioned below&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;quot;Console, Program.cs file&amp;quot;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;catch (Exception exception) { System.Console.WriteLine(exception.StackTrace); }&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;//necessary for the Rendering functionality. if (_crawler != null) { //may be null if all configuration settings are not initialized in the database... while (!_hasCrawlCompleted &amp;amp;&amp;amp; _crawler.AreRenderersEnabled) { Application.DoEvents(); } }&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;System.Console.ReadLine();&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;//-----------------------------// The crawling begins now, that is after System.Console.ReadLine(); //-----------------------&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;//This is hit when the crawler is finished. if (_crawler != null &amp;amp;&amp;amp; _crawler.Engine != null) { _crawler.Engine.Stop(); }&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;What I&amp;#39;m wondering about is how can I start my program right after the crawling has begun. Shall I just call it after System.Console.ReadLine, this does not work, because at that time the crawler has finished crawling. Please note that for personal reasons I do not want to write a plugin. I just want to make a call to my (console) program right after the crawling has begun, and thereby have the two (crawler and my program) be running at the same time.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;//----------------------------------------------------------------------------------------------&lt;/p&gt;
&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title> "Function.dll" has to be signed.</title><link>http://arachnode.net/forums/thread/40155.aspx</link><pubDate>Sat, 15 Dec 2012 15:22:29 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:40155</guid><dc:creator>samarhtc</dc:creator><slash:comments>27</slash:comments><comments>http://arachnode.net/forums/thread/40155.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=40155</wfw:commentRss><description>&lt;p&gt;Hi I&amp;#39;m having a problem with the database. I&amp;#39;m not able to work with the arachnode.net database, since SQL Server is keep saying that &amp;quot;Function.dll&amp;quot; has to be signed. I&amp;#39;m not sure what this means, however I have gotten a signed certificate (myCA.cer and mySPC.pfx). My question is now, what do I do? Shall I sign the function.dll or function project with this certificate, if so then how is this done exactly.&lt;/p&gt;
&lt;p&gt;thanks&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Error : Type 'AbsoluteUri' already exists or you do not have permission to create it.</title><link>http://arachnode.net/forums/thread/387.aspx</link><pubDate>Tue, 03 Feb 2009 12:37:23 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:387</guid><dc:creator>jaydeep</dc:creator><slash:comments>24</slash:comments><comments>http://arachnode.net/forums/thread/387.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=387</wfw:commentRss><description>&lt;p&gt;Hello,&lt;/p&gt;
&lt;p&gt;I am new to arachnode.net and trying to make it work for me.&lt;/p&gt;
&lt;p&gt;After all set up and instalation while running i get deployement erroe which says &lt;strong&gt;&lt;em&gt;Type &amp;#39;AbsoluteUri&amp;#39; already exists or you do not have permission to create it.&lt;/em&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;Can you tell me the resolution?&lt;/p&gt;
&lt;p&gt;Thanks in advance.&lt;br /&gt;JD&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Site is not crawling completely</title><link>http://arachnode.net/forums/thread/38997.aspx</link><pubDate>Tue, 11 Dec 2012 05:53:11 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:38997</guid><dc:creator>InvestisDev</dc:creator><slash:comments>36</slash:comments><comments>http://arachnode.net/forums/thread/38997.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=38997</wfw:commentRss><description>&lt;p&gt;Hello Mike,&amp;nbsp;&lt;/p&gt;
&lt;p&gt;i was crawling &amp;quot;http://www.bodycote.com&amp;quot; site. I had set the depth level to 6 while crawling this site.&lt;/p&gt;
&lt;p&gt;if you will search the keyword &amp;quot;china&amp;quot; it will show some results but that result is not including &amp;quot;http://www.bodycote.com/en/contact-directory/asia/china.aspx&amp;quot; whose depth level is &amp;lt; 6.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;i had even checked the DB too. the pages on the level of china and below are not in the list of hyperlinks or webpages tables.&lt;/p&gt;
&lt;p&gt;Even i am downloading the pages so if checking the &amp;quot;DownloadWebPages&amp;quot; folder, there is only one page thats of &amp;quot;world.aspx&amp;quot;.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Please suggest some way to solve this.&lt;/p&gt;
&lt;p&gt;Thanks,&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Consistency and fault tolerance</title><link>http://arachnode.net/forums/thread/39000.aspx</link><pubDate>Tue, 11 Dec 2012 13:22:30 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:39000</guid><dc:creator>Shakir</dc:creator><slash:comments>19</slash:comments><comments>http://arachnode.net/forums/thread/39000.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=39000</wfw:commentRss><description>&lt;p&gt;Hello dear&lt;/p&gt;
&lt;p&gt;How do you implement Replication consistency and Fault tolerance in this lovely and massive crawler ?&lt;/p&gt;
&lt;p&gt;Thank you in advance&lt;/p&gt;
&lt;p&gt;&lt;img src="http://arachnode.net/emoticons/emotion-1.gif" alt="Smile" /&gt;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>AN.Next configuration</title><link>http://arachnode.net/forums/thread/38993.aspx</link><pubDate>Mon, 10 Dec 2012 19:19:36 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:38993</guid><dc:creator>koolworld</dc:creator><slash:comments>52</slash:comments><comments>http://arachnode.net/forums/thread/38993.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=38993</wfw:commentRss><description>&lt;p&gt;Hi Mike,&lt;/p&gt;
&lt;p&gt;We&amp;#39;re looking to use AN.Next over the main Arachnode due to it being simply to learn and use. Can i just confirm&amp;nbsp; the following:&lt;/p&gt;
&lt;p&gt;1. Can it use SQL rather than CE&lt;/p&gt;
&lt;p&gt;2. Process files such as MS Office and PDF files&lt;/p&gt;
&lt;p&gt;3. Use lucene to create the index and add to the lucene index fields to suit our situation (i know we created our own Lucene index class which inherited from ManageLuceneDotNetIndexesso we could have our own index fields - does AN.Next use these same plugins?)&lt;/p&gt;
&lt;p&gt;4. Are the configuration settings taken from the database? There seems to be a configuration file passed to the crawler currently.&lt;/p&gt;
&lt;p&gt;5. Can you confirm the minimum requirements for the database objects so that we can strip out any tables which aren required?&lt;/p&gt;
&lt;p&gt;Sorry about all the questions!&lt;/p&gt;
&lt;p&gt;Thanks&lt;br /&gt;Dan&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>IndexSearcher not releasing file</title><link>http://arachnode.net/forums/thread/34972.aspx</link><pubDate>Wed, 21 Nov 2012 06:28:44 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:34972</guid><dc:creator>InvestisDev</dc:creator><slash:comments>46</slash:comments><comments>http://arachnode.net/forums/thread/34972.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=34972</wfw:commentRss><description>&lt;p&gt;Hello Mike,&lt;/p&gt;
&lt;p&gt;i had created one&amp;nbsp;separate&amp;nbsp;application which will have a list of URLS to crawl. Sequentially it will crawl all the sites and Copies index files into separate folders from were service.asmx will access the index files.&lt;/p&gt;
&lt;p&gt;Now for example there is one folder &amp;quot;www.abc.com&amp;quot; having all the index files and some one is accessing the index file through the service.asmx. At this time if i will remove all the index files to copy the new one,&amp;nbsp;it says _0.fdt file is in use. &lt;/p&gt;
&lt;p&gt;Can you suggest a way to release the resource and remove the files to override the new index files.&lt;/p&gt;
&lt;p&gt;Note: can i use Global.IndexSearcher.Close() to release the resource. [Please explain how can i use it]&lt;/p&gt;
&lt;p&gt;Thanks,&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>deploy to production server</title><link>http://arachnode.net/forums/thread/38037.aspx</link><pubDate>Fri, 30 Nov 2012 15:47:57 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:38037</guid><dc:creator>samarhtc</dc:creator><slash:comments>14</slash:comments><comments>http://arachnode.net/forums/thread/38037.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=38037</wfw:commentRss><description>&lt;p&gt;
&lt;p&gt;Hi I have arachnode.net running on a development server. I would like to&amp;nbsp;deploy this solution to another server (production server).&amp;nbsp;There will not be any development on the production server. When I right-click the solution and choose deploy nothing happens, so it seems. I was expecting that a dialog box would popup and ask me which folder I want to deploy to, I would then choose a folder and afterwards copy paste the files from development server to production server.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Is there something I have misunderstood?&lt;/p&gt;
&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>dbo.convertsource function does not exists anymore, how to convert source column in webpages tabel</title><link>http://arachnode.net/forums/thread/34968.aspx</link><pubDate>Tue, 20 Nov 2012 14:52:48 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:34968</guid><dc:creator>samarhtc</dc:creator><slash:comments>53</slash:comments><comments>http://arachnode.net/forums/thread/34968.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=34968</wfw:commentRss><description>&lt;p&gt;Hi&amp;nbsp;&lt;/p&gt;
&lt;p&gt;I would like to convert the Source column inside webpages table into text. In a similar post the answer is to use the function dbo.convertsource, but this function does not exits anymore in the arachnode.net database or rather its nowhere to find. I have used this to find it :&amp;nbsp;&lt;/p&gt;
&lt;p&gt;SELECT [TEXT] FROM SYS.SYSCOMMENTS &amp;nbsp;WHERE [TEXT] LIKE &amp;#39;%ConvertSource%&amp;#39;&lt;/p&gt;
&lt;p&gt;I have a &amp;quot;the newest version&amp;quot; that is a paid version of arachnode.net from sommer 2012&lt;/p&gt;
&lt;p&gt;So what can I do now?&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>disable encoding of source column within webpages table</title><link>http://arachnode.net/forums/thread/34973.aspx</link><pubDate>Wed, 21 Nov 2012 12:21:37 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:34973</guid><dc:creator>samarhtc</dc:creator><slash:comments>31</slash:comments><comments>http://arachnode.net/forums/thread/34973.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=34973</wfw:commentRss><description>&lt;p&gt;Hi&amp;nbsp;&lt;/p&gt;
&lt;p&gt;I would like to disable the encoding of source column within the webpages table. &lt;/p&gt;
&lt;p&gt;What I want instead is to have the crawled data be inserted into the Source column as text (nvarchar). &amp;nbsp;&lt;/p&gt;
&lt;p&gt;Where do I do this? (which solutions, which method?) Another thing is that I have a paid version of arachnode.net from this summer, however I am not able to debug into the SiteCrawler project. I am able to set the Console as start up project and start debugging from here, but as soon as I hit the Crawl method (Arachnode.SiteCrawler.Crawler.Crawl(Arachnode.SiteCrawler.Value.CrawlRequest)) I cannot continue into this with the debugger. Why not, this is a paid version.&lt;/p&gt;
&lt;p&gt;P.S. I&amp;#39;m running Visual Studio 2010 Ultimate, SQL Server 2008 R2, Windows 2008, IIS 7 and I have already tried deleting all break points and starting a new.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Thanks.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>debugging arachnode.net solution</title><link>http://arachnode.net/forums/thread/28682.aspx</link><pubDate>Thu, 18 Oct 2012 13:11:07 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:28682</guid><dc:creator>samarhtc</dc:creator><slash:comments>54</slash:comments><comments>http://arachnode.net/forums/thread/28682.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=24&amp;PostID=28682</wfw:commentRss><description>&lt;p&gt;Hi&lt;/p&gt;
&lt;p&gt;Our company have bought a version of Arachnode.net I am currently trying to modify its behaviour.&amp;nbsp;However I cannot seem to debug the solution that is my break points are never hit. I have set some break points in different projects, I then run (F5) the Console project and crawl a site, the crawling is done but the break points are never hit. Or rather the break point are hit from the DataAccess project, the ArachnodeDAO.cs file, but I cannot get it to hit anything within the SiteCrawler project&lt;/p&gt;
&lt;p&gt;I have also tried to set a break point right at the beginning of the crawler that is inside program.cs inside the console project and then F10/F11 all the way, but still no result its like the program flow never goes into other projects.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;The visual studio is set on debug mode. &amp;nbsp;&lt;/p&gt;
&lt;p&gt;Am I missing something here.&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Making the crawler work faster</title><link>http://arachnode.net/forums/thread/34976.aspx</link><pubDate>Wed, 21 Nov 2012 16:12:56 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:34976</guid><dc:creator>samarhtc</dc:creator><slash:comments>26</slash:comments><comments>http://arachnode.net/forums/thread/34976.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=43&amp;PostID=34976</wfw:commentRss><description>&lt;p&gt;Hi&lt;/p&gt;
&lt;p&gt;I&amp;#39;m in a situation where I have to crawl some very large websites. The resulting db is apx 120 gb. This is actually ok, but the time it takes to crawl one website is way to long. I&amp;#39;m crawling with 100 thread, but still it takes days.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;So I was wondering how to minimize the crawl time. What I need from the crawling is the webpages that is raw data only meaning html code only, no images, or anything else.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;I was thinking if I somehow disable the encoding of the Source column within the webpages table, maybe this would speed up the process, since now AN does not have to encode/decode the webpages it simply insert the html inside the source column (or a new column next to it source_2!) in nvarchar format.&lt;/p&gt;
&lt;p&gt;what do you think, any suggestion would be&amp;nbsp;appreciated. &lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>crawler not crawling</title><link>http://arachnode.net/forums/thread/24431.aspx</link><pubDate>Tue, 17 Jul 2012 22:56:04 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:24431</guid><dc:creator>bmchunu</dc:creator><slash:comments>9</slash:comments><comments>http://arachnode.net/forums/thread/24431.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=42&amp;PostID=24431</wfw:commentRss><description>&lt;p&gt;(Please visit the site to view this media)&lt;/p&gt;
&lt;p&gt;(Please visit the site to view this media)The crawler is not crawling&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Settings below and logs attached. Not sure what else I can send through. I had the demo version working fine, not sure what&amp;#39;s wrong here!&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Key&lt;span&gt;	&lt;/span&gt;Value&lt;/p&gt;
&lt;p&gt;AssignCrawlRequestPrioritiesForFiles&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;AssignCrawlRequestPrioritiesForHyperLinks&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;AssignCrawlRequestPrioritiesForImages&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;AssignCrawlRequestPrioritiesForWebPages&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;AssignEmailAddressDiscoveries&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;AssignFileAndImageDiscoveries&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;AssignHyperLinkDiscoveries&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;ClassifyAbsoluteUris&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;ConsoleOutputLogsDirectory&lt;span&gt;	&lt;/span&gt;C:\Users\user\Downloads\arc\arc1\Console\bin\Debug\ConsoleOutputLogs&lt;/p&gt;
&lt;p&gt;CrawlRequestTimeoutInMinutes&lt;span&gt;	&lt;/span&gt;1&lt;/p&gt;
&lt;p&gt;CreateCrawlRequestsFromDatabaseCrawlRequests&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;CreateCrawlRequestsFromDatabaseFiles&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;CreateCrawlRequestsFromDatabaseHyperLinks&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;CreateCrawlRequestsFromDatabaseImages&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;CreateCrawlRequestsFromDatabaseWebPages&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;DesiredMaximumMemoryUsageInMegabytes&lt;span&gt;	&lt;/span&gt;1024&lt;/p&gt;
&lt;p&gt;DownloadedFilesDirectory&lt;span&gt;	&lt;/span&gt;C:\Users\user\Downloads\arc\arc1\Console\bin\Debug\DownloadedFiles&lt;/p&gt;
&lt;p&gt;DownloadedImagesDirectory&lt;span&gt;	&lt;/span&gt;C:\Users\user\Downloads\arc\arc1\Console\bin\Debug\DownloadedImages&lt;/p&gt;
&lt;p&gt;DownloadedWebPagesDirectory&lt;span&gt;	&lt;/span&gt;C:\Users\user\Downloads\arc\arc1\Console\bin\Debug\DownloadedWebPages&lt;/p&gt;
&lt;p&gt;EnableConsoleOutput&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;ExtractFileMetaData&lt;span&gt;	&lt;/span&gt;false&lt;/p&gt;
&lt;p&gt;ExtractImageMetaData&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;ExtractWebPageMetaData&lt;span&gt;	&lt;/span&gt;false&lt;/p&gt;
&lt;p&gt;HttpWebRequestRetries&lt;span&gt;	&lt;/span&gt;5&lt;/p&gt;
&lt;p&gt;InsertDisallowedAbsoluteUriDiscoveries&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertDisallowedAbsoluteUris&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertEmailAddressDiscoveries&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertEmailAddresses&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertExceptions&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertFileDiscoveries&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertFileMetaData&lt;span&gt;	&lt;/span&gt;false&lt;/p&gt;
&lt;p&gt;InsertFiles&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertFileSource&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertHyperLinkDiscoveries&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertHyperLinks&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertImageDiscoveries&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertImageMetaData&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertImages&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertImageSource&lt;span&gt;	&lt;/span&gt;false&lt;/p&gt;
&lt;p&gt;InsertWebPageMetaData&lt;span&gt;	&lt;/span&gt;false&lt;/p&gt;
&lt;p&gt;InsertWebPages&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;InsertWebPageSource&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;MaximumNumberOfCrawlRequestsToCreatePerBatch&lt;span&gt;	&lt;/span&gt;1000&lt;/p&gt;
&lt;p&gt;MaximumNumberOfCrawlThreads&lt;span&gt;	&lt;/span&gt;10&lt;/p&gt;
&lt;p&gt;MaximumNumberOfHostsAndPrioritiesToSelect&lt;span&gt;	&lt;/span&gt;10000&lt;/p&gt;
&lt;p&gt;OutputConsoleToLogs&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;OutputStatistics&lt;span&gt;	&lt;/span&gt;false&lt;/p&gt;
&lt;p&gt;SaveDiscoveredFilesToDisk&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;SaveDiscoveredImagesToDisk&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;SaveDiscoveredWebPagesToDisk&lt;span&gt;	&lt;/span&gt;true&lt;/p&gt;
&lt;p&gt;SqlCommandTimeoutInMinutes&lt;span&gt;	&lt;/span&gt;60&lt;/p&gt;
&lt;p&gt;UserAgent&lt;span&gt;	&lt;/span&gt;http://arachnode.net 1.4&lt;/p&gt;
&lt;p&gt;VerboseOutput&lt;span&gt;	&lt;/span&gt;false&lt;/p&gt;
&lt;p&gt;CacheTimeoutInMinutes&lt;span&gt;	&lt;/span&gt;15&lt;/p&gt;
&lt;p&gt;CreateCrawlRequestsForMissingFilesAndImages&lt;span&gt;	&lt;/span&gt;false&lt;/p&gt;
&lt;p&gt;DownloadedFilesVirtualDirectory&lt;span&gt;	&lt;/span&gt;\DownloadedFiles&lt;/p&gt;
&lt;p&gt;DownloadedImagesVirtualDirectory&lt;span&gt;	&lt;/span&gt;\DownloadedImages&lt;/p&gt;
&lt;p&gt;LuceneDotNetIndexDirectory&lt;span&gt;	&lt;/span&gt;C:\Users\user\Downloads\arc\arc1\Console\bin\Debug\LuceneDotNetIndex&lt;/p&gt;
&lt;p&gt;MaximumNumberOfDocumentsToReturnPerSearch&lt;span&gt;	&lt;/span&gt;200&lt;/p&gt;
&lt;p&gt;MaximumPageTitleLength&lt;span&gt;	&lt;/span&gt;64&lt;/p&gt;
&lt;p&gt;PageSize&lt;span&gt;	&lt;/span&gt;10&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>About the Calculation of "relevancyscore" field</title><link>http://arachnode.net/forums/thread/19220.aspx</link><pubDate>Fri, 27 Apr 2012 05:22:45 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:19220</guid><dc:creator>InvestisDev</dc:creator><slash:comments>37</slash:comments><comments>http://arachnode.net/forums/thread/19220.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=19220</wfw:commentRss><description>&lt;p&gt;Hello Mike,&lt;/p&gt;
&lt;p&gt;I want to know the logic of &amp;nbsp;the field &amp;quot;relevancyscore&amp;quot; (score) value, how the scoring of the records can be calculated.&lt;/p&gt;
&lt;p&gt;Please explain it with an example like if i have 10 results returned how the value of &amp;quot;relevancyscore&amp;quot; will be calculated.&lt;/p&gt;
&lt;p&gt;Thanks,&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item></channel></rss>