arachnode.net
An Open Source C# web crawler with Lucene.NET search using SQL 2005/2008/CE

crawling continues


InvestisDev posted on Fri, Mar 2 2012 5:37 AM

Hello,

At the moment, crawling takes too long for a site that has about 145 hyperlinks and around 92 web pages.

Because it takes so long, I need to exclude images from the crawl for now.

One more thing: while crawling a site, after some time the record counts in the "Hyperlinks" and "Webpages" tables stop increasing, but the crawler keeps crawling the same URL multiple times. (I think it goes into a loop.)
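
For illustration, the expected behavior (each URL crawled only once, image links skipped) can be sketched roughly as below. This is a generic Python sketch, not arachnode.net's API or configuration: the function names, the extension list, the `get_links` callable, and the example.com URLs are all assumptions made for the example.

```python
from collections import deque
from urllib.parse import urldefrag, urlparse

# Assumed list of image extensions to skip (hypothetical, adjust as needed).
IMAGE_EXTENSIONS = (".png", ".jpg", ".jpeg", ".gif", ".bmp", ".ico", ".svg")


def normalize(url):
    """Drop the #fragment so the same page is not queued under two names."""
    return urldefrag(url).url


def is_image(url):
    """True if the URL path ends with one of the assumed image extensions."""
    return urlparse(url).path.lower().endswith(IMAGE_EXTENSIONS)


def crawl(start_url, get_links):
    """Breadth-first crawl that visits each URL at most once.

    get_links is a hypothetical callable mapping a URL to the list of
    URLs found on that page.
    """
    seen = {normalize(start_url)}
    queue = deque(seen)
    visited = []
    while queue:
        url = queue.popleft()
        visited.append(url)
        for link in get_links(url):
            link = normalize(link)
            # Skip anything already queued or crawled, and skip images.
            if link in seen or is_image(link):
                continue
            seen.add(link)
            queue.append(link)
    return visited


# A tiny made-up site whose pages link back to each other (a cycle).
site = {
    "http://example.com/": ["http://example.com/a", "http://example.com/logo.png"],
    "http://example.com/a": ["http://example.com/", "http://example.com/a#top"],
}
print(crawl("http://example.com/", lambda u: site.get(u, [])))
```

Without the `seen` set, the two pages above would keep re-queuing each other indefinitely, which matches the symptom of the same URL being crawled again while the table counts stay flat.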

 

Let me know if you need any further details.

Thanks,

Verified Answer

Verified by InvestisDev

Hi,

Thanks, I found the answer to this in one of your forum threads: http://arachnode.net/forums/p/1397/12839.aspx#12839

I turned on the flags as shown in the link above.

Thanks,


copyright 2004-2013, arachnode.net LLC