arachnode.net
An Open Source C# web crawler with Lucene.NET search using SQL Server 2008/2012/2014/2016/CE An Open Source C# web crawler with Lucene.NET search using MongoDB/RavenDB/Hadoop

Completely Open Source @ GitHub

Does arachnode.net scale? | Download the latest release

Browse Forum Posts by Tags

Showing related tags and posts for the arachnode.net group. See all tags in the site
  • Crawl Restriction and dbo.Domain table

    Hi There - I have two questions: 1. I am trying to figure out why the crawler is not populating the dbo.domain and dbo.domain_discoveries table. I suspects its one of the application settings and have not been able to figure out which one. 2. How can a crawl request be restricted to a specific URL and...
    Posted to General Questions (Forum) by rlink12 on Wed, Nov 13 2013
  • Operations: selecting and managing a collection of crawl requests

    Hi Mike and all, I'm evaluating AN for a client and my goal is to crawl a collection of URIs on a regular schedule. Essentially I will run AN as a service and schedule crawls for various sites by specifying those site URIs to AN. My understanding is that the table CrawlRequests is working memory...
    Posted to General Questions (Forum) by jamesy on Wed, Jul 6 2011
Page 1 of 1 (2 items)
An Open Source C# web crawler with Lucene.NET search using SQL 2008/2012/CE

copyright 2004-2017, arachnode.net LLC