arachnode.net
An Open Source C# web crawler with Lucene.NET search using SQL Server 2008/2012/2014/2016/CE
An Open Source C# web crawler with Lucene.NET search using MongoDB/RavenDB/Hadoop

Completely Open Source @ GitHub


Browse Site by Tags

Showing related tags and posts across the entire site.
  • Image alt tag information

    Hello, I was playing with the code, and I was more interested in crawling images and gathering as much information as possible for each image. One of the tags that provides valuable information about an image is the alt tag. Is there a flag to extract that information during the discovery process? if...
    Posted to General Questions by jyotish on Sat, Feb 18 2012
  • Re: Plugin help

    Templater is a piece of code that can look at a webpage and extract the 'meat' of the page. It can look at a blog site and tell you which XPath will select the main post or the titles or, looking at a forum site, which posts are the forum posts. It basically solves a tough problem in web scraping...
    Posted to General Questions by arachnode.net on Sun, Aug 2 2009
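
The first post above asks about collecting image alt text during discovery. The excerpt does not show whether arachnode.net exposes a flag for this, so the following is only a generic illustration of the extraction step, written in Python with the standard library rather than against arachnode.net's own API. The names ImageAltCollector and extract_image_alts are hypothetical and chosen for this sketch.

```python
from html.parser import HTMLParser


class ImageAltCollector(HTMLParser):
    """Collects a (src, alt) pair for every <img> tag in the parsed HTML.

    Hypothetical helper for illustration; not part of arachnode.net.
    """

    def __init__(self):
        super().__init__()
        self.images = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names before calling this.
        if tag != "img":
            return
        attributes = dict(attrs)
        # An image may have no alt attribute at all; record None in that case.
        self.images.append((attributes.get("src"), attributes.get("alt")))


def extract_image_alts(html):
    """Return a list of (src, alt) tuples found in an HTML string."""
    collector = ImageAltCollector()
    collector.feed(html)
    collector.close()
    return collector.images


if __name__ == "__main__":
    page = '<p><img src="a.png" alt="A spider"><img src="b.png" /></p>'
    for src, alt in extract_image_alts(page):
        print(src, alt)
```

A crawler plugin could run a step like this on each downloaded page and store the alt text alongside the image URL; how that would be wired into arachnode.net's discovery process is not covered by the excerpt.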

copyright 2004-2017, arachnode.net LLC