arachnode.net
An open source .NET web crawler written in C# using SQL 2005/2008
IT Professionals & Windows Deployment Professionals: SmartDeploy Enterprise is the first hardware-independent imaging toolset that uses boot time driver-injection, simplifying deployment and easing distribution by reducing total image count. [LINK]

Browse Forum Posts by Tags

Showing related tags and posts for the Forums application. See all tags in the site
  • passing state into AN

    Good morning, I need to pass some data into AN for processing by a plugin. I could obviously write that data into the DB and pick it up in the CrawlAction later, but I'm already retrieving that data before calling into AN as a matter of necessity and I'd rather not be more redundant than I have...
    Posted to General Questions (Forum) by offbored on 10 Feb 2010
  • Re: Plugin help

    Templater is a piece of code that can look at a webpage and extract the 'meat' of the page - it can look at a blog site and tell you which xpath will select the main post, the titles, or looking at a forum site, which posts are the forum posts. It basically solves a tough problem in web scraping...
    Posted to General Questions (Forum) by arachnode.net on 2 Aug 2009
  • Re: Crawling several sites with 1.2 version

    There isn't an explicit tutorial - but these are the steps... 1.) Find one of the existing plugins. 'Anonymizer.cs' is the simplest and shortest. 2.) Create a new class using the name of your choice. 3.) Examine the 'CrawlActions' database table and follow the present pattern. That's...
    Posted to General Questions (Forum) by arachnode.net on 26 Jul 2009
  • Re: Crawling several sites with 1.2 version

    1.) You will need to write separate rules for each site, but one plugin will work. Else, how would the plugin know what information you want to pull? You can use UserDefinedFunctions.ExtractDomain or UserDefinedFunctions.ExtractHost to perform the filtering/switching. 2.) The easiest would be to Create...
    Posted to General Questions (Forum) by arachnode.net on 26 Jul 2009
  • Re: Anonymouse Crawling

    I added an Anonymizer plugin to the branch so you can see how this would be implemented. Don't forget to check out the DB too... (Branch is a branch, but quite viable...) This code is checked into the trunk now. Mike
    Posted to Feature Requests (Forum) by arachnode.net on 1 Jun 2009
Page 1 of 1 (5 items)
An open source .NET web crawler written in C# using SQL 2005/2008

copyright 2004-2010, arachnode.net LLC

Powered by Community Server (Non-Commercial Edition), by Telligent Systems