arachnode.net v2.0
An open source .NET web crawler written in C# using SQL 2005/2008

Browse Forum Posts by Tags

Showing related tags and posts for the Forums application. See all tags in the site
  • Help Needed

    Hi, I would greatly appreciate any help, I am new to AN, i managed to get it up and running, It was actually not bad, just 4 easy steps and it's up and running, now here is what i am trying to achieve, 1) I have a list of web sites approximately 200 sites (these are job sites, job aggregators, companies...
    Posted to General Questions (Forum) by vishal on 10-06-2009
  • Re: crawling specific web sites for tag words

    Hey - I'm back from my mini-vacation to the Washington coast. The site was disallowed as, by default, arachnode.net follows robots.txt rules. If you want to turn off the robots.txt behavior, check the 'CrawlRules' table, find the robots.txt rule and turn it off. No worries on being new to...
    Posted to General Questions (Forum) by arachnode.net on 08-06-2009
  • Re: no robots.txt

    arachnode.net will crawl the site if no robots.txt file is present. We have a build of 1.1 in review right now. The latest check in should be a viable check-in if you're running a release from Sourgeforge or Codeplex.
    Posted to General Questions (Forum) by arachnode.net on 03-16-2009
  • no robots.txt

    Hi, When there is no robots.txt, will arachnode crawl the page? I've got an exception 'no robots.txt' and there are no urls showing up. Thanks! Roel
    Posted to General Questions (Forum) by Roel on 03-16-2009
Page 1 of 1 (4 items)
An open source .NET web crawler written in C# using SQL 2005/2008

copyright 2009, arachnode.net LLC

Powered by Community Server (Non-Commercial Edition), by Telligent Systems