-
You can modify that table. A better way to filter your crawl is to add your DisallowedDirectories as 'Words' in the 'cfg.DisallowedWords' database table and ensure that the CrawlRule AbsoluteUri.cs is enabled. You will know if a CrawlAction, CrawlRule or...
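As a rough illustration of word-based filtering, here is a minimal, language-neutral sketch in Python. It is not arachnode.net code: the `DISALLOWED_WORDS` set stands in for rows of 'cfg.DisallowedWords', the function name `is_disallowed` is hypothetical, and matching whole path segments is an assumption about how such a rule might treat directories.

```python
from urllib.parse import urlsplit

# Hypothetical stand-in for the rows of the cfg.DisallowedWords table.
DISALLOWED_WORDS = {"private", "archive"}


def is_disallowed(absolute_uri, disallowed_words=DISALLOWED_WORDS):
    """Return True if any path segment of the URI equals a disallowed word.

    Assumption: directories are compared as whole, case-insensitive
    path segments. The real CrawlRule may match differently.
    """
    path = urlsplit(absolute_uri).path.lower()
    segments = [segment for segment in path.split("/") if segment]
    return any(segment in disallowed_words for segment in segments)
```

A crawler loop would call `is_disallowed(uri)` before queueing a request and skip the URI when it returns True.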
-
Looks like you have been registered for quite some time, mopiola. Which version of AN are you using? I wouldn't add CRs to the CR table, but rather add them as shown in Program.cs. This table is used by the Cache/Engine. Read this post about restricting a crawl: http://arachnode.net/forums/t...
-
Hmmm... I will double-check the cookie handling and get back to you. Question though: Are you using the latest and greatest from SVN? To make exceptions for named anchors, check AbsoluteUri.cs. You can use one of the existing rules (or create a new one which is executed before AbsoluteUri.cs) and modify...
-
Address has been merged with AbsoluteUri. The AbsoluteUri CrawlRule references the following tables: If you wanted to crawl msn.com, delete all rows in the tables above, insert msn.com into DisallowedDomains with IsDisallowed=true, and set the following setting in CrawlRules.config: negateIsDisallowedForAbsoluteUri...
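The idea of negating a disallow list so that it acts as an allow list can be sketched as follows. This is a hedged, language-neutral Python illustration, not the AbsoluteUri.cs implementation: the function `is_crawl_allowed`, its parameters, and the subdomain matching are all assumptions, and the value the post assigns to negateIsDisallowedForAbsoluteUri is cut off in the source, so the sketch simply takes the flag as a boolean argument.

```python
from urllib.parse import urlsplit


def is_crawl_allowed(absolute_uri, disallowed_domains, negate_is_disallowed):
    """Decide whether a URI may be crawled.

    disallowed_domains: hypothetical stand-in for DisallowedDomains rows
    that have IsDisallowed=true.
    negate_is_disallowed: hypothetical boolean mirroring the
    negateIsDisallowedForAbsoluteUri setting.
    """
    host = (urlsplit(absolute_uri).hostname or "").lower()
    # Assumption: a listed domain also covers its subdomains.
    listed = any(
        host == domain or host.endswith("." + domain)
        for domain in disallowed_domains
    )
    # Without negation, listed domains are blocked.
    # With negation, only listed domains are allowed.
    return listed if negate_is_disallowed else not listed
```

With `{"msn.com"}` as the only listed domain and negation on, `is_crawl_allowed("http://www.msn.com/news", {"msn.com"}, True)` allows the request while any other host is rejected.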