arachnode.net
An Open Source C# web crawler with Lucene.NET search using SQL Server 2008/2012/2014/2016/CE An Open Source C# web crawler with Lucene.NET search using MongoDB/RavenDB/Hadoop

Completely Open Source @ GitHub

Does arachnode.net scale? | Download the latest release

BayesianClassifier example

rated by 0 users
Answered (Verified) This post has 1 verified answer | 1 Reply | 2 Followers

Top 200 Contributor
1 Posts
Marcin Gorzynski posted on Thu, Sep 3 2009 3:03 PM

Hi

Do you have an example of what to set up in

Class1ExemplarDirectory and Class2ExemplarDirectory

 

 

for your BayesianClassifier implementation. Let say that I would only be interested in crawling website that have Mardaona , Pele and Rambo in its content.

Thanks

Marcin

Answered (Verified) Verified Answer

Top 10 Contributor
1,905 Posts

Add text files or html files to each directory.

Say, add 1000 pages that have to do with Madonna's first album to the first directory and 1000 pages that have to do with Madonna's last album to the second directory.

-Mike

For best service when you require assistance:

  1. Check the DisallowedAbsoluteUris and Exceptions tables first.
  2. Cut and paste actual exceptions from the Exceptions table.
  3. Include screenshots.

Skype: arachnodedotnet

Page 1 of 1 (2 items) | RSS
An Open Source C# web crawler with Lucene.NET search using SQL 2008/2012/CE

copyright 2004-2017, arachnode.net LLC