arachnode.net
An open source .NET web crawler written in C# using SQL 2005/2008

Searching pages for keywords


DataMan posted on 12 Mar 2010 6:00 PM

So I've been trying to figure out how to get AN to return only pages that have certain words on them. I would think there would be a table or a text file somewhere that you fill in and, voilà, only pages with those terms on them would be returned.

How would I accomplish that? I've been trying to figure out CrawlRules and how to write a plugin, but I can't seem to get anywhere.

Thanks

All Replies


You mean filter them, i.e. only allow pages into the system if they contain specific words?

Look at Source.cs in the SiteCrawler project.

-Mike
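
For anyone landing on this thread: the filtering idea above (keep a page only if it contains specific words) can be sketched independently of the crawler. The snippet below is a minimal illustration in Python, not arachnode.net code. The names `visible_text` and `page_matches` are hypothetical, and the logic would still need to be ported to C# and hooked in wherever the crawler hands you the downloaded page (the reply above suggests starting from Source.cs in the SiteCrawler project).

```python
import re
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def visible_text(html):
    """Return the text a reader would see, joined with single spaces."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def page_matches(html, keywords, require_all=False):
    """True if the page text contains any (or all) keywords as whole words.

    Matching is case-insensitive. The lookarounds stop "crawl" from
    matching inside "crawler" while still allowing terms such as "C#".
    """
    text = visible_text(html)
    hits = [
        re.search(r"(?<!\w)" + re.escape(word) + r"(?!\w)", text, re.IGNORECASE)
        is not None
        for word in keywords
    ]
    if not hits:
        return False  # an empty keyword list matches nothing
    return all(hits) if require_all else any(hits)
```

The keyword list could be loaded from a table or a text file, as the original question imagines. A crawler would then call something like `page_matches(page_html, keywords)` on each downloaded page and discard the page when it returns `False`.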



copyright 2004-2010, arachnode.net LLC
