arachnode.net
An Open Source C# web crawler with Lucene.NET search using SQL Server 2008/2012/2014/2016/CE An Open Source C# web crawler with Lucene.NET search using MongoDB/RavenDB/Hadoop

Completely Open Source @ GitHub

Does arachnode.net scale? | Download the latest release

Downloads

Name
Link
Edit Profile
Balancing volume, quality and freshness in web crawling
Baeza-Yates, R. and Castillo, C. (2002). Balancing volume, quality and freshness in web crawling...
Home
Image
Search
Design and implementation of a distributed crawler and filtering...
Zeinalipour-Yazti, D. and Dikaiakos, M. D. (2002). Design and implementation of a distributed crawler...
Do your worst to make the best: Paradoxical effects in pagerank...
Boldi, P., Santini, M., and Vigna, S. (2004b). Do your worst to make the best: Paradoxical effects...
Crawling a Country: Better Strategies than Breadth-First...
Baeza-Yates, R., Castillo, C., Marin, M. and Rodriguez, A. (2005). Crawling a Country: Better Strategies...
Breadth-first crawling yields high-quality pages
Marc Najork and Janet L. Wiener. Breadth-first crawling yields high-quality pages. In Proceedings...
Modeling and managing content changes in text databases
Ipeirotis, P., Ntoulas, A., Cho, J., Gravano, L. (2005) Modeling and managing content changes in...
Synchronizing a database to improve freshness
Cho, J. and Garcia-Molina, H. (2000). Synchronizing a database to improve freshness. In Proceedings...
Blue Tag
Add Comment
Users Comments
Page 2 of 2 (35 items) < Previous 1 2
An Open Source C# web crawler with Lucene.NET search using SQL 2008/2012/CE

copyright 2004-2017, arachnode.net LLC