arachnode.net
An Open Source C# web crawler with Lucene.NET search using SQL 2005/2008/CE
Does arachnode.net scale? | VS2008/2010/2012 & SQL2008/2012 | Download the latest release

Search

  • Regarding term/phrase extraction

    Hello We need term/phrase extraction functionality for which we tried using Lucene.Net v3.0.3 with our current Arachnode.net solution v2.6.0. Lucene.Net v3.0.3 contains ShingleFilter funcitonality which can be used for keyword/phrase extraction. My question here is, we had to make some changes in the Arachanode code wherever lucene.net is referenced
    Posted to General Questions (Forum) by InvestisDev on Tue, May 28 2013
  • Re: Getting IndexReader is closed error :(

    Hi Mike, Sorry for delay in reply. 1) Yes we have done code changes in global.asax and service.asmx which I will send you in mail. 2) We have multiple domains in our environment and all the search requests are responded by single service.asmx so when there are concurrently heavy requests, many times we get the error. Thanks
    Posted to General Questions (Forum) by InvestisDev on Wed, May 1 2013
  • Getting IndexReader is closed error :(

    Hello Mike, We are using the older service.asmx in which we have the Hits object and on making a loop off it getting field values. If we will hit this service.asmx multiple times frequently to hit this service at the same time (almost), it throws error that IndexReader is closed. The frequency of this error increased. Please suggest some way to fix
    Posted to General Questions (Forum) by InvestisDev on Thu, Apr 11 2013
  • Re: Is there way to get Phrase in Lucene terms?

    Yes mike this is a good option but having many limitations like this will support only English language apart from this it will not support.
    Posted to General Questions (Forum) by InvestisDev on Wed, Mar 20 2013
  • Re: Is there way to get Phrase in Lucene terms?

    Superb explanation .... Thanks for such detail description..... Just FYI , there is one issue while using this Term Extraction that msdn link ( http://msdn.microsoft.com/en-us/library/ms141809.aspx ) says " The transformation works only with English text and it uses its own English dictionary and linguistic information about English. " So
    Posted to General Questions (Forum) by InvestisDev on Sun, Mar 3 2013
  • Re: Is there way to get Phrase in Lucene terms?

    Hello Mike, I had crawled one site "http://www.hex-plc.co.uk" and from its index file using lucene API, got a list of 9000+ terms (keywords) and even using any of that keywords gives me the result as well. now from the "TermExtraction.dtsx" of "Integration" project from arachnode.net solution, i am getting 147 Terms only
    Posted to General Questions (Forum) by InvestisDev on Wed, Feb 27 2013
  • Is there way to get Phrase in Lucene terms?

    Hello Mike, From the software Luke 1.0.1 while reading the index file, it shows the Terms as shown in below screen (highlighter in red color) Here you can check there are only single words, is there any way to get a phrase in it i mean like the Terms is ="technical" or "support" in place is there any way to get a Term phrase like
    Posted to General Questions (Forum) by InvestisDev on Mon, Feb 25 2013
  • Re: Site is not crawling completely

    Hello Mike, i think this will solve the issue .... sorry but i will apply this solution soon as i am on leave ..... will update you soon once the testing finishes. Thanks for the quick reply. Thanks :)
    Posted to General Questions (Forum) by InvestisDev on Wed, Jan 9 2013
  • Re: Site is not crawling completely

    Here is the actual URL http://www.aamal.com.qa/ar-DZ/default.aspx http://www.aamal.com.qa/en/default.aspx
    Posted to General Questions (Forum) by InvestisDev on Fri, Jan 4 2013
  • Re: Site is not crawling completely

    Happy Christmas & Happy New Year, sorry but facing one issue that while crawling a site, it is not including .pdf files. DB is not having any entries of PDF file in hyper link tables. but if we check the page, its having more then 5 links of pdf file. i had used the option as below: wasTheCrawlRequestAddedForCrawling = _crawler.Crawl(new CrawlRequest
    Posted to General Questions (Forum) by InvestisDev on Wed, Jan 2 2013
Page 1 of 6 (58 items) 1 2 3 4 5 Next > ... Last ยป | More Search Options
An Open Source C# web crawler with Lucene.NET search using SQL 2005/2008/CE

copyright 2004-2013, arachnode.net LLC