-
Hello, we need term/phrase extraction functionality, for which we tried using Lucene.Net v3.0.3 with our current Arachnode.net solution v2.6.0. Lucene.Net v3.0.3 contains ShingleFilter functionality, which can be used for keyword/phrase extraction. My question here is: we had to make some changes in the Arachnode.net code wherever Lucene.Net is referenced
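For anyone unfamiliar with shingles: a shingle is a word n-gram built from adjacent tokens, which is why a shingle filter can surface phrases rather than single words. The snippet below is only a language-neutral sketch of that idea in plain Python; it is not the Lucene.Net ShingleFilter API, and the function name `shingles` and its `min_size`/`max_size` parameters are made up for illustration.

```python
def shingles(tokens, min_size=2, max_size=3):
    """Return word n-grams (shingles) of min_size..max_size adjacent tokens.

    Hypothetical helper for illustration only; not part of Lucene.Net.
    """
    out = []
    for start in range(len(tokens)):
        for size in range(min_size, max_size + 1):
            end = start + size
            if end > len(tokens):
                break  # not enough tokens left for this or any larger size
            out.append(" ".join(tokens[start:end]))
    return out


if __name__ == "__main__":
    # Single-word terms become candidate phrases once adjacent tokens are joined.
    print(shingles("technical support team".split()))
```

With the three tokens above, the candidates are the two-word and three-word runs of adjacent words; a real analyzer chain would also handle stop words and token positions, which this sketch ignores.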
-
Hi Mike, sorry for the delay in replying. 1) Yes, we have made code changes in global.asax and service.asmx, which I will send you by email. 2) We have multiple domains in our environment, and all search requests are handled by a single service.asmx, so when there are heavy concurrent requests we often get the error. Thanks
-
Hello Mike, we are using the older service.asmx, in which we have the Hits object and loop over it to get field values. If this service.asmx is hit multiple times at (almost) the same time, it throws an error saying that the IndexReader is closed. The frequency of this error has increased. Please suggest some way to fix
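One common cause of this kind of error (an assumption here, not a confirmed diagnosis of our service.asmx) is that one request closes a shared reader while another request is still iterating its results. A usual remedy is reference counting: the reader is closed only when the last user releases it. The sketch below shows the pattern in plain Python; `SharedReader` and `FakeReader` are hypothetical names for illustration and are not Lucene.Net types.

```python
import threading


class FakeReader:
    """Stand-in for an index reader; only tracks whether close() was called."""

    def __init__(self):
        self.is_closed = False

    def close(self):
        self.is_closed = True


class SharedReader:
    """Reference-counted wrapper: the reader closes when the last user releases it."""

    def __init__(self, reader):
        self._reader = reader
        self._refs = 1  # the owner (e.g. the application) holds the first reference
        self._lock = threading.Lock()

    def acquire(self):
        """Called at the start of a request; returns the underlying reader."""
        with self._lock:
            if self._refs == 0:
                raise RuntimeError("reader already closed")
            self._refs += 1
            return self._reader

    def release(self):
        """Called when a request (or the owner) is done with the reader."""
        with self._lock:
            if self._refs == 0:
                raise RuntimeError("release() called more times than acquire()")
            self._refs -= 1
            if self._refs == 0:
                self._reader.close()


if __name__ == "__main__":
    shared = SharedReader(FakeReader())
    reader = shared.acquire()   # a request starts looping over its hits
    shared.release()            # the owner drops its reference (e.g. index was refreshed)
    print(reader.is_closed)     # the in-flight request can still read
    shared.release()            # the request finishes; now the reader closes
    print(reader.is_closed)
```

Each request would wrap its loop in acquire/release (ideally in a try/finally), so a refresh elsewhere cannot close the reader underneath it.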
-
Yes Mike, this is a good option, but it has many limitations; for example, it supports only the English language and no other.
-
Superb explanation. Thanks for such a detailed description. Just FYI, there is one issue with using this Term Extraction: the MSDN link ( http://msdn.microsoft.com/en-us/library/ms141809.aspx ) says "The transformation works only with English text and it uses its own English dictionary and linguistic information about English." So
-
Hello Mike, I crawled one site, "http://www.hex-plc.co.uk", and from its index file, using the Lucene API, got a list of 9000+ terms (keywords); searching with any of those keywords returns results as well. Now, from "TermExtraction.dtsx" in the "Integration" project of the arachnode.net solution, I am getting only 147 terms
-
Hello Mike, when reading the index file with Luke 1.0.1, it shows the terms as in the screen below (highlighted in red). Here you can see there are only single words. Is there any way to get a phrase instead? I mean, the term is "technical" or "support"; is there any way to get a term phrase like
-
Hello Mike, I think this will solve the issue. Sorry, but I will apply this solution a little later as I am on leave; I will update you once the testing finishes. Thanks for the quick reply. Thanks :)
-
Here are the actual URLs: http://www.aamal.com.qa/ar-DZ/default.aspx http://www.aamal.com.qa/en/default.aspx
-
Happy Christmas & Happy New Year. Sorry, but I am facing one issue: while crawling a site, the crawler is not including .pdf files. The DB does not have any entries for PDF files in the hyperlink tables, but if we check the page, it has more than 5 links to PDF files. I used the option below: wasTheCrawlRequestAddedForCrawling = _crawler.Crawl(new CrawlRequest