arachnode.net
An Open Source C# web crawler with Lucene.NET search using SQL Server 2008/2012/2014/2016/CE or MongoDB/RavenDB/Hadoop

Wildcard search fails with exception

Top 25 Contributor
23 Posts
victor posted on Tue, Feb 7 2017 1:24 PM

In the AN Web project, I'm getting a NotSupportedException when using a query with the wildcard *, even though wildcards are claimed to be supported. What could be the problem here?

All Replies

Top 10 Contributor
1,905 Posts

What is the exact query?

For best service when you require assistance:

  1. Check the DisallowedAbsoluteUris and Exceptions tables first.
  2. Cut and paste actual exceptions from the Exceptions table.
  3. Include screenshots.

Skype: arachnodedotnet

Top 25 Contributor
23 Posts

I type 'finl*nd' into the search input. In the debugger I see a Query object with 2 clauses: 'absoluteuri:finl*nd host:finl*nd text:finl*nd title:finl*nd' and 'discoverytype:webpage'. This Query object is passed to the Lucene .Search method.
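
To make the shape of that query easier to see, here is a minimal sketch in Python (not the actual AN Web code, which is C#) that reproduces the two clause strings from the debugger. The function names `expand_term` and `build_clauses` are hypothetical; only the field names and the `discoverytype:webpage` clause come from the output above.

```python
# Field names as they appear in the debugger output above.
FIELDS = ("absoluteuri", "host", "text", "title")


def expand_term(term, fields=FIELDS):
    """Repeat one user term across every searchable field (hypothetical helper)."""
    return " ".join(f"{field}:{term}" for field in fields)


def build_clauses(term, discovery_type="webpage"):
    """Return the two clause strings: the multi-field term and the type filter."""
    return [expand_term(term), f"discoverytype:{discovery_type}"]


if __name__ == "__main__":
    for clause in build_clauses("finl*nd"):
        print(clause)
```

The point is only that the wildcard term is copied unchanged into every field clause, so whatever fails on the wildcard would fail for all four fields at once.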

Top 10 Contributor
1,905 Posts

Try now - fix checked in - SVN.

Top 25 Contributor
23 Posts
Verified by arachnode.net

You mean you updated the SVN repository?

Top 25 Contributor
23 Posts

I see this after running svn update

The update is from 2015. Am I doing anything wrong?

Top 10 Contributor
1,905 Posts

Try now.

Top 25 Contributor
23 Posts

Thanks, Mike!! Working now.

You replace the wildcard with an empty string when getting the summary, which causes the summary to be empty if the wildcard is in the middle of the query. Is there a way to overcome this, maybe with a regex?
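
A rough sketch of the regex idea, in Python for brevity. This is not how arachnode.net or Lucene.NET builds summaries; `wildcard_to_regex` and `summarize` are hypothetical helpers, and the sketch assumes `*` stands for any run of word characters and `?` for exactly one. Stripping the `*` from 'finl*nd' gives 'finlnd', which never occurs in the page text, while the regex still finds 'Finland'.

```python
import re


def wildcard_to_regex(term):
    """Translate a wildcard term into a compiled, case-insensitive regex.

    Assumption: '*' matches any run of word characters, '?' exactly one.
    """
    parts = []
    for ch in term:
        if ch == "*":
            parts.append(r"\w*")
        elif ch == "?":
            parts.append(r"\w")
        else:
            parts.append(re.escape(ch))
    return re.compile(r"\b" + "".join(parts) + r"\b", re.IGNORECASE)


def summarize(text, term, context=30):
    """Return the first match of the wildcard term with some surrounding text.

    Returns an empty string when nothing in the text matches.
    """
    match = wildcard_to_regex(term).search(text)
    if match is None:
        return ""
    start = max(0, match.start() - context)
    end = min(len(text), match.end() + context)
    return text[start:end]


if __name__ == "__main__":
    page = "Helsinki is the capital of Finland, a Nordic country."
    print(summarize(page, "finl*nd", context=15))
```

Whether something like this can be plugged into the summary step depends on how the summary is produced inside Lucene.NET, which I have not checked.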

Top 10 Contributor
1,905 Posts

Not sure - that would be an "under the hood" thing with lucene.net.

Get latest for the space change.

Top 25 Contributor
23 Posts

Now it's working; however, the summaries differ quite a bit. Consider this example:

Maybe you know how to improve this? Otherwise I can look deeper into it later.

Top 25 Contributor
23 Posts
victor replied on Wed, Feb 15 2017 6:11 PM

Any updates on summary stuff here?

Top 10 Contributor
1,905 Posts

The summaries come from Lucene.net, beyond the borders of AN.

I would expect the summaries to be different as the queries are different - beyond this, how exactly Lucene.net summarizes, I do not know.

That said, I did look at updating Lucene.net to see if there are any improvements - 86 breaking changes to sort out. I'll be back... (will be a few days)

Let me know if you see an obvious solution.

Thanks,
Mike

Top 10 Contributor
1,905 Posts

I took a look at the latest - no discernible difference.

Thanks.

Top 25 Contributor
23 Posts
victor replied on Wed, Feb 22 2017 1:23 PM

That is true, but in the second case the summary is poorly represented compared to the first.

copyright 2004-2017, arachnode.net LLC