While working on fihristorg/fihrist-mss#28 I realized that, although stopword files exist in the standard Solr installation (for English, Arabic, and other languages), they haven't been enabled in the full-text search query parser. So, for example, searching for History of the Afghans (without quotes) returns 19,266 hits, whereas History Afghans returns 795. That is still a lot, because without quotes Solr does an implicit OR, but far fewer than when "of" and "the" are treated as search terms.
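To illustrate why the hit count balloons, here is a toy sketch in Python. It is not Solr's actual analysis chain: the stopword list, the sample titles, and the helper names `tokenize` and `or_search` are all made up for illustration. It only shows how an implicit OR matches almost everything when "of" and "the" count as query terms.

```python
# Toy illustration only: this is NOT how Solr is implemented.
# The stopword list and sample titles below are hypothetical.
STOPWORDS = frozenset({"of", "the", "a", "an", "and", "for", "in"})


def tokenize(text, stopwords=frozenset()):
    """Lowercase, split on whitespace, and drop any stopwords."""
    return [t for t in text.lower().split() if t not in stopwords]


def or_search(query, docs, stopwords=frozenset()):
    """Return docs sharing at least one term with the query (implicit OR)."""
    terms = set(tokenize(query, stopwords))
    return [d for d in docs if terms & set(tokenize(d, stopwords))]


DOCS = [
    "History of the Afghans",
    "The book of kings",
    "Commentary on the Quran",
    "A history of Persia",
]

# Without stopword filtering, "of" and "the" match nearly every title.
without_filter = or_search("History of the Afghans", DOCS)

# With filtering, only "history" and "afghans" are used as query terms.
with_filter = or_search("History of the Afghans", DOCS, STOPWORDS)

print(len(without_filter), len(with_filter))
```

In this toy data set the unfiltered query matches every title, while the filtered one matches only the two that contain "history" or "afghans".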
Most people seriously looking for things would quickly figure out they need to use either quotes or AND, but I think this is worth implementing for more casual users.
@holfordm: This change would be across all catalogues. Do you have any objections/comments? Solr doesn't have a built-in stopwords list for Latin, but if you want we could import one, such as this.
Sounds like a good idea. One question - would this affect a search like "History of the Afghans" (with quotes), where you were trying to match an exact phrase?
I think that, if it is configured correctly, a phrase search like "History of the Afghans" should still find the same matches. It might find a few more than before, if there are any documents containing something like "History for the Afghans". But we'll make sure to test it.
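A rough sketch of why that would happen, assuming the stop filter drops the stopwords but keeps the position gaps they leave behind (which is my understanding of the default behaviour, and something we should confirm when testing). The stopword list and the helpers `positions` and `phrase_match` are hypothetical and only model the idea in Python:

```python
# Toy model of phrase matching with position gaps; not Solr code.
# Assumption: removed stopwords still "occupy" their position.
STOPWORDS = frozenset({"of", "the", "for", "a"})  # hypothetical list


def positions(text):
    """Return (term, position) pairs, skipping stopwords but keeping gaps."""
    words = text.lower().split()
    return [(t, i) for i, t in enumerate(words) if t not in STOPWORDS]


def phrase_match(phrase, doc):
    """True if the doc has the phrase's terms at the same relative positions."""
    query = positions(phrase)
    doc_terms = positions(doc)
    if not query:
        return False
    first_term, first_pos = query[0]
    for term, pos in doc_terms:
        if term != first_term:
            continue
        # Shift the query so its first term lines up with this occurrence.
        offset = pos - first_pos
        if all((t, p + offset) in doc_terms for t, p in query):
            return True
    return False


print(phrase_match("History of the Afghans", "History of the Afghans"))
print(phrase_match("History of the Afghans", "A short History for the Afghans"))
print(phrase_match("History of the Afghans", "History Afghans"))
```

Under this assumption the exact title still matches, "History for the Afghans" becomes an extra match because two stopwords sit in the gap, and "History Afghans" does not match because the gap is missing.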