Compatibility of stemmed searches and generic language support
11 September 2014 02:02 PM
In MarkLogic Server v7.0-2, the tokenizer keys, for languages where MarkLogic provides generic language support, were removed so that they now all use the same key. For example, Greek falls into this class of languages. This change was made as part of an optimization for languages in which MarkLogic Server has advanced stemming and tokenization support.
Stemmed searches that include characters from languages that do not have advanced language support, performed on MarkLogic Server v7.0-2 or later releases, against content loaded on a version previous to v7.0-2, may not return the expected results.
In order to successfully run these stemmed searches, you can either:
If these are not possible in your environment, you can always run the query unstemmed.
The following example demonstrates the issue