30 January 2023 11:21 AM
In the more recent versions of MarkLogic Server, there are checks in place to prevent the loading of invalid documents (such as documents with multiple root nodes). However, documents loaded in earlier versions of MarkLogic Server can now result in duplicate URI or duplicate document errors being reported.
Additionally, under normal operating conditions, a document/URI is saved in a single forest. If somehow the load process gets compromised, then user may see issues like duplicate URI (i.e. same URI in different forests) and duplicate documents (i.e. same document/URI in same forest).
If one doesn't see
To check that the problem is actually duplicate documents, one can either do an xdmp:describe(fn:doc(...)) or fn:count(fn:doc((...)). If these commands return more than 1 e.g.
To fix duplicate documents, the document will need to be reloaded.
Before reloading, you can take a look at the two version to see if there is a difference. Check
If there is a difference, that may also that may point the operation that created the situation.