-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mlcp exports binary documents with XML content of other documents #83
Comments
Maybe this helps: I was able to reproduce the issue on a clean CentOS 7 VM with a new installation of MarkLogic Server.
The only significant difference: under Linux all 8 binary files are broken, not just 3 out of 8 like under Windows 10 |
Good bug report! |
import.zip Attached to this comment is a zip with more files for testing purposes. |
Situation
I am using mlcp to export all documents of a database to the local filesystem. In the end, I have the correct number of local files, but some files that should be binaries actually contain XML content from other documents. The XML documents themselves are okay.
Steps to reproduce the issue
C:\Temp\mlcp\import
:mlcp.bat import -host localhost -port 8070 -username **** -password **** -mode local -input_file_path C:\Temp\mlcp\import -output_uri_replace "/C:/Temp/mlcp/import,''"
For comparison with later results and to make sure everything is still okay after the import I used XQuery to determine the size of all documents:
for $doc in fn:doc() let $uri := fn:document-uri($doc) let $size := if (fn:exists($doc/binary())) then xdmp:binary-size($doc/binary()) else xdmp:binary-size(xdmp:unquote(xdmp:quote($doc),(),"format-binary")/binary()) order by $uri ascending return $uri || " -> " || $size
Everything looks good so far.
mlcp.bat export -host localhost -port 8070 -username **** -password **** -mode local -output_file_path C:\Temp\mlcp\export
The XML documents and 5 out of 8 binary documents are okay. The problem is, that image-003.gif and image-008.gif now have to same content as doc-A.xml and image-007.gif has the same content as doc-B.xml.
My system environment
The text was updated successfully, but these errors were encountered: