[xml import] find matching text in XMLs and copy that to the APIS version of the bio to circumvent the issues with the special characters #246

Open
sennierer opened this issue Apr 24, 2024 · 4 comments

Comments

@sennierer
Collaborator

sennierer commented Apr 24, 2024

See, e.g., this history.

During the export of the XMLs originally used for the APIS import, some (not all) special characters got lost. This problem no longer exists in the current exports (whether the Verlag dump or the GIDEON export). To resolve the special character issue, we should find the correct "Haupttext" and "Kurzinfo" in the XMLs, compare them with the APIS text, and copy them over if len(text_XML) == len(text_APIS). That way we can be certain all the special characters are in the new version.

Example from the link above: CVŠT (APIS) vs. ČVŠT (XML)
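
A minimal sketch of the length check described above, not the actual import code. The function name `pick_text` and the helper `_nfc` are hypothetical, and normalizing both strings to NFC before comparing lengths is an added assumption (so that a precomposed "Č" and a "C" followed by a combining caron count as the same length); the issue itself only asks for `len(text_XML) == len(text_APIS)`.

```python
import unicodedata


def _nfc(text: str) -> str:
    """Normalize to NFC so composed and decomposed characters have equal length.

    This normalization step is an assumption, not part of the original proposal.
    """
    return unicodedata.normalize("NFC", text)


def pick_text(text_apis: str, text_xml: str) -> str:
    """Return the XML text if it is as long as the APIS text, else keep the APIS text."""
    apis, xml = _nfc(text_apis), _nfc(text_xml)
    if len(apis) == len(xml):
        # Same length: assume only the lost special characters differ.
        return xml
    # Lengths differ: the texts may have diverged, so leave APIS untouched.
    return text_apis


print(pick_text("CVŠT", "ČVŠT"))
```

Equal length alone does not prove that the two texts differ only in special characters, so a stricter comparison (for example, comparing the texts with diacritics stripped) might be worth considering before overwriting.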

@b1rger
Contributor

b1rger commented Apr 24, 2024

Can you please describe the issue you are referring to? I don't know what the described solution tries to fix, and if we revisit this issue in a couple of weeks, we will save a lot of time if we don't have to work out what this could have meant.

@sennierer
Collaborator Author

should be more verbose now.

@b1rger
Contributor

b1rger commented Apr 24, 2024

> should be more verbose now.

Thanks!

@b1rger
Contributor

b1rger commented May 14, 2024
