Stable IDs of exported features #4960

Open
tpluscode opened this issue Jul 25, 2024 · 2 comments
Labels
⭐️ Enhancement New feature or request

Comments

@tpluscode

I noticed that when exporting in the CAS RDF format, the features are identified by hash URLs like <doc:example.pdf#48603>.

Unfortunately, adding or removing annotations and links appears to change these numbers.

Would it be possible to also export some stable identifier that is assigned to an annotation and does not change?

@reckart
Member

reckart commented Jul 25, 2024

It would require first adding some kind of persistent ID management to INCEpTION itself. There are currently no persistent IDs generated internally, so none can be exported. INCEpTION can import IDs on tokens and sentences and reproduce them during export, but there are no such ID fields on other annotation types. Also, INCEpTION has no mechanism for managing such IDs, e.g. to avoid duplicates.

There are IDs used internally, but they are not stable over time. The ID space is compacted every time you open a document. Also, these IDs are an implementation detail of UIMA that is not (fully) under the control of INCEpTION.

@tpluscode
Author

Well, on second thought, the document text is immutable and so I can use the combination of sofa start/end of annotations as stable identifiers. At least for non-overlapping layers.
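
A minimal sketch of that workaround, assuming the annotations have already been read out of the export as (layer, begin, end) tuples. The function names and the `doc:<name>#<layer>-<begin>-<end>` identifier shape are made up for illustration and are not something INCEpTION produces. The second helper reports identifiers shared by more than one annotation, which is the case where the approach breaks down (e.g. stacked annotations with the same span on one layer).

```python
from collections import Counter


def offset_id(doc, layer, begin, end):
    """Build an identifier from the document name, layer name and character offsets.

    The identifier format is hypothetical; it only relies on the document
    text (and therefore the offsets) not changing.
    """
    if begin < 0 or end < begin:
        raise ValueError(f"invalid span: begin={begin}, end={end}")
    return f"doc:{doc}#{layer}-{begin}-{end}"


def find_collisions(doc, annotations):
    """Return the identifiers that more than one annotation would share."""
    counts = Counter(offset_id(doc, layer, b, e) for layer, b, e in annotations)
    return sorted(ident for ident, n in counts.items() if n > 1)


annotations = [
    ("NamedEntity", 10, 17),
    ("NamedEntity", 42, 50),
    ("NamedEntity", 42, 50),  # stacked annotation on the same span
]
ids = [offset_id("example.pdf", layer, b, e) for layer, b, e in annotations]
collisions = find_collisions("example.pdf", annotations)
```

Here `collisions` contains only the identifier for the 42-50 span, so any layer that allows stacking or identical spans would need an extra discriminator (for instance a feature value) before these identifiers can be treated as unique.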

@reckart reckart added the ⭐️ Enhancement New feature or request label Jul 28, 2024
@reckart reckart added this to the ⭐️ Feature backlog milestone Sep 22, 2024