Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Decoration of gff transfers semicolon from description field creating new attribute #535

Open
jolbi opened this issue Sep 27, 2024 · 0 comments

Comments

@jolbi
Copy link

jolbi commented Sep 27, 2024

Hi,

Some descriptions in *.emapper.annotations.tsv contain semicolon e.g.: Domain in the RNA-binding Lupus La protein; unknown function
The semicolon is transferred to gff in the decoration step, creating a new no-tag attribute value.
I don't have the original tsv output anymore (I replaced semicolons with commas), so currently I cannot provide you with more examples, but there were at least three distinct descriptions containing semicolon in my output.

To me this looks like a bug, that breaks the gff and can cause problems in downstream analysis (e.g.: some software recognizes no-tag attributes as ID fields).

I am running emapper v2.1.12 with eggNOG DB v5.0.2.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant