-
Notifications
You must be signed in to change notification settings - Fork 105
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Weird GO terms of some genes occurred in annotation #505
Comments
Dear @Prunoideae , Thank you for reporting this. It is difficult for me to know whether the original gene is not properly annotated, or if the orthologous group is actually broader than "compound eye development", or if eggNOG-mapper is actually reporting a GO term which is too clade specific and it shouldn't be transferred to other orthologous. Maybe you can have a clue by checking the diamond alignment stats of your query to the eggNOG-mapper seed ortholog. Is it a close or distant hit? Best, |
I think it is because the term is too clade-specific. This gene is in the EggNOG 5.0 database. I queried the database by the query: SELECT name, gos
FROM prots WHERE name="10090.ENSMUSP00000046177"; And so the result is:
You can see a "P|GO:0048749|IMP" is there. |
Dear @Prunoideae , Thank you very much for looking into this. I searched your gene "ENSMUSP00000046177" in eggnog5 (http://eggnog5.embl.de/) to double check. I went to the narrowest OG (ENOG504Q28F from Rodentia), and under "Functional profile" there was this GO and also "compound eye corneal lens development". Since it is annotated like this in eggNOG 5, I guess that there is little that eggNOG-mapper can do about it. Best, |
I think I will filter out these terms manually then. Thank you for your clarification. |
I will leave the issue open, if you don't mind. Since it is a clear example of an issue with GO term annotation scope. |
Ok, no problem. |
From querying the database, I found that the protein "10090.ENSMUSP00000046177" has a GO term "GO:0048749", which is the "compound eye development", a term for insect eye development.
However, the protein comes from Mus musculus, which should not have such term, and it makes downstream analysis really confusing when seeing a mouse has genes related to such terms enriched...
From looking up in String-DB it seems that the same gene does not have this term.
The text was updated successfully, but these errors were encountered: