bioutils translate_cds does not correctly translate all human codons #74

Open
bsgiles73 opened this issue Sep 24, 2024 · 0 comments
Labels
enhancement New feature or request

bsgiles73 commented Sep 24, 2024

The vertebrate mitochondrial codon table, which applies to human mitochondrial genes, includes the alternative initiator codon ATT. The first three codons of MT-ND2 (ATT AAT CCC) should therefore translate to MNP, but the translate_cds function in the sequences module returns INP:

>>> from bioutils.sequences import translate_cds
>>> from bioutils.sequences import TranslationTable
>>> translate_cds("ATTAATCCC", translation_table=TranslationTable.vertebrate_mitochondrial)
'INP'

According to the NCBI translation table, ATT codes for both methionine and isoleucine: as the first codon of a coding sequence it is translated as M, and anywhere else as I. These alternative initiator codons are species-specific: ATT for human, ATC for mouse, and GTG for Coturnix and Gallus.

translate_cds should let users pass in enough information for it to return the correct amino acid when the sequence includes the first codon of the gene and that codon is an initiator for the given species.
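
To make the request concrete, here is a minimal, self-contained sketch of the behaviour I have in mind. It does not use bioutils at all: the function name `translate_with_initiator`, the `initiator_codons` parameter, and the `INITIATORS_BY_SPECIES` mapping are all hypothetical, and the mapping only lists the codons mentioned above rather than a complete set. The amino-acid string is the one published for NCBI translation table 2.

```python
from itertools import product

# NCBI translation table 2 (vertebrate mitochondrial).
# Codons are enumerated with bases in T, C, A, G order, first base slowest.
_BASES = "TCAG"
_AAS = "FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNKKSS**VVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AAS)
}

# Hypothetical mapping, limited to the initiators named in this issue.
INITIATORS_BY_SPECIES = {
    "human": frozenset({"ATT"}),
    "mouse": frozenset({"ATC"}),
    "gallus": frozenset({"GTG"}),
}


def translate_with_initiator(seq, initiator_codons=frozenset()):
    """Translate a CDS, reading the first codon as M if it is a listed initiator.

    Hypothetical helper: `seq` must start at the first codon of the gene
    and its length must be a multiple of three.
    """
    seq = seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # Only the first codon is affected; internal ATT stays isoleucine.
    if codons and codons[0] in initiator_codons:
        protein[0] = "M"
    return "".join(protein)


print(translate_with_initiator("ATTAATCCC", INITIATORS_BY_SPECIES["human"]))
print(translate_with_initiator("ATTAATCCC"))
```

With the human initiator set the first call gives MNP, and without it the second call gives INP, which matches the current translate_cds output. Whether this should be a new keyword argument on translate_cds or a separate function is up to the maintainers; an explicit set of codons seemed simpler than a species name because it keeps the species data out of the library.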

bsgiles73 added the enhancement (New feature or request) label Sep 24, 2024