Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extract and emit UniProt classes for proteins #326

Open
dustine32 opened this issue Nov 5, 2024 · 1 comment
Open

Extract and emit UniProt classes for proteins #326

dustine32 opened this issue Nov 5, 2024 · 1 comment
Assignees

Comments

@dustine32
Copy link
Collaborator

To help transition out of the REACTO convention, a low-hanging fruit is simply extracting the UniProt ID xrefs for proteins from the Reactome BioPAX and instantiate using these UniProt classes rather than the REACTO class.

Ex protein is in pathway R-HSA-163765 "ChREBP activates metabolic gene expression," reaction R-HSA-163666 having enabler protein MLX R-HSA-163695:
image
This change should switch from using class R-HSA-163695 to UniProtKB:Q9UH92.

@deustp01
Copy link
Collaborator

deustp01 commented Nov 5, 2024

This sounds right.

Can the protein's R-HSA-# ID be preserved as a cross-ref in the resulting GO-CAM instance? The rationale is that these identifiers for human proteins are true, well-behaved stable IDs, so will persist for as long as we have evidence that the protein (with its location and covalent modifications) really exists. If the evidence changes and we obsolete the protein, our history-tracking system should allow a user to see what, if anything, has replaced it.

Feature creep: this cross-rferencing will also be useful for the MOD projections. @ukemi

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants