Saving SAM read alignment with match chromosome/position, not the kme… #24

Merged
merged 1 commit into from
Sep 16, 2024

vineetbansal
Collaborator

Earlier versions of guidescan enumerate --format sam ... assumed that each kmer's chromosome and position could be read directly from the incoming kmer file. That assumption is reasonable, but this information is often unavailable beforehand, and blank or dummy chromosome/position values are filled in only to satisfy guidescan enumerate.

In these cases, the output of guidescan enumerate --format csv ... is correct, because it obtains the match positions directly from the off-targets (including those at distance 0). The output of guidescan enumerate --format sam ..., however, is incorrect in the reference name and reference position fields of the SAM file, because that information was never available in the input.

This PR fixes this issue by getting that information directly from the off-targets at distance 0.

Note that this means a kmer with multiple matches at distance 0 will produce multiple lines in the SAM file, each carrying identical off-target hex information. The CSV output already behaves this way, so this change also makes the two formats consistent.
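
To illustrate the behavior described above, here is a minimal Python sketch. It is not the guidescan implementation. The `Match` type, the `sam_lines` function, and the `of:H:00` tag value are hypothetical names chosen for this example, and strand handling is omitted. The sketch only shows the idea: the reference name (RNAME) and position (POS) columns come from each distance-0 match rather than from the kmer file, so a kmer with two exact matches yields two SAM lines that share the same off-target tag.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Match:
    """Hypothetical record for one genomic match of a kmer."""
    chrom: str     # reference name of the match
    pos: int       # 1-based leftmost position of the match
    distance: int  # mismatch distance between kmer and match


def sam_lines(name: str, seq: str, matches: List[Match],
              offtarget_tag: str) -> List[str]:
    """Build one SAM alignment line per distance-0 match.

    RNAME and POS are taken from the match itself, never from the
    (possibly dummy) chromosome/position in the kmer file.
    """
    lines = []
    for m in matches:
        if m.distance != 0:
            continue  # only exact matches become alignment records
        fields = [
            name,             # QNAME
            "0",              # FLAG (forward strand assumed in this sketch)
            m.chrom,          # RNAME, from the match
            str(m.pos),       # POS, from the match
            "255",            # MAPQ (255 = not available)
            f"{len(seq)}M",   # CIGAR
            "*", "0", "0",    # RNEXT, PNEXT, TLEN
            seq,              # SEQ
            "*",              # QUAL
            offtarget_tag,    # same off-target info on every line
        ]
        lines.append("\t".join(fields))
    return lines


matches = [Match("chr1", 100, 0), Match("chr2", 500, 0), Match("chr3", 42, 1)]
for line in sam_lines("guide1", "ACGT", matches, "of:H:00"):
    print(line)
```

In this example the kmer has two exact matches and one match at distance 1, so two lines are printed, one for chr1 and one for chr2, both ending in the same tag.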

…r chromosome/position (which may be invalid)
@vineetbansal vineetbansal merged commit 84234cd into master Sep 16, 2024
6 checks passed