Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Result output truncates reference files at first dot #19

Open
smehringer opened this issue Mar 3, 2023 · 1 comment
Open

Result output truncates reference files at first dot #19

smehringer opened this issue Mar 3, 2023 · 1 comment

Comments

@smehringer
Copy link

Hello,

We built an index over RefSeq genomes. The downloaded filenames are named like this:

/path/GCF_000019125.1_ASM1912v1_genomic.fna.gz
/path/GCF_000019165.1_ASM1916v1_genomic.fna.gz
...

When searching the index, the result looks as follows:

*query1 XXX
GCF_000019125 XXX
GCF_000019165 XXX
...

Luckily for us, the names are still unique and we should be able to compare the output with some effort to reconstruct the full reference name.

This format is lossy if the names weren't unique before the first dot and might even lead to severe false negatives if not noticed by the user.

Best,
Svenja

@iqbal-lab
Copy link

thanks for pointing this out @smehringer . I don't understand why i didnt get notified of your comment. Will follow this up, but leandro has left the project so there will be a delay

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants