SortMeRNA: update version 4.3.6 #1316
Cool. One of my users just asked for an update. Can I help here?
Tomorrow I'll continue working on it; I'll ping you if I find any problem.
This is the problem that I found and that temporarily stalled the PR, @bernt-matthias: apparently SortMeRNA generates an alignment in the temporary folder, and Galaxy tries to index it without success, which raises an error. I tried to specify the path of this folder in order to provide an adequate extension, but I think that is not possible.
Can you check if the file is empty?
You are right, this is indeed the problem. I'll try to find a better input file.
For the linting. Sorry for such a messy tool :(
Now it should be fine; some scripts were removed (e.g. merge-paired-reads.sh and unmerge-paired-reads.sh) and replaced by equivalent functionality.
@gallardoalba setting a profile version gives every job its own HOME dir.
Perfect, thanks for including it.
Quite a large update. Good work. Here are a few comments from my side.
</conditional>
<param name="strand_search" value="" />
<conditional name="databases_type">
<param name="databases_selector" value="history" />
Would be great to have a test for the cached case.
Yes, I need to review it, because the test data tables do not seem to be available, neither in the previous version nor in the update.
$ref.append('%s' % $db )
#end for
#else
#for $db in $databases_type.input_databases.fields.path.split(",")
So the idea here is that a single entry is selected and split by comma? What does this look like in practice?
Wondering if making the cached case multiple="true" would be an option? It might be more flexible (backward compatibility might be a bit tricky, but not very).
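To make the question concrete, here is a minimal sketch in plain Python (not the tool's Cheetah template) of what splitting a single selected entry by comma amounts to. The function name `build_ref_args` and the example paths are hypothetical, and stripping whitespace around each path is an added assumption that the template line above does not make.

```python
def build_ref_args(path_field):
    """Turn a comma-separated path field into repeated --ref arguments.

    `path_field` stands in for the `path` column of the selected data
    table entry (hypothetical example values are used below).
    """
    # Assumption: surrounding whitespace and empty items are dropped.
    refs = [p.strip() for p in path_field.split(",") if p.strip()]
    args = []
    for ref in refs:
        # One --ref per reference, mirroring the loop in the template.
        args.extend(["--ref", ref])
    return args


if __name__ == "__main__":
    print(build_ref_args("db/a.fasta,db/b.fasta"))
```

With multiple="true" the same loop would simply run over several selected entries instead of the pieces of one entry.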
#for $i, $reference in enumerate($ref)
--ref '$reference'
#end for
#if str( $databases_type.databases_selector ) != 'cached'
Regarding the cached / cached_to_index question: this seems to happen now in sortmerna.
But it's still not clear how this works in practice, because for both options the user selects from the same data table?
I do not completely understand it either. Reference FASTA and indices can be provided this way: ./sortmerna --ref ./rRNA_databases/silva-bac-16s-id90.fasta,./index/silva-bac-16s-db
Perhaps @bebatut could have a look, since she wrote the data manager. Otherwise I would propose to keep this section unchanged. I think it should still work properly.
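As a rough illustration of the FASTA,index form quoted above, the sketch below parses such a value into pairs and then emits one --ref per FASTA file, as the updated template does. It assumes, as I recall from the SortMeRNA 2.x manual, that several pairs are joined with ":"; the function names `parse_ref_pairs` and `to_repeated_refs` are hypothetical and not part of the tool.

```python
def parse_ref_pairs(ref_value):
    """Split 'fasta,index[:fasta,index...]' into (fasta, index) tuples.

    Assumption: pairs are ':'-separated, FASTA and index ','-separated.
    """
    pairs = []
    for item in ref_value.split(":"):
        # partition() leaves index empty if no comma is present.
        fasta, _sep, index = item.partition(",")
        pairs.append((fasta, index))
    return pairs


def to_repeated_refs(pairs):
    """Emit one --ref per FASTA file, ignoring the precomputed index."""
    args = []
    for fasta, _index in pairs:
        args += ["--ref", fasta]
    return args


if __name__ == "__main__":
    value = "./rRNA_databases/silva-bac-16s-id90.fasta,./index/silva-bac-16s-db"
    print(to_repeated_refs(parse_ref_pairs(value)))
```

Dropping the index half is a design choice of this sketch only; whether precomputed indexes can still be reused is exactly the open question in this thread.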
Do you think it could be merged, @bernt-matthias? I would like to test if it works with the installed indexed genomes.
Would be cool to have a test. Let me know if the PR is ready from your side and I will review and merge.
Hi @bernt-matthias, I'm trying to create the test for the database, but I'm not sure how to create the file structure. According to https://github.com/bgruening/galaxytools/blob/master/data_managers/data_manager_sortmerna_database_downloader/data_manager/data_manager_sortmerna_download.py#L122 it seems to be fine, but I don't know why the tool is not able to recognize it. Would you mind having a look? Thanks a lot!
Hi @gallardoalba, what is the state here?
What exactly should I look at? Is there a failing test that I could examine?
Will add a test for cached data. Wondering if the loops are correct, i.e. in multiple="true".
otherwise tests do not use container
I get the impression that the use of (multiple?) cached references was already wrong in 2.1. But I guess most of the time a single one is used. The docs state
But Galaxy just executes with
Also the
tool writes to `$output_bam`
Hi @bgruening, I was still fixing bugs and adding tests wrt references. I stopped CI, but feel free to restart if you need the current state. I will open a followup PR.
followup on bgruening#1316 which was not deployed
- use the same cached data for cached and cached_to_index, i.e. now they differ only in that the latter uses the dbprep macro. If I get it right, previously the datamanager precomputed indexes which could be used; this seems not possible anymore. I suggest leaving the dm untouched (then the provided data will also work for old sortmerna versions)
- fix usage of cached data (did not work for multiple provided values) and add tests
* sortmerna: finish update
  followup on #1316 which was not deployed
  - use the same cached data for cached and cached_to_index, i.e. now they differ only in that the latter uses the dbprep macro. If I get it right, previously the datamanager precomputed indexes which could be used; this seems not possible anymore. I suggest leaving the dm untouched (then the provided data will also work for old sortmerna versions)
  - fix usage of cached data (did not work for multiple provided values) and add tests
* add missing test file
* cached references are also not optional: selects default to optional="true", which should not apply here. Also, checkboxes do not work because of that.
* eliminate cached_to_index option
Main changes: