bowtie build #12

jkanbar · 2016-11-02T19:20:03Z

Continuing from #10

scripts

Merge branch 'master' of https://github.com/jkanbar/tcga-1

wasade · 2016-11-08T05:23:43Z

python_scripts/cgc_bowtie2_build.py

+                           taxa_levels,
+                           read_per_taxa):
+    """Absolute abundance of number of reads matching a defined taxa level.
+    Parameters


newline please
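For illustration, the layout being asked for is presumably the numpydoc convention: a blank line between the one-line summary and the `Parameters` section. A minimal sketch follows; the function name `taxa_abundance` and the parameter descriptions are placeholders, not taken from the PR.

```python
def taxa_abundance(taxa_levels, read_per_taxa):
    """Absolute abundance of number of reads matching a defined taxa level.

    Parameters
    ----------
    taxa_levels
        Placeholder description; the real type is not shown in the diff.
    read_per_taxa
        Placeholder description; the real type is not shown in the diff.
    """
    # body elided in the diff under review
    raise NotImplementedError
```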

wasade · 2016-11-08T05:25:41Z

python_scripts/cgc_bowtie2_build.py

+    taxonomic_abundances= []
+    for report_fp in kraken_mpa_report_fp:
+        with open(report_fp) as report_fp:
+            for line in report_fp:


would it be possible to break this parser out? seems like a logical point for decomposition. i'm not seeing a paired unit test or set of tests either...?
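One way the decomposition could look, as a rough sketch only: a per-line parser that can be tested without touching the filesystem. The function name `parse_mpa_line`, the `level_prefix` argument, and the assumed line layout (a `|`-joined taxonomy string, a tab, then an integer count) are assumptions for illustration and are not taken from the diff.

```python
def parse_mpa_line(line, level_prefix="g__"):
    """Return (taxonomy, count) for one report line, or None if skipped.

    Assumes a tab-separated line: taxonomy string, then an integer count.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 2:
        # malformed or empty line
        return None
    taxonomy, count = fields
    # keep only lines whose deepest rank matches the requested level
    if not taxonomy.split("|")[-1].startswith(level_prefix):
        return None
    return taxonomy, int(count)


def test_parse_mpa_line():
    hit = parse_mpa_line("d__Bacteria|g__Escherichia\t12\n")
    assert hit == ("d__Bacteria|g__Escherichia", 12)
    # wrong rank and malformed input are both skipped
    assert parse_mpa_line("d__Bacteria\t40\n") is None
    assert parse_mpa_line("malformed line") is None
```

With the parser isolated like this, the file loop only has to open each report and collect the non-`None` results.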

wasade · 2016-11-08T05:26:58Z

python_scripts/cgc_bowtie2_build.py

+                    taxonomic_abundances.append(taxonomy_parse)
+
+    taxonomies = set([k for k, v in
+                      collections.Counter(taxonomic_abundances).iteritems()


this is a py2 codebase?
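For context: `dict.iteritems()` exists only on Python 2, while `.items()` works on both 2 and 3. A small sketch of a version-neutral form is below; the sample data and the `>=` threshold condition are assumptions, since the filter expression is cut off in the diff.

```python
import collections

# made-up sample data standing in for the parsed taxonomy strings
taxonomic_abundances = ["g__A", "g__B", "g__A", "g__A", "g__B", "g__C"]
read_per_taxa = 2  # hypothetical threshold value

counts = collections.Counter(taxonomic_abundances)
# .items() is available on Python 2 and 3; .iteritems() is Python 2 only.
# The ">=" comparison is an assumed filter, not the one in the PR.
taxonomies = {taxon for taxon, n in counts.items() if n >= read_per_taxa}
```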

wasade · 2016-11-08T05:27:08Z

python_scripts/cgc_bowtie2_build.py

+                     output_filename):
+
+    """Return sets for sample IDs and taxonomy strings.
+    Parameters


newline please

wasade · 2016-11-08T05:27:41Z

python_scripts/cgc_bowtie2_build.py

+
+
+@click.command()
+


remove the extra newline please

wasade · 2016-11-08T05:34:51Z

scripts/generate_kraken_db.py

+    with open(scores_repophlan_fp) as scores_repophlan_f:
+        # header
+        line = scores_repophlan_f.readline()
+        line = line.strip().split('\t')


yea... i think pandas would be so nice here
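A rough sketch of what the pandas route might look like, assuming the scores file is tab-separated with a header row. The column names `genome_id`, `tax_id`, and `score` and the sample rows are invented for illustration; the real file's columns are not shown in the diff.

```python
import io

import pandas as pd

# stand-in for the scores file; column names and rows are hypothetical
scores_text = (
    "genome_id\ttax_id\tscore\n"
    "G1\t562\t0.9\n"
    "G2\tunknown\t0.4\n"
)

# read everything as strings so malformed fields are not silently cast
scores = pd.read_csv(io.StringIO(scores_text), sep="\t", dtype=str)

# non-integer tax_id values become NaN and can be inspected or dropped
scores["tax_id_num"] = pd.to_numeric(scores["tax_id"], errors="coerce")
valid = scores.dropna(subset=["tax_id_num"])
```

Columns are then addressed by name, which removes the manual header parsing and index bookkeeping.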

wasade · 2016-11-08T05:35:44Z

scripts/generate_kraken_db.py

+            # only want tax_ids for genomes passing quality filter
+            if genome_id in genomes:
+                tax_id = line[tax_id_idx]
+                # tax_id must be an integer, if not check the field


that sucks. has this bug been reported to the repophlan maintainers?

wasade · 2016-11-08T05:36:18Z

scripts/generate_kraken_db.py

+        taxid = info[1]
+        genome_fp_name = basename(splitext(genome_fp)[0])
+        # check FNA file exists for genome
+        if genome_fp_name != "":


if genome_fp_name:
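A quick check that the two forms agree for strings, using the same `basename(splitext(...))` expression as the diff. The helper name `genome_name` and the sample paths are made up for the example.

```python
from os.path import basename, splitext


def genome_name(genome_fp):
    # same expression as in the diff, wrapped in a hypothetical helper
    return basename(splitext(genome_fp)[0])


names = [genome_name(fp) for fp in ["genomes/G1.fna", ""]]
for name in names:
    # for str values, truthiness and != "" give the same answer
    assert bool(name) == (name != "")
```

The two checks differ only for non-string values such as `None`, where `None != ""` is true but `if None:` is false.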

wasade · 2016-11-08T05:37:04Z

scripts/generate_kraken_db.py

+            output_fp = join(repophlan_scores_filtered_genomes_dp,
+                             genome_fp_name)
+            # skip files already modified (e.g. from previous run)
+            if not isfile(output_fp):


should this be os.path.exists?
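The practical difference between the two calls: `os.path.isfile` is true only for regular files (or symlinks to them), while `os.path.exists` is also true for directories. A small sketch with made-up file names; note that `tempfile.TemporaryDirectory` requires Python 3.

```python
import os
import tempfile
from os.path import exists, isfile, join

with tempfile.TemporaryDirectory() as tmp_dp:
    file_fp = join(tmp_dp, "G1.fna")     # a regular file
    dir_fp = join(tmp_dp, "G2.fna")      # a directory with the same kind of name
    missing_fp = join(tmp_dp, "G3.fna")  # never created

    open(file_fp, "w").close()
    os.mkdir(dir_fp)

    # (isfile, exists) for each case
    results = {
        "file": (isfile(file_fp), exists(file_fp)),
        "directory": (isfile(dir_fp), exists(dir_fp)),
        "missing": (isfile(missing_fp), exists(missing_fp)),
    }
```

If a directory at `output_fp` should also count as "already processed", `exists` is the broader check; if only a finished output file should count, `isfile` is the stricter one.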

wasade · 2016-11-08T05:37:42Z

scripts/generate_kraken_db.py

+    """Edit qualified genomes' labels to Kraken format.
+    """
+    # .fna.bz2 genomes folder
+    all_genomes_bz2_dp = sys.argv[1]


ekopylova and others added 11 commits July 8, 2016 07:04

Initial commit

b6bc4f5

scripts

58858fa

Merge pull request biocore#1 from ekopylova/code

3382f26

scripts

adding to python scripts

04668bc

Merge branch 'master' of https://github.com/jkanbar/tcga

c09670d

Made changes to add tuple of input kraken mpa reports

b79a084

Delete cgc_bowtie2_build.py

6c4628a

Update README.md

87971fc

Can now add input tuple of krakne mpa reports

85ef3a2

Update

4344f93

Merge branch 'master' of https://github.com/jkanbar/tcga-1

pull request ready for review

8599710

wasade reviewed Nov 8, 2016
