Only outputting one file? #64

theheking · 2019-03-24T16:19:56Z

Hi,

I am running polyester using the following code:

simulate_experiment(EnsemblfastaFile, numreps=c(1,1),fold_changes=fold_changes, reads_per_transcript=readspertx, paired=FALSE, outdir="/mnt/lustre/users/k1632479/polyester/simulatedread", distr="empirical", error_model="illumina5", bias="rnaf")

It runs for hours and outputs a large 8 GB fasta file.
I don't understand what I'm doing wrong.

JMF47 · 2019-03-24T16:22:01Z

It’s likely that this is an input issue, and it would be more appropriate for the Bioconductor user forum.

What does your readspertx look like?

theheking · 2019-03-24T21:58:11Z

If it is an input issue, I don't know why I would get an output FASTA file for the first sample, sample_01.fasta, which is 14 GB.

I have already removed the zeroes from readspertx, so I don't think it is that.
`fasta_File <- readDNAStringSet(EnsemblfastaFile)
# replace all zero widths with 1
fastaFile_nozero <- replace(width(fasta_File), width(fasta_File) == 0, 1)

readspertx <- round(20 * fastaFile_nozero / 100)`
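
One thing worth checking in the snippet above: it replaces zeroes in the transcript widths, not in the final read counts, so very short transcripts can still round down to zero reads. Below is a minimal base R sketch of that effect and one possible guard. The `widths` vector is hypothetical example data, and using `pmax()` to enforce a floor of one read is an assumption about what is wanted here, not something polyester requires.

```r
# Hypothetical transcript widths, including a zero and some very short ones
widths <- c(0, 2, 5, 1070, 110)

# Same formula as in the post: roughly 20x coverage with 100 bp reads
reads_raw <- round(20 * widths / 100)

# Widths below 3 still give 0 reads after rounding, so clamp the counts
# themselves to a minimum of 1 (assumed goal: no zero counts)
reads_min1 <- pmax(1, reads_raw)

print(reads_raw)
print(reads_min1)
```

Clamping the counts rather than the widths keeps the coverage formula unchanged for every transcript that already gets at least one read.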

JMF47 · 2019-03-24T22:03:18Z

What does the following show:

head(readspertx)
sum(readspertx)

theheking · 2019-03-24T22:10:37Z

head(readspertx)
[1] 214 22 831 598 727 96
sum(readspertx)
[1] 47182274

JMF47 · 2019-03-25T00:15:09Z

If I’m understanding your question and the output you’ve shared correctly, the output file size is within the expected range. The input you specified generates one file per sample, since `paired=FALSE`. If I recall correctly, 47 million 100 bp reads stored in FASTA format should take up more than 10 GB. If you could tell me a bit more about the goal of your simulation (such as simulating 20x coverage of every transcript in the transcriptome), I could try to help you tweak your commands.
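
As a rough sanity check on the file size, the arithmetic can be sketched in base R. This is only a back-of-the-envelope estimate: `estimate_fasta_bytes` is a hypothetical helper, not part of polyester, and `header_bytes` is an assumed average header length that depends on how long the transcript names in the input FASTA are. It also treats `sum(readspertx)` as the read count per sample, which ignores the effect of `fold_changes` and the count distribution.

```r
# Hypothetical helper: approximate size of an uncompressed FASTA file.
# Each record is a header line plus a sequence line, each ending in a newline.
estimate_fasta_bytes <- function(n_reads, read_len = 100, header_bytes = 100) {
  n_reads * (header_bytes + 1 + read_len + 1)
}

# sum(readspertx) as reported in the thread
total_reads <- 47182274

# header_bytes = 100 is an assumption; long Ensembl-style names push it higher
est_gb <- estimate_fasta_bytes(total_reads, read_len = 100, header_bytes = 100) / 1e9
cat(sprintf("Estimated size per sample: about %.1f GB\n", est_gb))
```

Varying `header_bytes` shows how strongly the read headers influence the total; with tens of millions of reads, several gigabytes per sample is expected either way.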
