Way to access NCBI STAT data in bulk #11

Open
Jalapenobadger opened this issue Jan 24, 2020 · 3 comments

Comments

@Jalapenobadger

Hi,
I'm wondering whether there is any way to access the taxonomic data that STAT automatically generates for each NCBI run. Every metagenomic upload on the SRA has this analysis generated and displayed as a Krona chart, but is there a route by which we could download the data in simple text form, for playing around with association rule mining?

Also, is there a roadmap or website dedicated to this project anywhere besides GitHub? Is there somewhere people can find more information about STAT, such as who works on it or what your future goals for it might be?

Thanks!
-Pete

@Jalapenobadger
Author

Jalapenobadger commented May 20, 2021 via email

@babarlelephant

babarlelephant commented May 21, 2021

Thanks a lot @Jalapenobadger. I was able to get all the accessions that mention Coronaviridae in the taxonomy analysis. (This is the full analysis visible in the HTML source code; the Analysis tab of https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR2063951 only shows the best matches.)

I created a Gmail account (I had to enter my phone number), then ran the following in https://console.cloud.google.com/bigquery:

SELECT acc FROM `nih-sra-datastore.sra_tax_analysis_tool.tax_analysis` WHERE name = "Coronaviridae"
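
For anyone who prefers a script to the web console, here is a minimal sketch of the same query in Python. It assumes the `google-cloud-bigquery` package is installed and that you have a Google Cloud project with credentials configured. The helper names `build_query` and `fetch_accessions` are my own, not part of STAT or of any NCBI tooling.

```python
from typing import List

# Table name taken from the query above.
TABLE = "nih-sra-datastore.sra_tax_analysis_tool.tax_analysis"


def build_query(table: str = TABLE) -> str:
    """Return the SQL text, with the taxon name left as a named parameter."""
    # @taxon is bound at run time instead of being pasted into the SQL string.
    return f"SELECT acc FROM `{table}` WHERE name = @taxon"


def fetch_accessions(client, taxon: str) -> List[str]:
    """Run the query with a bigquery.Client and collect the acc column."""
    # Imported here so build_query() works without the package installed.
    from google.cloud import bigquery

    job_config = bigquery.QueryJobConfig(
        query_parameters=[
            bigquery.ScalarQueryParameter("taxon", "STRING", taxon)
        ]
    )
    rows = client.query(build_query(), job_config=job_config).result()
    return [row["acc"] for row in rows]


# Usage (needs credentials, and the query counts against your quota):
#   from google.cloud import bigquery
#   accessions = fetch_accessions(bigquery.Client(), "Coronaviridae")
```

Passing the taxon name as a query parameter keeps the SQL text fixed, so the same function can be reused for other names in the `name` column.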

Saving the results as "local csv" gave me only 16000 rows. To get all 229293 results I used "save on google drive" instead.
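
Once the CSV is downloaded, a short sketch like this one could load the accessions for further processing. It assumes the export has a header row with a single `acc` column, matching the query's `SELECT acc`. The function name `read_accessions` is hypothetical, and the de-duplication is only a precaution, since I have not checked whether the table repeats an accession for the same name.

```python
import csv
from typing import List, TextIO


def read_accessions(handle: TextIO, column: str = "acc") -> List[str]:
    """Return the unique accessions from a CSV export, in first-seen order."""
    seen = set()
    ordered = []
    for row in csv.DictReader(handle):
        # Skip rows where the column is missing or empty.
        acc = (row.get(column) or "").strip()
        if acc and acc not in seen:
            seen.add(acc)
            ordered.append(acc)
    return ordered


# Usage, with a hypothetical file name:
#   with open("coronaviridae_accessions.csv", newline="") as fh:
#       accessions = read_accessions(fh)
```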

Note that this interface is limited for free accounts, unless you enter a credit card number, which gets you $300 in free credits.

@Jalapenobadger
Author

Jalapenobadger commented May 22, 2021 via email


2 participants