sodascience/kansenkaart_analysis

Parallel estimation for the Kansenkaart on the OSSC

pipeline.png

This repository holds the code for parallel estimation of the Kansenkaart project estimates on the ODISSEI Secure Supercomputer.

This is a work in progress!

Steps to run the analysis

Repository preparation

  1. Add the cohort datasets resulting from the preprocessing pipeline to the data/ folder.
  2. Point to the right resources in lines 8-9 in 02_create_grid.R.
  3. Ensure the right COROP code year (default: 2019) is being used in line 72 of 02_create_grid.R.
  4. Check that the right datasets are loaded in lines 33-37 of 03_array_job.R.
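
The line numbers in the preparation steps can be inspected from the shell before editing. The snippet below is a minimal sketch: `show_lines` is a hypothetical helper (not part of the repository), and it assumes the scripts sit in the current working directory. If the scripts have changed since this README was written, the referenced line numbers may have shifted.

```shell
# show_lines FILE START END: print lines START..END of FILE, prefixed with
# their line numbers. Hypothetical helper, not part of the repository.
show_lines() {
  if [ ! -f "$1" ]; then
    echo "skipping $1: file not found" >&2
    return 0
  fi
  awk -v s="$2" -v e="$3" 'NR >= s && NR <= e { printf "%d: %s\n", NR, $0 }' "$1"
}

# Lines referenced in the preparation steps above.
show_lines 02_create_grid.R 8 9     # resource paths
show_lines 02_create_grid.R 72 72   # COROP code year
show_lines 03_array_job.R 33 37     # datasets being loaded
```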

OSSC running

  1. Create and store the data/model_grid.feather file by running 02_create_grid.R.
  2. Perform a test run by running Rscript 03_array_job.R 1 in bash on the OSSC.
  3. Check how many jobs are needed: nrow(model_grid) / batch_size, rounded up.
  4. Create an array job using array_job.sh.
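
The job count from step 3 can be worked out with integer ceiling division. The sketch below uses made-up numbers: 1050 and 100 are placeholders for nrow(model_grid) and the batch_size set in 03_array_job.R. The final line assumes that the OSSC uses a SLURM scheduler and that array_job.sh passes a 1-based array index to 03_array_job.R, as the test run with argument 1 suggests. Neither assumption is confirmed here, so check array_job.sh before submitting.

```shell
# jobs_needed N_ROWS BATCH_SIZE: number of array jobs needed to cover all rows.
# Rounds up, so a final partially filled batch still gets its own job.
jobs_needed() {
  echo $(( ($1 + $2 - 1) / $2 ))
}

# Placeholder values; replace with nrow(model_grid) and your batch_size.
n_rows=1050
batch_size=100

n_jobs=$(jobs_needed "$n_rows" "$batch_size")
echo "jobs needed: ${n_jobs}"

# Assumption: SLURM scheduler, 1-based array indices.
echo "sbatch --array=1-${n_jobs} array_job.sh"
```

With these placeholder values the script reports 11 jobs: ten full batches of 100 rows plus one batch for the remaining 50.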
