Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consolidate data pipeline scripts #180

Open
laurafeeney opened this issue Nov 11, 2020 · 1 comment
Open

Consolidate data pipeline scripts #180

laurafeeney opened this issue Nov 11, 2020 · 1 comment
Assignees

Comments

@laurafeeney
Copy link
Collaborator

]Create single notebook / script for the data flow from deidentified-but-still-raw data to ‘prosecution_charges_detailed’. Right now, prosecution_charges is both an input and output of two different scripts, without a clear indication of what should be run first. Would be helpful to just condense those steps into a single script.

The general pipeline is in the readme in the /notebooks page.

Thoughts on how to do this are also drafted here: Procedure for adding new MA prosecution data

@linnalihe
Copy link
Collaborator

linnalihe commented Aug 12, 2021

@agathaalmunir , @mknotts623 , and @linnalihe will review the scripts and write out a summary what the scripts are doing / did.
Date to work on this - Thursday 8/26/2021

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants