data-engineering-swiss-army-knife

A collection of various tools, notebooks and virtual environments for a number of data engineering task.

Notebooks

Postgres Parquet ingest

This notebook provides the tooling for ingesting parquet files of an AWS RDS backup in a S3 bucket into a postgres database.

It allows to specify the databases, schemas or tables that should be read from the parquet files and ingested into the database. Schema and tables names can be altered on the fly with an template string. The notebook also creates the schemas and tables in the database if they do not exist.

The notebook is available here. In order to use the notebook, create the virtual environment that the notebook is using. The virtual environment can be created using the following command:

cd notebooks/postgres_parquet_ingest
pipenv install
pipenv run python -m ipykernel install --user --name=postgres_parquet_ingest
pipenv run jupyter notebook

This will install the necessary dependencies (S3Path, SQLAlchemy, pandas and others) and create a virtual environment for the notebook. The virtual environment will be named postgres_parquet_ingest. The notebook can then be started by running jupyter notebook in the same directory. The notebook will be available in the browser at http://localhost:8888/notebooks/postgres_parquet_ingestion.ipynb.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
notebooks/postgres-parquet-ingest		notebooks/postgres-parquet-ingest
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

data-engineering-swiss-army-knife

Table of Contents

Notebooks

Postgres Parquet ingest

About

Releases

Packages

Languages

License

Leibniz-HBI/data-engineering-swiss-army-knife

Folders and files

Latest commit

History

Repository files navigation

data-engineering-swiss-army-knife

Table of Contents

Notebooks

Postgres Parquet ingest

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages