Skip to content

Simple ETL from a zipped flat file to a PostegreSQL database - Assignment 5 for the course "Data Acquisition and Management" of the Artificial Intelligence Diploma program from The University of Winnipeg.

License

Notifications You must be signed in to change notification settings

ygalvao/UoW-Data-Acquisition-and-Management-ETL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Simple ETL - Assignment 5 - Data Acquisition and Management - AI - The University of Winnipeg

Overview

This project is a simple Extract, Transform, Load (ETL) data pipeline using Bash, Python, and PSQL.

Prerequisites

  • Python 3.7 or higher
  • Pandas 1.5.3 or higher
  • Bash shell
  • PSQL

Project Structure

  • transform.py: Python script responsible for transforming the data.
  • etl.sh: Bash script that orchestrates the ETL process and does the extraction and loading of the data.

Running the ETL Pipeline

To run the ETL pipeline, execute the etl.sh from your terminal (after making it executable).

chmod +x etl.sh
./etl.sh

This script will call transform.py as part of its process.

Customizing the Pipeline

The transformation logic can be customized by editing the transform.py script.

License

MIT - see the LICENSE file for more details.

About

Simple ETL from a zipped flat file to a PostegreSQL database - Assignment 5 for the course "Data Acquisition and Management" of the Artificial Intelligence Diploma program from The University of Winnipeg.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published