This repository contains an end-to-end AI project covering data ingestion, preprocessing, model training, and deployment, along with CI/CD and orchestration tooling. It is a complete CIFAR-10 image-classification project that showcases Data Engineering, Machine Learning, and DevOps skills, built around a fully automated pipeline using Airflow, Docker, and Kubernetes.
- Model Training: CNN with TensorFlow (see the sketch after this list).
- Deployment: Flask API containerized and deployed on Kubernetes (EKS).
- Monitoring: Real-time monitoring with Prometheus and Grafana.
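
The training component is described above as a TensorFlow CNN. Below is a minimal sketch of what such a model could look like for CIFAR-10; the layer sizes, optimizer, and epoch count are illustrative assumptions, not the repository's actual values.

```python
# Minimal CIFAR-10 CNN sketch in TensorFlow/Keras.
# Layer sizes, optimizer, and epochs are illustrative assumptions.
import tensorflow as tf

(x_train, y_train), (x_test, y_test) = tf.keras.datasets.cifar10.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0  # scale pixels to [0, 1]

model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, 3, activation="relu", input_shape=(32, 32, 3)),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(10),  # one logit per CIFAR-10 class
])
model.compile(
    optimizer="adam",
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    metrics=["accuracy"],
)
model.fit(x_train, y_train, epochs=10, validation_data=(x_test, y_test))
```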
- Clone the repository.
- Install dependencies with:
  ```bash
  pip install -r requirements.txt
  ```
- Run the Docker containers:
  ```bash
  docker-compose up
  ```
- Access the Flask API at `http://localhost:5000/predict` (example request below).
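
Once the stack is up, you can exercise the endpoint from Python. This is a hypothetical client call; the payload schema depends on how `app.py` parses the request and is an assumption here.

```python
# Hypothetical client call; the JSON payload schema is an assumption,
# not necessarily what app.py expects.
import requests

# Placeholder 32x32x3 image as nested lists of zeros.
payload = {"image": [[[0.0, 0.0, 0.0]] * 32] * 32}
resp = requests.post("http://localhost:5000/predict", json=payload)
print(resp.status_code, resp.json())
```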
- Apache Airflow for orchestration.
- TensorFlow for model training.
- Flask for API deployment.
- Docker & Kubernetes for containerization and orchestration.
- Terraform for infrastructure as code.
- Data Ingestion: Apache Airflow for ETL (a DAG sketch follows this list).
- Model Training: Scikit-learn Random Forest Classifier.
- Deployment: Flask API serving the trained model.
- CI/CD: GitHub Actions for Continuous Integration and Continuous Deployment.
- Containerization: Docker for containerizing the API and pipeline.
- Orchestration: Kubernetes for deployment on EKS.
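
As referenced in the ingestion item above, ETL runs as an Airflow DAG. A minimal sketch of what `dags/data_ingestion.py` could look like; the schedule, task names, and the `download_cifar10` helper are assumptions for illustration.

```python
# Minimal Airflow DAG sketch; schedule, task names, and the
# download_cifar10 helper are illustrative assumptions.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def download_cifar10():
    """Hypothetical task body: fetch raw CIFAR-10 into data/raw/."""
    ...


with DAG(
    dag_id="data_ingestion",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    ingest = PythonOperator(
        task_id="download_cifar10",
        python_callable=download_cifar10,
    )
```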
```
AI-Project/
├── data/                      # Raw and processed data
│   ├── raw/                   # Raw data (downloaded)
│   └── processed/             # Processed data after ETL
├── dags/                      # Apache Airflow DAGs for orchestration
│   ├── data_ingestion.py      # Airflow DAG for data ingestion
│   └── data_preprocessing.py  # Airflow DAG for data preprocessing
├── src/                       # Core source code
│   ├── preprocessing.py       # Python script for data preprocessing
│   ├── train_model.py         # Script to train the ML model
│   ├── app.py                 # Flask API for serving the model
│   └── model/                 # Model-related code and artifacts
│       └── model.pkl          # Serialized trained model
├── notebooks/                 # Jupyter Notebooks for EDA, experiment tracking
│   └── eda.ipynb              # Exploratory Data Analysis notebook
├── mlruns/                    # MLflow artifacts (tracked experiments)
│   └── 0/                     # MLflow experiment folders
├── docker/                    # Docker-related files
│   ├── Dockerfile             # Dockerfile for containerizing the app
│   └── docker-compose.yml     # Compose file for multi-container setup
├── k8s/                       # Kubernetes configuration for EKS deployment
│   ├── deployment.yaml        # Kubernetes deployment configuration
│   ├── service.yaml           # Kubernetes service configuration
│   └── configmap.yaml         # ConfigMap for environment variables
├── .github/                   # GitHub Actions CI/CD pipeline
│   └── workflows/
│       └── main.yml           # GitHub Actions for CI/CD pipeline
├── requirements.txt           # Python dependencies
├── Dockerfile                 # Docker configuration for app containerization
├── README.md                  # Documentation
├── app.py                     # Flask API for model inference
├── config.py                  # Configuration file (paths, hyperparameters, etc.)
└── setup.py                   # Python package setup file
```
- Python 3.8+
- Docker
- Kubernetes (for deployment)
- Apache Airflow (for orchestration)
- Clone the repo:
  ```bash
  git clone https://github.com/your_username/ai_project.git
  cd ai_project
  ```
- Install dependencies:
  ```bash
  pip install -r requirements.txt
  ```
- Run preprocessing:
  ```bash
  python src/preprocessing.py
  ```
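
The preprocessing script's internals aren't documented here; as a rough sketch, assuming it scales raw CIFAR-10 arrays from `data/raw/` into `data/processed/` (the `.npy` file names are assumptions):

```python
# Hypothetical preprocessing sketch; paths and file names are assumptions.
from pathlib import Path

import numpy as np

RAW_DIR = Path("data/raw")
PROCESSED_DIR = Path("data/processed")


def preprocess():
    PROCESSED_DIR.mkdir(parents=True, exist_ok=True)
    images = np.load(RAW_DIR / "images.npy")   # assumed file name
    images = images.astype("float32") / 255.0  # scale pixels to [0, 1]
    np.save(PROCESSED_DIR / "images.npy", images)


if __name__ == "__main__":
    preprocess()
```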
- Train the model:
  ```bash
  python src/train_model.py
  ```
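
The components list mentions a scikit-learn Random Forest serialized to `model.pkl`. A minimal sketch of that training step; the hyperparameters and the processed-data file names are assumptions.

```python
# Hypothetical training sketch; hyperparameters and data layout are assumptions.
import joblib
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Assumed processed artifacts; real file names may differ.
X = np.load("data/processed/images.npy")
y = np.load("data/processed/labels.npy").ravel()
X = X.reshape(len(X), -1)  # flatten 32x32x3 images into feature vectors

clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X, y)
joblib.dump(clf, "src/model/model.pkl")  # path taken from the project tree
```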
- Run the Flask API:
  ```bash
  python src/app.py
  ```
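
A minimal sketch of what `src/app.py` might look like, assuming the model is the pickled classifier above and the request carries a JSON `image` field (both assumptions):

```python
# Hypothetical Flask API sketch; the request schema is an assumption.
import joblib
import numpy as np
from flask import Flask, jsonify, request

app = Flask(__name__)
model = joblib.load("src/model/model.pkl")  # path from the project tree


@app.route("/predict", methods=["POST"])
def predict():
    image = np.asarray(request.get_json()["image"], dtype="float32")
    features = image.reshape(1, -1)  # flatten to match the training layout
    prediction = model.predict(features)
    return jsonify({"class": int(prediction[0])})


if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```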
- Build the Docker image:
  ```bash
  docker build -t flask-app .
  ```
- Run the container:
  ```bash
  docker run -p 5000:5000 flask-app
  ```
- Apply the deployment:
  ```bash
  kubectl apply -f k8s/deployment.yaml
  ```
- Apply the service:
  ```bash
  kubectl apply -f k8s/service.yaml
  ```
The project uses GitHub Actions for CI/CD:
- Build: The code is built.
- Test: Unit tests are executed using pytest (a test sketch follows this list).
- Deploy: Docker images are built and pushed to a registry, followed by Kubernetes deployment.
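
As referenced in the test item above, a unit test for the API could use Flask's test client. This is a minimal sketch; the `src.app` module path and the payload schema are assumptions.

```python
# Hypothetical unit test; module path and payload schema are assumptions.
from src.app import app


def test_predict_returns_a_class():
    client = app.test_client()
    payload = {"image": [[[0.0, 0.0, 0.0]] * 32] * 32}
    resp = client.post("/predict", json=payload)
    assert resp.status_code == 200
    assert "class" in resp.get_json()
```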
This project is licensed under the MIT License.
- `configmap.yaml`: Stores environment variables and configuration for Kubernetes pods.
- `config.py`: Centralized configuration file for the Python app, loading environment variables and constants.
- `setup.py`: Allows packaging the project as a Python package and defining console commands.
- `README.md`: Documentation that describes the project structure, setup, installation, and usage.
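
A minimal sketch of the kind of centralized configuration `config.py` could hold; the variable names and defaults are assumptions, not the repository's actual values.

```python
# Hypothetical config.py sketch; names and defaults are assumptions.
import os

DATA_DIR = os.environ.get("DATA_DIR", "data")
MODEL_PATH = os.environ.get("MODEL_PATH", "src/model/model.pkl")
API_PORT = int(os.environ.get("API_PORT", "5000"))
RANDOM_SEED = int(os.environ.get("RANDOM_SEED", "42"))
```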
This completes the advanced and professional project setup, covering AI, Data Engineering, Machine Learning, and DevOps end to end.