Skip to content

Latest commit

 

History

History
356 lines (245 loc) · 18 KB

README.md

File metadata and controls

356 lines (245 loc) · 18 KB

Machine Learning

This course provides an introduction to machine learning. Machine learning explores the study and construction of algorithms that can learn from and make predictions on data. Such algorithms operate by building a model from example inputs in order to make data-driven predictions or decisions, rather than following strictly static program instructions.

Topic categories include supervised, unsupervised, and reinforcement learning. Students will learn how to apply machine learning methods to solve problems in computer vision, natural language processing, classification, and prediction. Fundamental and current state-of-the-art methods including boosting and deep learning will be covered. Students will reinforce their learning of machine learning algorithms with hands-on tutorial oriented laboratory exercises using Jupyter Notebooks.

Prerequisites: MA-262 Probability and Statistics; programming maturity, and the ability to program in Python.

Helpful: CS3851 Algorithms, MA-383 Linear Algebra, Data Science.

ABET: Math/Science, Engineering Topics.

2-2-3 (class hours/week, laboratory hours/week, credits)

Lectures are augmented with hands-on tutorials using Jupyter Notebooks. Laboratory assignments will be completed using Python and related data science packages: NumPy, Pandas, ScipPy, StatsModels, Scikit-learn, Matplotlib, TensorFlow, Keras, PyTorch.

Outcomes:

  • Understand the basic process of machine learning.
  • Understand the concepts of learning theory, i.e., what is learnable, bias, variance, overfitting.
  • Understand the concepts and application of supervised, unsupervised, semi-supervised, and reinforcement learning.
  • The ability to analyze a data set including the ability to understand which data attributes (dimensions) affect the outcome.
  • Understand the application of learned models to problems in classification, prediction, clustering, computer vision, and NLP.
  • Understand deep learning concepts and architectures including representation learning Multi-layer Perceptrons, Convolutional Neural Networks, Recurrent Neural Networks, and Attention Mechanisms.
  • The ability to assess the quality of predictions and inferences.
  • The ability to apply methods to real world data sets.

References:

*Hands-On Machine Learning with Scikit-Learn and TensorFlow Concepts, Tools, and Techniques to Build Intelligent Systems (MLSLT), Aurélien Géron. O'Reilly Media, 2017

Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition, by Aurélien Géron, Publisher: O'Reilly Media, Inc. Release Date: June 2019 ISBN: 9781492032649

*Deep Learning with Python (DLP), François Chollet. Manning, 2017.

Deep Learning (DL), Ian Goodfellow, Yoshua Bengio, and Aaron Courville. MIT Press, 2016.

An Introduction to Statistical Learning: with Applications in R (ISLR), Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani. 2015 Edition, Springer.

Python Data Science Handbook (PDSH), Jake VanderPlas, O'Reilly.

Mining of Massive Datasets (MMDS). Anand Rajaraman and Jeffrey David Ullman. http://www.mmds.org/


Week 1: Intro to Machine Learning

Lecture:

  1. Syllabus

  2. Introduction to Machine Learning

  • Demonstrations
  • Reading: MLSLT Ch. 1
  1. Introduction to Git and GitHub
  1. Machine Learning Foundations

Lab Notebooks:

Optional tutorial notebooks:

Outcomes addressed in week 1:

  • Understand the basic process of machine learning:
  • Understand the concepts and application of supervised, unsupervised, semi-supervised, and reinforcement learning.

Week 2: Linear Regression, Multivariate Regression

Lecture:

  1. Linear Regression 1
  • Reading: PDSH Ch. 5 p. 331-375, 390-399
  • Reading: ISLR Ch. 1, 2
  1. Linear Regression Notebook Use for second lecture
  1. Generalized Linear Models Notebook Optional

Lab Notebooks:

Outcomes addressed in week 4:

  • The ability to analyze a data set including the ability to understand which data attributes (dimensions) affect the outcome.
  • The ability to perform basic data analysis and statistical inference.
  • The ability to perform supervised learning of prediction models.
  • The ability to perform data visualization and report generation.
  • The ability to apply methods to real world data sets.

Week 3: Introduction to Classification, KNN, Model Evaluation and Metrics. Logistic Regression

Lecture:

  1. Linear Model Selection and Regularization
  • Reading: ISLR Ch. 6
  1. Introduction to Machine Learning with KNN
  • Reading: ISLR Ch. 4.6.5
  1. Logistic Regression Classification
  • Reading: ISLR Ch. 4

Lab Notebooks:

Outcomes addressed in week 4:

  • The ability to assess the quality of predictions and inferences.
  • The ability to apply methods to real world data sets.
  • The ability to perform supervised learning of prediction models.

Week 4: Logistic Regression, Model Selection and Regularization, ROC

Lecture:

  1. Logistic Regression Classification
  • Reading: ISLR Ch. 4
  1. Model Evaluation and Metrics, ROC
  1. Regularization and overfitting

Lab Notebooks:

Outcomes:

  • Understand the concepts and application of supervised, unsupervised, semi-supervised, and reinforcement learning.
  • The ability to analyze a data set including the ability to understand which data attributes (dimensions) affect the outcome.
  • Understand the application of learned models to problems in classification, prediction, clustering, computer vision, and NLP.
  • The ability to assess the quality of predictions and inferences.
  • The ability to apply methods to real world data sets.

Week 5: Decision Trees, Bagging, Random Forests

Lecture:

  1. Decision Trees
  1. Bagging, Random Forests, Boosting
  • Reading: PDSH Ch. 5 p. 421-432
  • Reading: ISLR Ch. 8.2

Lab Notebooks:

Outcomes:

  • Understand the concepts and application of supervised, unsupervised, semi-supervised, and reinforcement learning.
  • The ability to analyze a data set including the ability to understand which data attributes (dimensions) affect the outcome.
  • Understand the application of learned models to problems in classification, prediction, clustering, computer vision, and NLP.
  • The ability to assess the quality of predictions and inferences.
  • The ability to apply methods to real world data sets.

Week 6: Boosting, XGBoost, Midterm

Lecture:

  1. Gradient Boosting, XGBoost

  2. Midterm Exam

Lab Notebooks:

Outcomes:

Outcomes:

  • Understand the concepts and application of supervised, unsupervised, semi-supervised, and reinforcement learning.
  • The ability to analyze a data set including the ability to understand which data attributes (dimensions) affect the outcome.
  • Understand the application of learned models to problems in classification, prediction, clustering, computer vision, and NLP.
  • The ability to assess the quality of predictions and inferences.
  • The ability to apply methods to real world data sets.

Week 7: Introduction to Deep Learning and Backpropagation

Lecture:

  1. Deep Learning Introduction 1
    Reference for earlier in class:
  1. Deep Learning Introduction 2

  2. Backpropagation

Lab Notebooks:

  • NeuralNetworkIntro submission required
  • Additional Google Colab notebooks:
    Install TensorFlow:
    conda install -c conda-forge tensorflow

  Complete the following TensorFlow "Learn and use ML" tutorials. Submit   screen shot demonstrating complete of the tutorial along with feedback to Blackboard.

  Get Started with TensorFlow

  Train your first neural network: basic classification

  Explore overfitting and underfitting

Optional:

Reading:

  • DLP Chs. 1-4

Outcomes addressed in week 8:

  • Understand the concepts of learning theory, i.e., what is learnable, bias, variance, overfitting.
  • Understand the concepts and application of supervised, unsupervised, semi-supervised, and reinforcement learning.
  • Understand the application of learned models to problems in classification, prediction, clustering, computer vision, and NLP.
  • Understand deep learning concepts and architectures including representation learning Multi-layer Perceptrons, Convolutional Neural Networks, Recurrent Neural Networks, and Attention Mechanisms.

Week 8: Deep Learning for Computer Vision

Lecture:

  1. Convolutional Neural Networks 1

  2. Convolutional Neural Networks 2

Reading:

  • DLP Ch. 5

Lab Notebooks:

Setting up your environment:

  • First, do a git pull on the course repository.

Note: You may run the lab notebooks in Google Collab. You will need a Google account and will need to store datasets in your own Google Drive folder. https://colab.research.google.com/notebooks/welcome.ipynb

Use the follow procedure when opening one of our course notebooks in Collab:

  • Select: NEW PYTHON 3 NOTEBOOK
  • Once the notebook opens, select EDIT, NOTEBOOK SETTINGS, and select GPU. You can also select Python 3 here as well.
  • Select FILE, OPEN, UPLOAD and upload your notebook.
  • For the notebooks that use the cat and dog images, you will need to upload those files to a folder on your Google Drive.

If you want to run tensorflow and keras locally, use the following installation procedure:

Notebooks:

Optional material:

Outcomes addressed in week 9:

  • Understand the concepts of learning theory, i.e., what is learnable, bias, variance, overfitting.
  • Understand the application of learned models to problems in classification, prediction, clustering, computer vision, and NLP.
  • Understand deep learning concepts and architectures including representation learning Multi-layer Perceptrons, Convolutional Neural Networks, Recurrent Neural Networks, and Attention Mechanisms.

Week 9: Deep Learning for NLP

Lecture:

  1. NLP Classification

  2. Optional Convnets for Structured Prediction

Lab Notebooks:

Outcomes addressed in week 9:

  • Understand the concepts of learning theory, i.e., what is learnable, bias, variance, overfitting.
  • Understand the concepts and application of supervised, unsupervised, semi-supervised, and reinforcement learning.
  • Understand the application of learned models to problems in classification, prediction, clustering, computer vision, and NLP.
  • Understand deep learning concepts and architectures including representation learning Multi-layer Perceptrons, Convolutional Neural Networks, Recurrent Neural Networks, and Attention Mechanisms.

Week 10:  Generative Deep Learning

Lecture:

  1. NLP Translation

  2. Deep Learning Trends NLP [DeepLearningTrendsNLP2019]

  3. Final Review Study Guide

Lab Notebooks:

Complete assignments

Outcomes addressed in week 10:

  • Understand the concepts of learning theory, i.e., what is learnable, bias, variance, overfitting.
  • Understand the concepts and application of supervised, unsupervised, semi-supervised, and reinforcement learning.
  • Understand the application of learned models to problems in classification, prediction, clustering, computer vision, and NLP.
  • Understand deep learning concepts including representation learning.

Final Exam: Monday, 8-10AM, S243.

Thomas More Catholic Church, 215 Thomas More Drive, Elgin

where she was a parishioner. Visitation will be held on Friday morning at the church from 9:30 AM until the time of mass. In lieu of flowers, the family requests donations be made to Ronald McDonald House.