Summer Course 2018 | TTI-Japan

This is a short (1 week) course aimed towards a formal introduction to the basics of probability theory and machine learning. The goal of the first half of the course is to make us comfortable and rigorously reason about probability arising in the context of computer science and machine learning. In the second half of the course, we will start with basics of Machine Learning and build our way up to Neural Networks.

Course Plan

Day 1: PyTorch Setup and Introduction to Probability

Python, PyTorch and NumPy setup
Definitions, Events, Conditional Probability, Chain Rule, Bayes Rule, Independent Events
Random Variables, Expectation, Linearity of Expectation, Independent Random Variables, Markov Inequality

Day 2: Concentration Inequalities and MLE

Coin Toss
Maximum Likelihood Estimator for Biased Coin Tosses and Dice Rolls
How to Differentiate between two Biased Coins?

Day 3: Generalization Error and Introduction to ML

Please Validate your Hypothesis
How many Samples do You Need to Learn Bias?
Machine Learning: Motivation and Introduction
Naive Bayes
slides

Day 4: Regression and Neural Networks

Logistic Regression
Optimization and Model Selection
Feedforward Neural Networks
Backpropagation
slides

Day 5: Advanced Neural Networks

Convolutional Neural Networks
Coding Session
slides

Resources

Chapter 14 of Great Ideas in Theoretical Computer Science by Anil Ada and Klaus Sutner

Lectures 10-17 of Mathematical Toolkit by Madhur Tulsiani

Lecture 6 of Introduction to the Theory of Machine Learning by Avrim Blum

TTIC Machine Learning Summer School by Suriya Gunasekar, Karl Stratos and Mesrob Ohannessian

Deep Learning Textbook by Ian Goodfellow, Yoshua Bengio and Aaron Courville

Day 1

Additional Exercises

Day 2

MLE Codes

Please find the code we used for MLE for coins here. As a guideline for the homework, we are also uploading the code for estimating bias of a 6-sided dice here.

Toy Example | Stock Market

The goal of this homework is to predict the movement of a dummy stock-market. You will be given $n$ data points: $X=x_1,x_2,…,x_n$ . The data point $x_i$ corresponds to the movement of the stock-market on $i^{th}$ day. Assume that there are $5$ working hours, and $x_i[j]$ corresponds to whether the market went up or down during $j^{th}$ hour on $i^{th}$ day. That is, $x_i[j]=1$ if market went up, and $x_i[j]=0$ if it went down.

Assume that market movements on different days are i.i.d. You may not assume anything about dependence of the market movement during a single day. For example, it may happen that if market went up in first hour, it will surely go down in the last hour irrespective of what happened in the middle three hours.

You can find the data for 10,000 days of market movement here. Your goal is to predict market movement on the last hour for the 10 days of data present here.

We will discuss the solution tomorrow.

Day 3

Toy Example | Stock Market

Naive Bayes Python Code
Naive Bayes Python Notebook
Naive Bayes Python Code View Online
Data

Day 4

logreg_numpy
logreg_numpy python file

logreg_pytorch
logreg_pytorch python file

feedforward
feedforward python file

toy_problem
toy_problem python file

Summer Course on Probability and Machine Learning

TTI-Japan | 2018

Course Plan

Day 1: PyTorch Setup and Introduction to Probability

Day 2: Concentration Inequalities and MLE

Day 3: Generalization Error and Introduction to ML

Day 4: Regression and Neural Networks

Day 5: Advanced Neural Networks

Resources

Day 1

Day 2

MLE Codes

Toy Example | Stock Market

Day 3

Toy Example | Stock Market

Naive Bayes Python Code
Naive Bayes Python Notebook
Naive Bayes Python Code View Online
Data

Day 4

logreg_numpy
logreg_numpy python file

logreg_pytorch
logreg_pytorch python file

feedforward
feedforward python file

toy_problem
toy_problem python file

Day 5

conv_imdb
conv python file

feedforward_imdb
feedforward_imdb python file

dataset

Course Plan

Day 1: PyTorch Setup and Introduction to Probability

Day 2: Concentration Inequalities and MLE

Day 3: Generalization Error and Introduction to ML

Day 4: Regression and Neural Networks

Day 5: Advanced Neural Networks

Resources

Day 1

Day 2

MLE Codes

Toy Example | Stock Market

Day 3

Toy Example | Stock Market Naive Bayes Python Code Naive Bayes Python Notebook Naive Bayes Python Code View Online Data

Day 4

logreg_numpy logreg_numpy python file logreg_pytorch logreg_pytorch python file feedforward feedforward python file toy_problem toy_problem python file

Day 5

conv_imdb conv python file feedforward_imdb feedforward_imdb python file dataset

Toy Example | Stock Market

Naive Bayes Python Code
Naive Bayes Python Notebook
Naive Bayes Python Code View Online
Data

logreg_numpy
logreg_numpy python file

logreg_pytorch
logreg_pytorch python file

feedforward
feedforward python file

toy_problem
toy_problem python file

conv_imdb
conv python file

feedforward_imdb
feedforward_imdb python file

dataset