Machine Learning

Definition

Machine Learning
: A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E

Task T

ML tasks are usually described in terms of how the ML system should process an example.

An example is a collection of features that have been quantitatively measured from some object or event that we want the ML system to process. We typically represent an example as a vector xRn where each entry xi of the vector is an feature.

Most common ML tasks:

Performance Measure P

In order to evaluate the abilities of a machine learning algorithm, we must design a quantitative measure of its performance. Usually this performance measure P is specific to the task T being carried out by the system.

Most common performance measures:

Usually we are interested in how well the machine learning algorithm performs on data that it has not seen before. We therefore evaluate these performance measures using a test set of data that is separate from the data used for training the machine learning system.

Experience E

The experience of most ML algorithms is the dataset, a collection of many examples/data points, and an example is a collection of features.

ML algorithms can be broadly categorized as Unsupervised Learning or Supervised Learning by what kind of experience they are allowed to have during the learning process.

Blurry line between UL and SL

The chain rule of probability states that for a vector xRn, the joint distribution can be decomposed as

p(x)=i=1np(xix1,,xi1)

This decomposition means that we can solve the ostensibly unsupervised problem of modeling p(x) by splitting it into n SL problems.

Alternatively we can solve the SL problem of learning p(y|x) by applying UL to learn the joint distribution p(x,y) and inferring

p(yx)=p(x,y)yp(x,y)

Describing a Dataset

Methods

General Concepts & Techniques

Course Log

2023-01-17

graph TD
B[0. Build Model] --> C[2. Infer hidden variables]
A[1. Data] --Optimization--> C
C --> D[3. Predict & Explore]
A --Supervise--> D
Gaussian Distribution

We want to estimate the distribution of some data.

  • Block 0. We assume it is a multivariate Gaussian distribution
  • Block 1. Sample data
  • Block 2. Maximum Likelihood Estimation
  • Block 3. Predict the properties of the whole dataset

2023-01-19

2023-01-24

2023-01-26

2023-01-31

2023-02-02

2023-02-07

2023-02-09

2023-02-14

2023-02-016

2023-02-21/23

2023-02-28

2023-03-2

2023-03-21

2023-03-23

2023-03-28

2023-03-30

2023-04-04

2023-04-06

2023-04-11

2023-04-13

2023-04-18

2023-04-20

Creative Commons License by zcysxy