study_machine_learning: A Python repository from YoungsoonLee

홍콩 과기대 김성훈 교수님의 수업.
[모두를 위한 머신러닝/딥러닝 강의] 공부 정리

- https://hunkim.github.io/ml/

- Python(v3.5), window 10 64bit 환경에서 코딩.

00. 개요.

machine learning 이란?
- Supervised learning
  ex, Image labeling, Email spam filter, Predicting exam score
  type
  + Predicting final exam score based on time spent - regression
  + Pass/non-pass based on time spent - binary classification
  + Letter grade(A, B, C, E and F) based on time spent - multi-label classification
- Unsupervised learning
  un-labeled data

01. Linear regression

Goal: predicting
Hypothesis and Cost(Loss) function

Goal: Minimize cost

  minimize cost(W, b)  
     W,b   

  Gradient descent algorithm

02. Logistic Classification

Goal: Spam Detection(Spam ot Ham), Facebook feed(show or hide), Credit Crad Fraud Detect(legitimate/fraud), Stock...
Hypothesis

Cost function
Goal: Minimize cost

	Gradient descent algorithm

03. Softmax classification: Multinomial classification -> again

여러개의 class가 있을때 그것을 예측. like grade

Hypothesis

Cost function

Goal: Minimize cost

04. MNIST

...

05. Neural Network

XOR  

How can we learn W, and b from trading data?  -> !!!  
	Back propagarion(chain rule)

06. TensorBoard & Better Deep learning

learning_rate-> affect to cost step
Data Preprocessing -> Standardzation ex, X_std[:,0] = (X[:,0] - X[:,0].mean()) / X[:,0].std()
Online learning -> 

Sigmoid -> ReLU ...  
Weights -> RBM (Restricted Boltzmann Machine, encoder/decorder)  
		-> Xavier initialization  
Overfitting -> More tranining data  
			-> reduce the number of features
			-> Regularization(not have too big numbers in the weight) -> l2reg  
			-> Dropout  
Ensemble

07. CNN (Convolutional Neural Network)

filters  
	Weights(depth), how many focus at once  
	output is one value  

how many numbers can we get? (how many output with filter)  
	Output size:
		( (N - F) / stride ) + 1  

Pad  
	block to small image , know the edge  
	add zero pad the border  
	make same input size and output size  

How many weight variables?  
	ex. 5*5*3*6

Pooling(sampling)  
	why sampling? -> make layer to smaller  
	max pooling

08. RNN (Recurrent Neural Network)

Sequence data  
state

ex.
	language Modeling  
	Speech Recognition  
	Machine Translation  
	Bot  
	image/video captioning  

Algol:
	Long Short Term Memory (LSTM)  
	GRU

09. Reinforcement Learning

Environment  
Actor(Agent)  
Action ->  
		<- state, reward

10. Q-Learning

Q function  
	Q(state, action) -> quality(reward)  
Policy  
	Max Q = maxQ(s, a)  
	π = argmaxQ(s, a)  
How learn Q?
	Q(s, a) <- r + maxQ(s`, a`)  
Exploit VS Exploration  
	E-greedy  
		decaying E-greedy  
	add random noise  
Discounted reward  
	Q(s, a) <- r + γmaxQ(s`, a`)  
Non-deterministic(Stochastic)    
	learning rate  
	α = 0.1
		Q(s, a) <- (1-α)Q(s, a) + α [r + γmaxQ(s`, a`) ]  
		Q(s, a) <- Q(s, a) + α [r + γmaxQ(s`, a`) - Q(s, a)]

11. Q-Network

Q-Table?  
	too big in real world  
Q-function network  
	input: state
	output: all action

12. DQN

Q-Network problems
	1. correlations sample  
	2. non-stationary targets  
Solve  
	1. Go deep  
	2. experience reply -> store result to buffer and then batch randomly  
	3. Separate target network

YoungsoonLee/study_machine_learning

홍콩 과기대 김성훈 교수님의 수업. [모두를 위한 머신러닝/딥러닝 강의] 공부 정리

- https://hunkim.github.io/ml/

- Python(v3.5), window 10 64bit 환경에서 코딩.

00. 개요.

01. Linear regression

02. Logistic Classification

03. Softmax classification: Multinomial classification -> again

04. MNIST

05. Neural Network

06. TensorBoard & Better Deep learning

07. CNN (Convolutional Neural Network)

08. RNN (Recurrent Neural Network)

09. Reinforcement Learning

10. Q-Learning

11. Q-Network

12. DQN

홍콩 과기대 김성훈 교수님의 수업.
[모두를 위한 머신러닝/딥러닝 강의] 공부 정리