- This repo is to maintain the work of the data analysis club.
- Programming activities in weekly seminar
- Nov. 5, 2019 ~ Mar. 11, 2021
- Practice Numpy, Pandas and Matplotlib | Presentation
[1] Wes McKinney. (2012). Python for Data Analysis. O'Reilly Media, Inc.
- Build in Data structure functions and files
- Arrays and Vectorized Computations
- Getting started with Pandas
- Data Loading Storage and File Formats
- Data Cleaning and Preparation
- Data Wrangling Join Combine and Reshape
- Plotting and Visualization
- Data Aggregation and Group Operations
- Time Series
- Advanced pandas
- Practice Plot, Treemap, Bubble Chart and Mosaic Plot
- Practice Simple, Multiple Regression and Logistic Regression | Presentation
- Web Crawler for downloading images
- English Text mining with 'tm' | Presentation
- Visualization with 'wordcloud'
- KNN (K-Nearest_Neighbor)
- Linear regression
- Ridge, Lasso
- Elastic net
- Logistic regression
- Naive Bayes Classifier
- Decision tree
- Random Forest
- Gradient Boosting
- Ada Boost Algorithm
- SVM (Support Vector Machine) | Presentation
- MLP (Multi-layer Perceptron) | Presentation
- PCA
- K-Means
- Agglomerative Clustering
- DBSCAN
- Practice MATLAB basic
- Gradient Descent with 1 weight and 2 weight (adjust the learning rate)
- Simple Linear Regression with women height&weight dataset
- Multiple Linear Regression
- Optimal Valuable Selection with AIC and K-fold Cross Validation
- LASSO Regression
- Decision Tree
- KNN Algorithm
- Correlation analysis
- Clustering Analysis with iris
- Hierarchical clustering and PCA
-
Titanic Tutorial | Presentation
Exploratory data analysis, visualization, machine_learning
Cross Validation, Confusion Matrix, Hyperparameter-Tuning, Ensembling (Voting, Bagging, Boosting), Feature Importance -
Household Poverty Level Prediction | Presentation
Feature Engineering, Machine Learning, Model Selection, Feature Selection, Gradient Boosting -
Tensorflow Speech Recognition Challenge | Presentation
Speech representation and data exploration, Light-Weight CNN, 1D Inception approach
[1] PyTorch로 시작하는 딥 러닝 입문, https://wikidocs.net/book/2788
- PyTorch Basic
- Linear Regression | Presentation
[1] OpenCV 4로 배우는 컴퓨터 비전과 머신 러닝, https://thebook.io/006939/
[2] sunkyoo/opencv4cvml, https://github.com/sunkyoo/opencv4cvml/tree/main/python
- Opencv Basic Classes | Presentation
- Edge detection, Hough transform, Color inverse | Presentation
[1] 딥러닝을 이용한 자연어 처리, https://wikidocs.net/book/2155
- Pandas Profiling
- Tokenization, Cleaning and Normalization, Stemming and Lemmatization, Stopword, Regular Expression