Pinned Repositories
2014
Official content for the Fall 2014 Harvard CS109 Data Science course
2014_data
Data directory for the CS109 Data Science course
advanceddb_code
Angular-and-ASPNET-Build2014
Source code from the Build 2014 presentation by David Catuhe and Jon Galloway
angular-lynda
Building a Data-Driven App with AngularJS with Ray Villalobos
AutoBuildServerTesting
dmlc-mxnet
Leetcode
ml_spark
machine learning code in spark
scikit-learn
scikit-learn: machine learning in Python
YanLiang1102's Repositories
YanLiang1102/Leetcode
YanLiang1102/advanceddb_code
YanLiang1102/AutoBuildServerTesting
YanLiang1102/dmlc-mxnet
YanLiang1102/ml_spark
machine learning code in spark
YanLiang1102/Compiler_Project_OU
YanLiang1102/Computer_Graphics_OU
YanLiang1102/course-Neural-Networks-for-Machine-Learning
Neural Networks for Machine Learning (University of Toronto)
YanLiang1102/coursera-natural-language-processing-specialization
Programming assignments from all courses in the Coursera Natural Language Processing Specialization offered by deeplearning.ai.
YanLiang1102/cryptography-code
YanLiang1102/CS224n
CS224n: Natural Language Processing with Deep Learning Assignments Winter, 2017
YanLiang1102/DeepLearning
YanLiang1102/EasyNLP
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
YanLiang1102/EventExtractionPapers
A list of NLP resources focused on event extraction task
YanLiang1102/free-programming-books-zh_CN
免费的计算机编程类中文书籍,欢迎投稿
YanLiang1102/HMEAE
Source code for EMNLP-IJCNLP 2019 paper "HMEAE: Hierarchical Modular Event Argument Extraction".
YanLiang1102/JAVE
YanLiang1102/lihang-code
《统计学习方法》的代码实现
YanLiang1102/machine-learning-classify-handwritten-digit
Classify handwritten digits using machine learning techniques Yan Liang, Yunzhi Wang and Delong Zhao Project scope For our machine learning project, we propose to build several machine learning classifiers that recognize handwritten digits. Handwritten digit recognition is a classic problem in machine learning studies for many years. We plan to do several experiments using different machine learning algorithms and compare the pattern recognition performance. We hope to create a classifier that has same or better categorization accuracy than record performance from previous studies. Yan will focus on neural network, Delong will focus on the random forests methods, and Yunzhi will focus on SVMs and KNNs. We will also develop a final novel classifier that combines the best models from our different experiments. We hypothesize that the final classifier will archive a categorization accuracy of 0.99. This indicates that the classifier correctly classified all the handwritten digits but 1% of the images. The goal of handwritten digit recognition is to determine what digit is from an image of a single handwritten digit. It can be used to test pattern recognition theories and machine learning algorithms. Preprocessed standard handwritten digit image database has been developed to compare different digit recognizers. In our semester project, we will use modified National Institute of Standards and Technology (MNIST) handwritten digit images dataset from kaggle digit recognizer project. The Kaggle MNIST dataset is freely available and collected 28,000 training images and 42,000 test images. Each image is a preprocessed single black and white digit image with 28 x 28 pixels. Each pixel is an integer value range from 0 to 255 which represent the brightness of the pixel, the higher value meaning darker. Each image also has a label which is the correct digit for the handwritten image. For each input handwritten image, our model will output which digit we predict and evaluate with the correct label. We will use 28,000 training images to train our machine learning model and use 42,000 test images to test the performance. Then we will calculate the percentage of the test images that are correctly classified and compare the performance of different machine learning algorithms.
YanLiang1102/MAVEN-dataset
Source code and dataset for EMNLP 2020 paper "MAVEN: A Massive General Domain Event Detection Dataset".
YanLiang1102/n3-collection
N3 - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
YanLiang1102/OU-ISE-STOCK-PREDICTION
OU-ISE5103-FinalProject-StockPrediction
YanLiang1102/Parallel-Programming-Yan
parallel programming code
YanLiang1102/PEPLER
Personalized Prompt Learning for Explainable Recommendation
YanLiang1102/PrefixTuning
Prefix-Tuning: Optimizing Continuous Prompts for Generation
YanLiang1102/Product_Knowledge_Graph_Tutorial_KDD2021
YanLiang1102/restangular
AngularJS service to handle Rest API Restful Resources properly and easily
YanLiang1102/text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
YanLiang1102/xpert
Code for XPERT algorithm from Personalized Retrieval over Millions of Items
YanLiang1102/YanLiang1102.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes