cocoxu
Assistant professor at Georgia Tech, researching natural language processing, machine learning, and social media.
Georgia Institute of Technology, United States
Pinned Repositories
5525_perceptron_tagger
CS7650_spring2024
CS 7650 (graduate-level NLP class) at Georgia Tech
multip
source code of Multiple-instance Learning Paraphrase (MultiP) Model for Twitter
par4sem
Adaptive Paraphrasing for Semantic Writing Aid tools
SemEval-PIT2015
data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015
Shakespeare
simplification
Text Simplification System and Dataset
tweet_deduplicator
remove duplicate (identical or near-identical) tweets; sentence splitter for Twitter data.
twitterparaphrase
paraphrase models using Twitter as data resource
twittersummarization
Twitter Summarization
cocoxu's Repositories
cocoxu/simplification
Text Simplification System and Dataset
cocoxu/SemEval-PIT2015
data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015
cocoxu/multip
source code of Multiple-instance Learning Paraphrase (MultiP) Model for Twitter
cocoxu/CS7650_spring2024
CS 7650 (graduate-level NLP class) at Georgia Tech
cocoxu/par4sem
Adaptive Paraphrasing for Semantic Writing Aid tools
cocoxu/tweet_deduplicator
remove duplicate (identical or near-identical) tweets; sentence splitter for Twitter data.
cocoxu/5525_perceptron_tagger
cocoxu/5525_sentiment
cocoxu/acl-pub
Place to collect updated documents needed for ACL publications.
cocoxu/acl17-handbook
ACL 2017 conference handbook
cocoxu/alignment-scripts
Scripts to preprocess training and test data and to run fast_align and giza
cocoxu/aritter.github.io
aritter.github.io
cocoxu/awesome-bert
BERT NLP papers, applications, and GitHub resources, including the newest XLNet; BERT- and XLNet-related papers and GitHub projects
cocoxu/cocoxu.github.io
Wei Xu's Homepage
cocoxu/CRF-AE
Code for EMNLP 2018 paper https://arxiv.org/pdf/1808.09075.pdf
cocoxu/CS7650_spring2024_projects
CS 7650 (graduate-level NLP class) at Georgia Tech
cocoxu/Datasets
Datasets for various projects
cocoxu/english-words
A text file containing 479k English words for all your dictionary/word-based projects, e.g., auto-completion / autosuggestion
cocoxu/incubator-joshua
Mirror of Apache Joshua (Incubating)
cocoxu/lexi-frontend
Frontend for the Lexi web extension
cocoxu/lexi-server
cocoxu/NeuralTextSimplification
Exploring Neural Text Simplification
cocoxu/OpenNMT-py
Open Source Neural Machine Translation in PyTorch
cocoxu/socialmedia-class.github.io
Social Media and Text Analytics Course at UPenn
cocoxu/Stanceosaurus
cocoxu/SurveyMan
SurveyMan programming language.
cocoxu/ubscrape
ubscrape is an Urban Dictionary scraper for NLP or other large scale analyses.
cocoxu/WIKIBIAS
cocoxu/WLP-Dataset
cocoxu/WLP-Parser
This repository contains a collection of neural network models that we used to demonstrate the utility of our dataset.