Pinned Repositories
sharedtask2021
Repository for DISRPT2021 shared task
amalgum
English web corpus with 4M tokens and several annotation types
lab
corpling@GU lab website
acl-anthology
Data and software for building the ACL Anthology.
allennlp
An open-source NLP research library, built on PyTorch.
crossGENRE4RST
Supplementary Materials for EACL2023: Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity
GUMSum4EVAL
Repository for ACL2023 (Findings): GUMSum: Multi-Genre Data and Evaluation for English Abstractive Summarization
rst2dep
Converter for RST Trees (.rs3/4) and RST Dependency (.rsd) - Signal Handling (adapted)
rstWeb
Repository for rstWeb, a browser-based annotation interface for Rhetorical Structure Theory
sharedtask2019
Repository for DISRPT2019 shared task
janetlauyeung's Repositories
janetlauyeung/crossGENRE4RST
Supplementary Materials for EACL2023: Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity
janetlauyeung/GUMSum4EVAL
Repository for ACL2023 (Findings): GUMSum: Multi-Genre Data and Evaluation for English Abstractive Summarization
janetlauyeung/acl-anthology
Data and software for building the ACL Anthology.
janetlauyeung/allennlp
An open-source NLP research library, built on PyTorch.
janetlauyeung/rst2dep
Converter for RST Trees (.rs3/4) and RST Dependency (.rsd) - Signal Handling (adapted)
janetlauyeung/rstWeb
Repository for rstWeb, a browser-based annotation interface for Rhetorical Structure Theory
janetlauyeung/sharedtask2019
Repository for DISRPT2019 shared task
janetlauyeung/bert
TensorFlow code and pre-trained models for BERT
janetlauyeung/broadway-data-analysis
janetlauyeung/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
janetlauyeung/gum
Repository for the Georgetown University Multilayer Corpus (GUM)
janetlauyeung/handson-ml
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn and TensorFlow.
janetlauyeung/handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
janetlauyeung/jieba
Jieba Chinese word segmentation
janetlauyeung/lazynlp
Library to scrape and clean web pages to create massive datasets.
janetlauyeung/learning-area
GitHub repo for the MDN Learning Area.
janetlauyeung/ling504-discpar
janetlauyeung/nlp-in-ling
Natural Language Processing Research in North American Linguistics Departments
janetlauyeung/NLPErrors4RST
Code and data repository for SIGDIAL 2023: What’s Hard in English RST Parsing? Predictive Models for Error Analysis
janetlauyeung/nltk
NLTK Source
janetlauyeung/python-is-cool
Cool Python features for machine learning that I used to be too afraid to use
janetlauyeung/PyTorchNLPBook
Code and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://nlproc.info
janetlauyeung/rst-converter-service
REST API to convert between different Rhetorical Structure Theory file formats
janetlauyeung/rst-coref
RST discourse parsing with coreference information.
janetlauyeung/Shallow-Discourse-Annotation-for-Chinese-TED-Talks
Datasets for "Shallow Discourse Annotation for Chinese TED Talks", accepted at LREC 2020
janetlauyeung/streusle
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
janetlauyeung/THULAC-Python
An Efficient Lexical Analyzer for Chinese
janetlauyeung/udapi-python
Python framework for processing Universal Dependencies data