zysszy's Stars
mbadolato/iTerm2-Color-Schemes
Over 385 terminal color schemes/themes for iTerm/iTerm2. Includes ports to Terminal, Konsole, PuTTY, Xresources, XRDB, Remmina, Termite, XFCE, Tilda, FreeBSD VT, Terminator, Kitty, MobaXterm, LXTerminal, Microsoft's Windows Terminal, Visual Studio, Alacritty, Ghostty, and many more
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
flairNLP/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
google-research/text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
coq/coq
Coq is a formal proof management system. It provides a formal language to write mathematical definitions, executable algorithms and theorems together with an environment for semi-interactive development of machine-checked proofs.
dbiir/UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
google-research/electra
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
IBM/Project_CodeNet
Supports contributions of tools for the Project CodeNet dataset hosted in DAX
ymcui/Chinese-ELECTRA
Pre-trained Chinese ELECTRA models
weihua916/powerful-gnns
How Powerful are Graph Neural Networks?
Helsinki-NLP/Tatoeba-Challenge
google-research/bleurt
BLEURT is a metric for Natural Language Generation based on transfer learning.
Vincentzyx/DouZero_For_HLDDZ_FullAuto
Using DouZero to automate Huanle Dou Dizhu (HLDDZ, "Happy Dou Dizhu")
davidhalter/parso
A Python Parser
princeton-vl/CoqGym
A Learning Environment for Theorem Proving with the Coq proof assistant
quincyliang/nlp-public-dataset
Chinese and English NER datasets, Chinese-English machine translation datasets, and Chinese word segmentation datasets
liuhuigmail/GrowingBugRepository
A bug repository that keeps growing
linanqiu/reddit-dataset
Dataset of threads and comments from reddit
zysszy/TreeGen
A Tree-Based Transformer Architecture for Code Generation. (AAAI'20)
CoderPat/structured-neural-summarization
Code for the paper of the same title, "Structured Neural Summarization"
pkuzqh/Recoder
RobustNLP/TestTranslation
A toolkit for testing machine translation [ICSE'20, '21, ESEC/FSE'20]
DJjjjhao/FIRA-ICSE
This repository is the replication package of the ICSE22 paper "FIRA: Fine-Grained Graph-Based Code Change Representation for Automated Commit Message Generation"
zysszy/TreeGen-Pytorch
zysszy/CAT
Improving Machine Translation Systems via Isotopic Replacement
pkuzqh/OCoR
zysszy/Preferential-Labeling
Preferential Labeling for Unattributed Node Classification in GNNs
SunflowerPKU/FSE20Dataset
Dataset of FSE20 paper