Pinned Repositories
gemini-benchmark
Augmentation-Adapted-Retriever
[ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In".
Seq2Seq-Prompt
Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"
BioLAMA-1
EMNLP'2021: Can Language Models be Biomedical Knowledge Bases?
dataset-recommendation-pub
FiD
Fusion-in-Decoder
LM-BFF
ACL'2021: LM-BFF: Better Few-shot Fine-tuning of Language Models
OOP_QA
OOP QAList
pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
yuzc19.github.io
Personal homepage for Zichun Yu
yuzc19's Repositories
yuzc19/yuzc19.github.io
Personal homepage for Zichun Yu
yuzc19/BioLAMA-1
EMNLP'2021: Can Language Models be Biomedical Knowledge Bases?
yuzc19/dataset-recommendation-pub
yuzc19/FiD
Fusion-in-Decoder
yuzc19/LM-BFF
ACL'2021: LM-BFF: Better Few-shot Fine-tuning of Language Models
yuzc19/OOP_QA
OOP QAList
yuzc19/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
yuzc19/Ray-tracing-engine
yuzc19/SimCSE
EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
yuzc19/unifiedqa
UnifiedQA: Crossing Format Boundaries With a Single QA System
yuzc19/WA-AC
This repository is used to save algorithm learning materials.
yuzc19/zcore-tests
Test scripts for zCore OS
yuzc19/dclm
DataComp for Language Models
yuzc19/doremi
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
yuzc19/galactic
Data cleaning and curation for unstructured text
yuzc19/lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
yuzc19/NeMo-Curator
Scalable data preprocessing and curation toolkit for LLMs
yuzc19/SemDeDup
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).