yuzc19

PhD student at CMU

Carnegie Mellon University

Pinned Repositories

gemini-benchmark
Language:Jupyter Notebook149 8 1422
Augmentation-Adapted-Retriever
[ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In".
Language:Python59 4 65
Seq2Seq-Prompt
Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"
Language:Python24 6 04
BioLAMA-1
EMNLP'2021: Can Language Models be Biomedical Knowledge Bases?
Language:Python1 0 00
dataset-recommendation-pub
Language:Python1 0 00
FiD
Fusion-in-Decoder
Language:Python1 0 00
LM-BFF
ACL'2021: LM-BFF: Better Few-shot Fine-tuning of Language Models
Language:Python1 0 00
OOP_QA
OOP QAList
Language:C++1 0 00
pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
Language:Python1 0 00
yuzc19.github.io
Personal homepage for Zichun Yu
Language:HTML2 0 00

yuzc19's Repositories

yuzc19/yuzc19.github.io
Personal homepage for Zichun Yu
Language:HTML2 0 00
yuzc19/BioLAMA-1
EMNLP'2021: Can Language Models be Biomedical Knowledge Bases?
Language:Python1 0 00
yuzc19/dataset-recommendation-pub
Language:Python1 0 00
yuzc19/FiD
Fusion-in-Decoder
Language:Python1 0 00
yuzc19/LM-BFF
ACL'2021: LM-BFF: Better Few-shot Fine-tuning of Language Models
Language:Python1 0 00
yuzc19/OOP_QA
OOP QAList
Language:C++1 0 00
yuzc19/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
Language:Python1 0 00
yuzc19/Ray-tracing-engine
Language:C1 1 00
yuzc19/SimCSE
EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Language:Python1 0 00
yuzc19/unifiedqa
UnifiedQA: Crossing Format Boundaries With a Single QA System
Language:Python1 0 00
yuzc19/WA-AC
This repository is used to save algorithm learning materials.
Language:C++1 0 0
yuzc19/zcore-tests
Test scripts for zCore OS
Language:Python1 0 0
yuzc19/dclm
DataComp for Language Models
Language:HTML0 0
yuzc19/doremi
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
Language:Python0 0
yuzc19/galactic
data cleaning and curation for unstructured text
Language:Python0 0
yuzc19/lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Language:Python0 0
yuzc19/NeMo-Curator
Scalable data pre processing and curation toolkit for LLMs
Language:Jupyter Notebook
yuzc19/SemDeDup
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).
Language:Python0 0

yuzc19

Pinned Repositories

gemini-benchmark

Augmentation-Adapted-Retriever

Seq2Seq-Prompt

BioLAMA-1

dataset-recommendation-pub

FiD

LM-BFF

OOP_QA

pet

yuzc19.github.io

yuzc19's Repositories

yuzc19/yuzc19.github.io

yuzc19/BioLAMA-1

yuzc19/dataset-recommendation-pub

yuzc19/FiD

yuzc19/LM-BFF

yuzc19/OOP_QA

yuzc19/pet

yuzc19/Ray-tracing-engine

yuzc19/SimCSE

yuzc19/unifiedqa

yuzc19/WA-AC

yuzc19/zcore-tests

yuzc19/dclm

yuzc19/doremi

yuzc19/galactic

yuzc19/lit-gpt

yuzc19/NeMo-Curator

yuzc19/SemDeDup