ZhangShiyue

UNC-CHChapel Hill, NC, US

ZhangShiyue's Stars

tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:Python29k 341 2664k
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Language:Python14.2k 263 2012.5k
facebookresearch/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
Language:Python10.4k 284 1.5k2.1k
facebookresearch/metaseq
Repo for external large-scale work
Language:Python6.4k 109 292719
openai/gpt-2-output-dataset
Dataset of GPT-2 outputs for research in detection, biases, and more
Language:Python1.9k 76 47542
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
Language:Python1.6k 42 20101
harvardnlp/pytorch-struct
Fast, general, and tested differentiable structured prediction in PyTorch
Language:Jupyter Notebook1.1k 34 5592
hendrycks/test
Measuring Massive Multitask Language Understanding | ICLR 2021
Language:Python1k 20 1979
gregdurrett/berkeley-doc-summarizer
The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploits syntactic information to compress it, and uses coreference constraints to ensure clarity.
Language:Scala742 26 664
giuven95/chatgpt-failures
Failure archive for ChatGPT and similar models
Language:Python583 24 923
Yale-LILY/SummEval
Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper
Language:Python348 9 4141
salesforce/factCC
Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper
Language:Python266 10 1431
Alex-Fabbri/Multi-News
Large-scale multi-document summarization dataset and code
Language:Python263 3 3654
krishnap25/mauve
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
Language:Python263 4 1325
addpipe/simple-web-audio-recorder-demo
A simple HTML/JS demo that uses WebAudioRecorder.js to record audio on a web page
Language:JavaScript181 11 12107
yzpang/gold-off-policy-text-gen-iclr21
Language:Python49 3 06
uwnlp/qamr
Question-Answer Meaning Representation
Language:Scala48 8 410
danieldeutsch/repro
Repro is a library for easily running code from published papers via Docker.
Language:Python40 1 106
martiansideofthemoon/longeval-summarization
Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https://arxiv.org/abs/2301.13298).
Language:Python39 1 04
ItzikMalkiel/MTAdam
MTAdam: Automatic Balancing of Multiple Training Loss Terms
Language:Python35 1 26
Yale-LILY/ROSE
Language:Python31 13 11
swarnaHub/SummarizationPrograms
[ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
Language:Python23 4 21
kleinay/QANom
Language:Python20 5 03
bloomberg/MixCE-acl2023
Implementation of MixCE method described in ACL 2023 paper by Zhang et al.
Language:Python17 8 03
julianmichael/qasrl
Tools for working with QA-SRL data and annotating it with crowdsourcing.
Language:Scala127
OriShapira/LitePyramids
Method for evaluating system summaries manually, via crowdsourcing, using a summarization dataset that includes reference summaries.
Language:Python10 1 00
john-hewitt/truncation-sampling
Codebase describing experiments in Truncation Sampling as Language Model Desmoothing
Language:Jupyter Notebook9 1 03
DanielaBWeiss/QA-ALIGN
QA-ALIGN: Representing Cross-Text Content Overlap by Aligning Question-Answer Propositions
Language:Jupyter Notebook81
ginn-org/ginn
A minimalistic, header only neural net library
Language:C++4 1 321
kleinay/qasrl-crowdsourcing
Language:Scala11

ZhangShiyue

ZhangShiyue's Stars

tatsu-lab/stanford_alpaca

openai/evals

facebookresearch/ParlAI

facebookresearch/metaseq

openai/gpt-2-output-dataset

AetherCortex/Llama-X

harvardnlp/pytorch-struct

hendrycks/test

gregdurrett/berkeley-doc-summarizer

giuven95/chatgpt-failures

Yale-LILY/SummEval

salesforce/factCC

Alex-Fabbri/Multi-News

krishnap25/mauve

addpipe/simple-web-audio-recorder-demo

yzpang/gold-off-policy-text-gen-iclr21

uwnlp/qamr

danieldeutsch/repro

martiansideofthemoon/longeval-summarization

ItzikMalkiel/MTAdam

Yale-LILY/ROSE

swarnaHub/SummarizationPrograms

kleinay/QANom

bloomberg/MixCE-acl2023

julianmichael/qasrl

OriShapira/LitePyramids

john-hewitt/truncation-sampling

DanielaBWeiss/QA-ALIGN

ginn-org/ginn

kleinay/qasrl-crowdsourcing