ZhangShiyue's Stars
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
facebookresearch/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
facebookresearch/metaseq
Repo for external large-scale work
openai/gpt-2-output-dataset
Dataset of GPT-2 outputs for research in detection, biases, and more
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
harvardnlp/pytorch-struct
Fast, general, and tested differentiable structured prediction in PyTorch
hendrycks/test
Measuring Massive Multitask Language Understanding | ICLR 2021
gregdurrett/berkeley-doc-summarizer
The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploits syntactic information to compress it, and uses coreference constraints to ensure clarity.
giuven95/chatgpt-failures
Failure archive for ChatGPT and similar models
Yale-LILY/SummEval
Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper
salesforce/factCC
Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper
Alex-Fabbri/Multi-News
Large-scale multi-document summarization dataset and code
krishnap25/mauve
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
addpipe/simple-web-audio-recorder-demo
A simple HTML/JS demo that uses WebAudioRecorder.js to record audio on a web page
yzpang/gold-off-policy-text-gen-iclr21
uwnlp/qamr
Question-Answer Meaning Representation
danieldeutsch/repro
Repro is a library for easily running code from published papers via Docker.
martiansideofthemoon/longeval-summarization
Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https://arxiv.org/abs/2301.13298).
ItzikMalkiel/MTAdam
MTAdam: Automatic Balancing of Multiple Training Loss Terms
Yale-LILY/ROSE
swarnaHub/SummarizationPrograms
[ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
kleinay/QANom
bloomberg/MixCE-acl2023
Implementation of MixCE method described in ACL 2023 paper by Zhang et al.
julianmichael/qasrl
Tools for working with QA-SRL data and annotating it with crowdsourcing.
OriShapira/LitePyramids
Method for evaluating system summaries manually, via crowdsourcing, using a summarization dataset that includes reference summaries.
john-hewitt/truncation-sampling
Codebase describing experiments in Truncation Sampling as Language Model Desmoothing
DanielaBWeiss/QA-ALIGN
QA-ALIGN: Representing Cross-Text Content Overlap by Aligning Question-Answer Propositions
ginn-org/ginn
A minimalistic, header only neural net library
kleinay/qasrl-crowdsourcing