S1s-Z's Stars
UKPLab/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
THUDM/LongBench
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
epfml/landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
armancohan/long-summarization
Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"
IBM/transition-amr-parser
SoTA Abstract Meaning Representation (AMR) parsing with word-node alignments in Pytorch. Includes checkpoints and other tools such as statistical significance Smatch.
microsoft/FILM
Official repo for "Make Your LLM Fully Utilize the Context"
tianyi-lab/Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
bjascob/amrlib
A python library that makes AMR parsing, generation and visualization simple.
nikhil-ghosh-berkeley/loraplus
git-cloner/llama2-lora-fine-tuning
llama2 finetuning with deepspeed and lora
THUDM/LongAlign
LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation
vihangd/alpaca-qlora
Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA
tianyi-lab/Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
tianyi-lab/Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
IronBeliever/CaR
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
luisfredgs/LSA-Text-Summarization
October2001/ProLong
[ACL 2024] A Prospector of Long-Dependency Data for Large Language Models
OceannTwT/era-cot
[ACL 2024] ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis.
bjascob/amrlib-models
Repository for model used in amrlib
causalNLP/amr_llm
This repo explores how AMR to address tasks difficult for LLMs
MayDomine/Burst-Attention
Distributed IO-aware Attention algorithm
baoguangsheng/gemini
Code base for "GEMINI: Controlling the Sentence-level Writing Style for Abstractive Text Summarization".
nttcslab-nlp/RSTParser_EACL24
Implementation of "Can we obtain significant success in RST discourse parsing by using Large Language Models?" (accepted by EACL 2024)
nightdessert/SkipAlign
AbineshSivakumar/Llama-2-7B-QLoRA-Vicuna
This repository contains code to fine-tune a Llama-7B-Uncensored model using the Vicuna 70k dataset using Quantised Low Rank Adapations (LoRA).
JianGuanTHU/LLMforFV
herrxy/RST-DocMT