AkariAsai's Stars
openai/openai-cookbook
Examples and guides for using the OpenAI API
meta-llama/llama
Inference code for Llama models
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
pytorch/torchtune
PyTorch native post-training library
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
nmslib/hnswlib
Header-only C++/python library for fast approximate nearest neighbors
allenai/open-instruct
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
gururise/AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
MLGroupJLU/LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
lucidrains/RETRO-pytorch
Implementation of RETRO, DeepMind's Retrieval based Attention net, in PyTorch
primeqa/primeqa
The prime repository for state-of-the-art Multilingual Question Answering research and development.
amazon-science/RAGChecker
RAGChecker: A Fine-grained Framework For Diagnosing RAG
facebookresearch/atlas
Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03299)
zeno-ml/zeno-build
Build, evaluate, understand, and fix LLM-based apps
princeton-nlp/ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
bigscience-workshop/t-zero
Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
zphang/minimal-llama
aditya-grover/climate-learn
Source code for ClimateLearn
p-lambda/dsir
DSIR large-scale data selection framework for language model training
yizhongw/Tk-Instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
AlexTMallen/adaptive-retrieval
wellecks/ntptutorial
Tutorial on neural theorem proving
RulinShao/retrieval-scaling
Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".
cmu-l3/llmlean
LLMs + Lean, on your laptop or in the cloud
wellecks/llmstep
llmstep: [L]LM proofstep suggestions in Lean 4.
code-rag-bench/code-rag-bench
CodeRAG-Bench: Can Retrieval Augment Code Generation?
cmu-l3/ntptutorial-II
Neural theorem proving tutorial, version II
masakhane-io/afriqa
Crosslingual Question Answering for African Languages