SihengLi99's Stars
meta-llama/codellama
Inference code for CodeLlama models
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
modelscope/modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
openai/weak-to-strong
atfortes/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
stanford-oval/WikiChat
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
lupantech/chameleon-llm
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
ysymyth/awesome-language-agents
List of language agents based on paper "Cognitive Architectures for Language Agents"
salesforce/xgen
Salesforce open-source LLMs with 8k sequence length.
jzbjyb/FLARE
Forward-Looking Active REtrieval-augmented generation (FLARE)
princeton-nlp/ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
voidism/DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
OpenBMB/UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
shmsw25/FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
allenai/FineGrainedRLHF
felipemaiapolo/tinyBenchmarks
Evaluating LLMs with fewer examples
architsharma97/dpo-rlaif
SihengLi99/LLM-Honesty-Survey
A Survey on the Honesty of Large Language Models
marzenakrp/nocha
activatedgeek/calibration-tuning
facebookresearch/llm-cross-capabilities
Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"
alphadl/OOP-eval
The first Object-Oriented Programming (OOP) Evaluaion Benchmark for LLMs
TianHongZXY/qaap
[EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions
SihengLi99/NewsDialogues
[2023-ACL]: NewsDialogues: Towards Proactive News Grounded Conversation