huangshenno1's Stars
HITsz-TMG/awesome-llm-attributions
A Survey of Attributions for Large Language Models
felipemaiapolo/tinyBenchmarks
Evaluating LLMs with fewer examples
FullStackRetrieval-com/RetrievalTutorials
dedupeio/dedupe
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
songquanpeng/one-api
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.
Alibaba-NLP/CDQA
CDQA: Chinese Dynamic Question Answering Benchmark
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
microsoft/ToolTalk
Evaluating tool-augmented LLMs in conversation settings
open-compass/T-Eval
[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
InternLM/lagent
A lightweight framework for building LLM-based agents
fanqiwan/KCA
Knowledge Verification to Nip Hallucination in the Bud
Tongji-KGLLM/RAG-Survey
AkariAsai/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
adityatelange/hugo-PaperMod
A fast, clean, responsive Hugo theme.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
OpenBMB/XAgent
An Autonomous LLM Agent for Complex Task Solving
defog-ai/sql-eval
Evaluate the accuracy of LLM generated outputs
chen700564/RGB
Alibaba-NLP/SeqGPT
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding
Alibaba-NLP/EcomGPT
An Instruction-tuned Large Language Model for E-commerce
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
streamlit/streamlit
Streamlit — A faster way to build and share data apps.
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.