huangshenno1

AlibabaChina

huangshenno1's Stars

HITsz-TMG/awesome-llm-attributions
A Survey of Attributions for Large Language Models
1428
felipemaiapolo/tinyBenchmarks
Evaluating LLMs with fewer examples
Language:Jupyter Notebook10610
FullStackRetrieval-com/RetrievalTutorials
Language:Jupyter Notebook49880
dedupeio/dedupe
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Language:Python4k545
songquanpeng/one-api
OpenAI 接口管理 & 分发系统，支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元，可用于二次分发管理 key，仅单可执行文件，已打包好 Docker 镜像，一键部署，开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.
Language:JavaScript15.9k3.7k
Alibaba-NLP/CDQA
CDQA: Chinese Dynamic Question Answering Benchmark
Language:Python12
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Language:Python2.5k244
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Language:Python2k132
microsoft/ToolTalk
Evaluating tool-augmented LLMs in conversation settings
Language:Python6514
open-compass/T-Eval
[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
Language:Python17410
InternLM/lagent
A lightweight framework for building LLM-based agents
Language:Python98198
fanqiwan/KCA
Knowledge Verification to Nip Hallucination in the Bud
Language:Python18
Tongji-KGLLM/RAG-Survey
1.5k111
AkariAsai/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
Language:Python1.6k140
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python21.7k3.1k
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
Language:HTML107k14.6k
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
5.7k339
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language:Python88.2k13.8k
adityatelange/hugo-PaperMod
A fast, clean, responsive Hugo theme.
Language:HTML9.1k2.5k
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python5.8k421
OpenBMB/XAgent
An Autonomous LLM Agent for Complex Task Solving
Language:Python7.8k794
defog-ai/sql-eval
Evaluate the accuracy of LLM generated outputs
Language:Jupyter Notebook44047
chen700564/RGB
Language:Python21417
Alibaba-NLP/SeqGPT
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding
Language:Python19410
Alibaba-NLP/EcomGPT
An Instruction-tuned Large Language Model for E-commerce
Language:Python20213
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python5.7k1.5k
streamlit/streamlit
Streamlit — A faster way to build and share data apps.
Language:Python33k2.9k
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
Language:Python1.5k70
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python12.3k997
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:Python3.1k331

huangshenno1

huangshenno1's Stars

HITsz-TMG/awesome-llm-attributions

felipemaiapolo/tinyBenchmarks

FullStackRetrieval-com/RetrievalTutorials

dedupeio/dedupe

songquanpeng/one-api

Alibaba-NLP/CDQA

QwenLM/Qwen-Agent

THUDM/AgentBench

microsoft/ToolTalk

open-compass/T-Eval

InternLM/lagent

fanqiwan/KCA

Tongji-KGLLM/RAG-Survey

AkariAsai/self-rag

vllm-project/vllm

f/awesome-chatgpt-prompts

WooooDyy/LLM-Agent-Paper-List

langchain-ai/langchain

adityatelange/hugo-PaperMod

FlagOpen/FlagEmbedding

OpenBMB/XAgent

defog-ai/sql-eval

chen700564/RGB

Alibaba-NLP/SeqGPT

Alibaba-NLP/EcomGPT

EleutherAI/lm-evaluation-harness

streamlit/streamlit

hkust-nlp/ceval

QwenLM/Qwen

open-compass/opencompass