jianguoz's Stars
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
OpenBMB/XAgent
An Autonomous LLM Agent for Complex Task Solving
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
xlang-ai/OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
MeetKai/functionary
Chat language model that can use tools and interpret the results
THUDM/AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Xwin-LM/Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
billxbf/ReWOO
Decoupling Reasoning from Observations for Efficient Augmented Language Models
ruixiangcui/AGIEval
microsoft/CodeT
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR 2024]
SalesforceAIResearch/AgentLite
allenai/lumos
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
nexusflowai/NexusRaven-V2
InternLM/Agent-FLAN
[ACL 2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
nexusflowai/NexusRaven
NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRaven-13B and baselines.
SalesforceAIResearch/xLAM
hkust-nlp/AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents
Ber666/ToolkenGPT
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)
snap-stanford/MLAgentBench
copilot-us/chatgpt-plugins
Official ChatGPT Plugins 🧩
open-compass/T-Eval
[ACL 2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
Data-Provenance-Initiative/Data-Provenance-Collection
CASIA-LM/MoDS
xingyaoww/mint-bench
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and Heng Ji.
gpt4life/alpagasus
Unofficial implementation of AlpaGasus
Junjie-Ye/ToolEyes
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios