Pinned Repositories
AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents
BOLAA
benchmarking and orchestrating LLM-augmented Agents
webarena
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
BOLAA
DialogStudio
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
SRMA
Contrastive Learning with Model Augmentation
AgentLite
JimSalesforce's Repositories
JimSalesforce/BOLAA
benchmarking and orchestrating LLM-augmented Agents
JimSalesforce/webarena
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"