XLANG Lab
Developing embodied AI agents that empower users to use language to interact with digital and physical environments to carry out real-world tasks.
Pinned Repositories
aguvis
[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
DS-1000
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
OpenCUA
OpenCUA: Open Foundations for Computer-Use Agents
OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Spider2
[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
text2reward
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
UnifiedSKG
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
xlang-paper-reading
Paper collection on building and evaluating language model agents via executable language grounding
XLANG Lab's Repositories
xlang-ai/OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
xlang-ai/OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
xlang-ai/Spider2
[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
xlang-ai/UnifiedSKG
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
xlang-ai/OpenCUA
OpenCUA: Open Foundations for Computer-Use Agents
xlang-ai/xlang-paper-reading
Paper collection on building and evaluating language model agents via executable language grounding
xlang-ai/aguvis
[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
xlang-ai/Binder
[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"
xlang-ai/DS-1000
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
xlang-ai/text2reward
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
xlang-ai/BRIGHT
[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
xlang-ai/Spider2-V
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
xlang-ai/icl-selective-annotation
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
xlang-ai/OSWorld-G
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
xlang-ai/batch-prompting
[EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.
xlang-ai/EVOR
xlang-ai/computer-agent-arena
Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!
xlang-ai/AgentTrek
[ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
xlang-ai/AgentNetTool
This is the official code base of AgentNetTool in OpenCUA. Website: https://opencua.xlang.ai/
xlang-ai/diagrams_toolkit
Source code for diagrams in the paper of NLPers from HKU.
xlang-ai/xlang-ai.github.io
The official website of xlang.ai
xlang-ai/.github
xlang-ai/verl
veRL: Volcano Engine Reinforcement Learning for LLM
xlang-ai/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.