XLANG NLP Lab

Building language model agents that ground language instructions into code or actions executable in real-world environments

Pinned Repositories

Binder
[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"
Language: Python 304 10 1036
DS-1000
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
Language: Python 228 8 2127
instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Language: Python 1.9k 18 111139
OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Language: Python 4k 47 99455
OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Language: Python 1.5k 31 54164
Spider2
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Language: HTML 254 13 2717
Spider2-V
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Language: Jupyter Notebook 113 4 17
text2reward
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
Language: Jupyter Notebook 136 7 38
UnifiedSKG
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
Language: Python 550 12 3958
xlang-paper-reading
Paper collection on building and evaluating language model agents via executable language grounding
342 10 012

XLANG NLP Lab's Repositories

xlang-ai/OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Language: Python 4k 47 99455
xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Language: Python 1.9k 18 111139
xlang-ai/OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Language: Python 1.5k 31 54164
xlang-ai/UnifiedSKG
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
Language: Python 550 12 3958
xlang-ai/xlang-paper-reading
Paper collection on building and evaluating language model agents via executable language grounding
342 10 012
xlang-ai/Binder
[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"
Language: Python 304 10 1036
xlang-ai/Spider2
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Language: HTML 254 13 2717
xlang-ai/DS-1000
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
Language: Python 228 8 2127
xlang-ai/text2reward
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
Language: Jupyter Notebook 136 7 38
xlang-ai/Spider2-V
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Language: Jupyter Notebook 113 4 17
xlang-ai/icl-selective-annotation
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
Language: Python 107 5 216
xlang-ai/aguvis
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Language: Python 993
xlang-ai/batch-prompting
[EMNLP 2023 Industry Track] A simple prompting approach that enables LLMs to run inference in batches.
Language: Python 71 7 26
xlang-ai/BRIGHT
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Language: Python 60 4 83
xlang-ai/EVOR
Language: Python 516
xlang-ai/diagrams_toolkit
Source code for the diagrams in papers by NLPers from HKU.
Language: Python 5 4 01
xlang-ai/.github
1 3 00
xlang-ai/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
