WilliamZR
Master student at Wangxuan Institute of Computer Technology, Peking University
Peking UniversityBeijing
WilliamZR's Stars
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
stitionai/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
OpenCodeInterpreter/OpenCodeInterpreter
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. It significantly enhances code generation capabilities by integrating execution and iterative refinement functionalities.
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
GraphPKU/PiSSA
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models(NeurIPS 2024 Spotlight)
zorazrw/awesome-tool-llm
THUNLP-MT/StableToolBench
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
xlang-ai/Spider2-V
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
xlang-ai/Spider2
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Yifan-Song793/ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
Edward-Sun/easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
TIGER-AI-Lab/StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
xCompass-AI/GeneCompass
GeneCompass
xlang-ai/BRIGHT
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
HKUNLP/RSA
Retrieved Sequence Augmentation for Protein Representation Learning
qtli/GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
google/spiqa
Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers"
google-research/chain-of-table
Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
xxxiaol/QRData
Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
ZhenweiAn/Dynamic_MoE
Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"
yale-nlp/FinanceMath
Data and Code for the paper "FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains"
CambridgeNLIP/verification-real-world-info-needs
lfy79001/Awesome-Table-QA
A comprehensive paper list of Table-based Question Answering.
luciusssss/ZhuangBench
[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly
KunhangL/finemotiondiffuse
Motion Generation from Fine-grained Textual Descriptions (LREC-COLING 2024)
Y-Sui/Table-meets-LLM
GPT4Table is a useful benchmark for detecting table structural understanding capabilities.
jtonglet/SEER
Code implementation for the EMNLP 2023 paper "SEER A Knapsack approach to Exemplar Selection for In-Context HybridQA"
ykzhang721/TimeArena