junwucs

Xiong Jun Wu @ Ant Group, Beijing

Universe

junwucs's Stars

MMMU-Benchmark/MMMU
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Language:Python35424
deepseek-ai/DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Language:Python2k87
chuanyang-Zheng/Progressive-Hint
This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"
Language:Python20014
idavidrein/gpqa
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Language:Jupyter Notebook18010
fchollet/ARC-AGI
The Abstraction and Reasoning Corpus
Language:JavaScript3.5k586
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language:Jupyter Notebook15.1k2.2k
mlcommons/modelbench
Run safety benchmarks against AI models and view detailed reports showing how well they performed.
Language:Python6110
pytorch/torchtune
PyTorch native finetuning library
Language:Python4.3k430
meta-llama/PurpleLlama
Set of tools to assess and improve LLM security.
Language:Python2.7k448
Magnetic2014/RoleEval
A Bilingual Role Evaluation Benchmark for Large Language Models
34
SalesforceAIResearch/AgentLite
Language:Jupyter Notebook52360
SalesforceAIResearch/xLAM
Language:Python31125
llmeval/llmeval-3
中文大语言模型评测第三期
241
Nanbeige/Nanbeige
Language:Python859
deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Language:Python83051
xingyaoww/mint-bench
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and Heng Ji.
Language:Python1047
thunlp/LEGENT
Open Platform for Embodied Agents
Language:Python26815
thunlp/MatPlotAgent
Language:Python507
KimMeen/Time-LLM
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
Language:Python1.4k252
AutonomousAgentsLab/curiousreplay
Implementations of Curious Replay for model-based adaptation.
361
LiveCodeBench/LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
Language:Python20932
huggingface/llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
Language:Python23623
mgramin/awesome-db-tools
Everything that makes working with databases easier
4.2k347
jsbroks/awesome-dataset-tools
🔧 A curated list of awesome dataset tools
854124
awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets.
60.9k9.9k
jianzhnie/awesome-instruction-datasets
A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。
53428
Value4AI/Awesome-LLM-in-Social-Science
Awesome papers involving LLMs in Social Science.
28522
google-deepmind/concordia
A library for generative social simulation
Language:Python673161
zhaorw02/FlexiDreamer
An official implementation of FlexiDreamer: Single Image-to-3D Generation with FlexiCubes.
741
OpenBMB/Eurus
Language:Python28315

junwucs

junwucs's Stars

MMMU-Benchmark/MMMU

deepseek-ai/DreamCraft3D

chuanyang-Zheng/Progressive-Hint

idavidrein/gpqa

fchollet/ARC-AGI

meta-llama/llama-recipes

mlcommons/modelbench

pytorch/torchtune

meta-llama/PurpleLlama

Magnetic2014/RoleEval

SalesforceAIResearch/AgentLite

SalesforceAIResearch/xLAM

llmeval/llmeval-3

Nanbeige/Nanbeige

deepseek-ai/DeepSeek-Math

xingyaoww/mint-bench

thunlp/LEGENT

thunlp/MatPlotAgent

KimMeen/Time-LLM

AutonomousAgentsLab/curiousreplay

LiveCodeBench/LiveCodeBench

huggingface/llm-swarm

mgramin/awesome-db-tools

jsbroks/awesome-dataset-tools

awesomedata/awesome-public-datasets

jianzhnie/awesome-instruction-datasets

Value4AI/Awesome-LLM-in-Social-Science

google-deepmind/concordia

zhaorw02/FlexiDreamer

OpenBMB/Eurus