shiwk20's Stars
salesforce/DialogStudio
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
ysymyth/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
open-compass/BotChat
Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
camel-ai/camel
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
InfiMM/Awesome-Multimodal-LLM-for-Math-STEM
Paper collections of multi-modal LLM for Math/STEM/Code.
ZubinGou/math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
CLUEbenchmark/SuperCLUE-Math6
SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅
Vincentqyw/cv-arxiv-daily
🎓Automatically Update CV Papers Daily using Github Actions (Update Every 2days)
QwenLM/Qwen2.5-Math
A series of math-specific large language models of our Qwen2 series.
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
ECNU-ICALK/SocraticMath
[CIKM 2024] Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching
project-numina/aimo-progress-prize
Khan/tutoring-accuracy-dataset
This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the performance and challenges of Large Language Models (LLMs) in math tutoring scenarios, providing a benchmark dataset for evaluating LLM accuracy in educational contexts.
gpoesia/mathcamps
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
ytyz1307zzh/RefAug
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
mem0ai/mem0
The Memory layer for your AI apps
RunzheYang/SocraticAI
Problem solving by engaging multiple AI agents in conversation with each other and the user.
NTAIX/Chinese-Python-QA-Dataset
An Annotated Question Answering Dataset for Assisting Chinese Python Programming Learners
Kent0n-Li/ChatDoctor
rossant/awesome-math
A curated list of awesome mathematics resources
eth-nlped/mathdial
🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023
ChengpengLi1003/DotaMath
Learnware-LAMDA/Beimingwu
Beimingwu is the first systematic open-source implementation of the learnware dock system, providing a preliminary research platform for learnware studies and enabling effective learnware search and reuse without building machine learning models from scratch.
OpenLMLab/GAOKAO-Bench
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
g1y5x3/llm_gaokao
Curious whether LLMs can ace the Chinese college entrance exam 高考
llmeval/Llmeval-Gaokao2024-Math
中文大语言模型评测2024高考数学专题
xianshang33/llm-paper-daily
Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个