dohaengleee's Stars
THUDM/MathGLM
Official Pytorch Implementation for MathGLM
pixas/TAIA_LLM
microsoft/rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
wellecks/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
FreedomIntelligence/ReasoningNLP
paper list on reasoning in NLP
allenai/open-instruct
tengxiaoliu/XoT
EMNLP 2023 Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
InternLM/InternLM-Math
State-of-the-art bilingual open-sourced Math reasoning LLMs.
cyzhh/MMOS
Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on math reasoning.
deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
lz1oceani/verify_cot
Kipok/NeMo-Skills
A pipeline to improve skills of large language models
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
FreedomIntelligence/OVM
Yiwei98/TDG
zirui-HIT/Bridge_for_Numerical_Reasoning
EleutherAI/math-lm
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
zjunlp/EasyInstruct
[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.
Lionelsy/Conference-Accepted-Paper-List
Some Conferences' accepted paper lists (including AI, ML, Robotic)
Yiwei98/ESC
prometheus-eval/prometheus
[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative for human evaluation and GPT-4 evaluation.
ziqiangyuan/FinLLMs
d223302/Over-Reasoning-of-LLMs
Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
XinyuanLu00/SciTab
The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"
yilunzhao/Awsome-Table-Reasoning
A comprehensive paper list of Reasoning over Tables.
google-research-datasets/ToTTo
ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: given a Wikipedia table and a set of highlighted table cells, produce a one-sentence description. We hope it can serve as a useful research benchmark for high-precision conditional text generation.
JiwooKimAR/dmath