[Roadmap] Interesting research papers integrate

Opened this issue 6 months ago · 0 comments

lightaime commented 6 months ago

Required prerequisites

I have searched the Issue Tracker and Discussions that this hasn't already been reported. (+1 or comment there if it has.)
Consider asking first in a Discussion.

Motivation

Data generation:

Data curation:

TracIn: https://arxiv.org/abs/2405.17490
CKNN-Shapley: https://arxiv.org/pdf/2405.17489
More: https://github.com/tongyx361/Awesome-LLM4Math

Multi-modality:

Anole: https://x.com/stefan_fee/status/1810695036432232576?s=46
InternLM-XComposer-2.5: https://x.com/wjqdev/status/1808751706970489287?s=46
Mira: https://mira-space.github.io/
More: Efficient Multimodal Large Language Models:A Survey: https://github.com/lijiannuist/Efficient-Multimodal-LLMs-Survey

RAG:

GraphEval: https://x.com/hitesh_lpatel/status/1813335989034635515?s=46
G-Retriever: https://arxiv.org/pdf/2402.07630

Reward model / judge:

MJ-Bench: https://x.com/huaxiuyaoml/status/1810728309182861462?s=46
Synthetic Critiques: https://arxiv.org/pdf/2405.20850

Agent:

EvoAgent: https://arxiv.org/abs/2406.14228
CodeNav: https://x.com/tanmay2099/status/1806020668175376809?s=46
AgentGym: https://arxiv.org/abs/2406.04151
LLM Reasoners: https://x.com/maitrixorg/status/1782801835423850881?s=46
Structured output: https://simmering.dev/blog/structured_output/

Efficiency:

SGLang: https://arxiv.org/abs/2312.07104

Prompt Optimization

DSPy: https://github.com/stanfordnlp/dspy
TextGrad: https://github.com/zou-group/textgrad

Benchmark:

ARC: https://www.kaggle.com/competitions/arc-prize-2024/
StableToolBench: https://github.com/THUNLP-MT/StableToolBench
CodeRAG-Bench: https://x.com/zhiruow/status/1804138112303378487?s=46
Berkeley Function-Calling Leaderboard: https://gorilla.cs.berkeley.edu/leaderboard.html#leaderboard
R2E: Repository to Environment: https://x.com/slimshetty_/status/1787860906439029022?s=46

Solution

No response

Alternatives

No response

Additional context

No response