AliArmani3397

Pinned Repositories

-
does what it says on the tin
Language:Python0 0 00
ADAS
Automated Design of Agentic Systems
Language:Python0 0 00
al
0 1 00
Avalon-LLM
This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'
Language:Python0 0 00
entropix
Entropy Based Sampling and Parallel CoT Decoding
Language:Python0 0 00
FinAgent
Language:Python0 0 00
LaTRO
Language:Python00
learning-to-plan-for-language-modeling-from-unlabeled-data
Code for https://arxiv.org/pdf/2404.00614
Language:Python0 0 00
marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
Language:Python00
minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user develop their prompts into full models.
Language:Python00

AliArmani3397's Repositories

AliArmani3397/-
does what it says on the tin
Language:Python0 0 00
AliArmani3397/ADAS
Automated Design of Agentic Systems
Language:Python0 0 00
AliArmani3397/al
0 1 00
AliArmani3397/Avalon-LLM
This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'
Language:Python0 0 00
AliArmani3397/entropix
Entropy Based Sampling and Parallel CoT Decoding
Language:Python0 0 00
AliArmani3397/FinAgent
Language:Python0 0 00
AliArmani3397/LaTRO
Language:Python00
AliArmani3397/learning-to-plan-for-language-modeling-from-unlabeled-data
Code for https://arxiv.org/pdf/2404.00614
Language:Python0 0 00
AliArmani3397/marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
Language:Python00
AliArmani3397/minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user develop their prompts into full models.
Language:Python00
AliArmani3397/MS
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Language:Python0 0 00
AliArmani3397/muzero_sketch
Language:Python0 0 00
AliArmani3397/OpenQ
The open-source implementation of Q*, achieved in context as a zero-shot reprogramming of the attention mechanism. (synthetic data)
0 0 00
AliArmani3397/rStar
Language:Python0 0 00
AliArmani3397/WebRL