Pinned Repositories
-
does what it says on the tin
ADAS
Automated Design of Agentic Systems
al
Avalon-LLM
This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"
entropix
Entropy Based Sampling and Parallel CoT Decoding
FinAgent
LaTRO
learning-to-plan-for-language-modeling-from-unlabeled-data
Code for https://arxiv.org/pdf/2404.00614
marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user develop their prompts into full models.
AliArmani3397's Repositories
AliArmani3397/-
does what it says on the tin
AliArmani3397/ADAS
Automated Design of Agentic Systems
AliArmani3397/al
AliArmani3397/Avalon-LLM
This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"
AliArmani3397/entropix
Entropy Based Sampling and Parallel CoT Decoding
AliArmani3397/FinAgent
AliArmani3397/LaTRO
AliArmani3397/learning-to-plan-for-language-modeling-from-unlabeled-data
Code for https://arxiv.org/pdf/2404.00614
AliArmani3397/marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
AliArmani3397/minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user develop their prompts into full models.
AliArmani3397/MS
🔍 An LLM-based multi-agent framework for web search (like Perplexity.ai Pro and SearchGPT)
AliArmani3397/muzero_sketch
AliArmani3397/OpenQ
The open-source implementation of Q*, achieved in context as a zero-shot reprogramming of the attention mechanism. (synthetic data)
AliArmani3397/rStar
AliArmani3397/WebRL