kunato
Typhoon Creator | Lead AI Scientist @ SCB 10X | Ex. CTO & Founder @ KUNANA AI
KUNANA AI | Bangkok, Thailand
kunato's Stars
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
princeton-nlp/SWE-agent
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
nilsherzig/LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
mishushakov/llm-scraper
Turn any webpage into structured data using LLMs
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
chenxwh/insanely-fast-whisper
Incredibly fast Whisper-large-v3
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward models for RLHF.
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
facebookresearch/generative-recommenders
Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
xfactlab/orpo
Official repository for ORPO
ContinualAI/colab
Continual Learning tutorials and demo running on Google Colaboratory.
Ledzy/BAdam
[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
lm-sys/arena-hard
Arena-Hard benchmark
arcee-ai/PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
npuichigo/openai_trtllm
OpenAI compatible API for TensorRT LLM triton backend
NVIDIA/trt-llm-as-openai-windows
This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows instead of cloud.
asappresearch/simple-tts
Contains the code associated with the ICLR submission for our text-to-speech diffusion model
amirarsalan90/promptrefiner
declare-lab/resta
Restore safety in fine-tuned language models through task arithmetic
AlignInc/aligner-replication
A reproduction of the paper "Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction"
ex3ndr/supervoice-dataset
60k hours of phoneme-aligned audio from audio books
Aratako/Task-Vector-Merge-Optimzier
MeNicefellow/Mixtral-Expert-Trimmer
argilla-io/distilabel-workbench
A working repository for experimental pipelines in distilabel
josejg/instruction_following_eval
Instruction Following Eval