yun-kwak

Seoul National University, South Korea

yun-kwak's Stars

unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Language: Python · 20k · 1.4k
AIDC-AI/Marco-o1
An Open Large Reasoning Model for Real-World Solutions
Language: Python · 1.3k · 68
gicheonkang/clip-rt
📎 + 🦾 CLIP-RT: Learning Language-Conditioned Robotic Policies from Natural Language Supervision
Language:Python91
lazaratan/dyn-gfn
DynGFN: Bayesian Dynamic Causal Discovery using Generative Flow Networks
Language:Python5113
GFNOrg/gfn-lm-tuning
Language:Jupyter Notebook16922
recursionpharma/gflownet
GFlowNet library specialized for graph & molecular data
Language:Python22543
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Language: Python · 4.4k · 226
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
Language: Python · 3.2k · 316
YuxiXie/MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
Language:Jupyter Notebook25827
XinyuanWangCS/PromptAgent
This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that autonomously crafts prompts equivalent in quality to those handcrafted by experts, i.e., expert-level prompts.
Language:Python22929
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language: Python · 19.1k · 1.4k
meta-llama/llama-models
Utilities intended for use with Llama models.
Language: Python · 5.5k · 912
NVIDIA/warp
A Python framework for high performance GPU simulation and graphics
Language: Python · 4.4k · 252
lichess-org/mobile
Lichess mobile app v2
Language: Dart · 1.4k · 211
jopetty/word-problem
Experiments on the impact of depth in transformers and SSMs.
Language:Python213
yun-kwak/efficient-mcts
[UAI'24 Oral] Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction
Language: Python · 4
iwhwang/NCD
On Discovery of Local Independence over Continuous Variables via Neural Contextual Decomposition (CLeaR 2023)
3
iwhwang/Fine-Grained-Causal-RL
Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning (ICML 2024)
Language:Python111
abdulhaim/LMRL-Gym
Language:Python759
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
Language: Python · 1.6k · 139
rtqichen/ffjord
code for "FFJORD: Free-form Continuous Dynamics for Scalable Reversible Generative Models".
Language:Python632141
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language: Cuda · 24.9k · 2.8k
espanso/espanso
Cross-platform Text Expander written in Rust
Language: Rust · 10.3k · 282
state-spaces/s4
Structured state space sequence models
Language: Jupyter Notebook · 2.5k · 301
state-spaces/mamba
Mamba SSM architecture
Language: Python · 13.7k · 1.2k
openai/transformer-debugger
Language: Python · 4.1k · 241
gicheonkang/prograsp
🦾 PyTorch Implementation for the ICRA'24 Paper, "PROGrasp: Pragmatic Human-Robot Communication for Object Grasping"
Language:Python61
google-deepmind/mujoco_menagerie
A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
Language: Python · 1.6k · 232
sotetsuk/pgx
♟️ Vectorized RL game environments in JAX
Language:Python42730
open-spaced-repetition/srs-benchmark
A benchmark for spaced repetition schedulers/algorithms
Language:Jupyter Notebook7010
