seungju-k1m

Hello ~

Machine Learning Engineer at Makinarocks
Hanyang University

seungju-k1m's Stars

meta-llama/llama
Inference code for Llama models
Language: Python 57.2k 528 1.1k 9.7k
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language: Python 32.9k 318 9504.8k
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language: Python 31k 335 5.9k 2.9k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python 21.1k 158 1.6k 2.3k
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Language: Python 20.7k 136 1.2k 1.5k
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
Language: C++ 10.5k 127 7581.2k
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook 10.2k 97 681990
mistralai/mistral-inference
Official inference library for Mistral models
Language: Jupyter Notebook 9.9k 128 148873
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language: Python 8.9k 78 587637
vikhyat/moondream
tiny vision language model
Language: Jupyter Notebook 6.8k 60 140540
google/flax
Flax is a neural network library for JAX that is designed for flexibility.
Language: Jupyter Notebook 6.3k 86 941659
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Language: Python 5k 121 59476
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Language: Python 4.4k 61 97231
facebookresearch/Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
Language: Jupyter Notebook 2.7k 35 59175
noahshinn/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Language: Python 2.5k 29 37249
EgoAlpha/prompt-in-context-learning
Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
Language: Jupyter Notebook 1.5k 38 294
Farama-Foundation/chatarena
ChatArena (or Chat Arena) is a library of Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
Language: Python 1.4k 19 23135
google-deepmind/concordia
A library for generative social simulation
Language: Python 752 21 41170
grok-ai/nn-template
Generic template to bootstrap your PyTorch project.
Language: Python 639 14 967
kvablack/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
Language: Python 462 6 2642
lqtrung1998/mwp_ReFT
Language: Python 434 5 852
nicklashansen/tdmpc2
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
Language: Python 429 7 45100
zhuyiche/llava-phi
Language: Python 370 26 2439
yingchengyang/Reinforcement-Learning-Papers
Related papers for reinforcement learning, including classic papers and latest papers in top conferences
339 17 025
Farama-Foundation/Minari
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
Language: Python 330 12 6148
mihirp1998/AlignProp
AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Language: Python 254 6 158
abdulhaim/LMRL-Gym
Language: Python 75 5 149
tinker495/jax-baseline
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselines.
Language: Python 42 1 04
moripiri/Reinforcement-Learning-on-FrozenLake
Reinforcement Learning Algorithms in FrozenLake-v1
Language: Jupyter Notebook 19 2 01
seungju-k1m/CommonRoad
Language: Python 6 1 00