dkkim93

Staff Research Scientist @ Field AI

LG AI Research-Ann Arbor

dkkim93's Stars

memesoo99/WebAgent
Computer Vision II Final Project
Language:Jupyter Notebook1
dkkim93/meta-mapg
Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)
Language:Python314
abdulhaim/moral_foundations_llms
Language:Jupyter Notebook82
christophschuhmann/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
Language:Python87987
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
Language:Python3.6k273
stanfordnlp/wge
Workflow-Guided Exploration: sample-efficient RL agent for web tasks
Language:Python10933
Farama-Foundation/miniwob-plusplus
MiniWoB++: a web interaction benchmark for reinforcement learning
Language:HTML28047
baptisteArno/typebot.io
💬 Typebot is a powerful chatbot builder that you can self-host.
Language:TypeScript7.2k2k
dkkim93/further
Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)
Language:Python185
wookayin/gpustat
📊 A simple command-line utility for querying and monitoring GPU status
Language:Python4k281
meta-llama/llama
Inference code for Llama models
Language:Python55.8k9.5k
wcarvalho/oo-model
Language:Python3
cradol/cradol
Source code for "Context-Specific Representation Abstraction for Deep Option Learning"
Language:Python101
3b1b/manim
Animation engine for explanatory math videos
Language:Python62.9k5.8k
aletcher/stable-opponent-shaping
Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
Language:Jupyter Notebook212
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Language:Jupyter Notebook7.9k794
tdurieux/anonymous_github
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
Language:TypeScript1.4k55
Farama-Foundation/SuperSuit
A collection of wrappers for Gymnasium and PettingZoo environments (being merged into gymnasium.wrappers and pettingzoo.wrappers
Language:Python45157
Farama-Foundation/PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Language:Python2.6k408
marcharper/python-ternary
:small_red_triangle: Ternary plotting library for python with matplotlib
Language:Python726156
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language:Python1.7k343
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Language:Python1.2k269
xinleipan/gym-gridworld
Simple grid-world environment compatible with OpenAI-gym
Language:Python4922
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Language:Jupyter Notebook3.3k554
mit-satori/mit-satori.github.io
Language:HTML51
mattriemer/MER
Fork of the GEM project (https://github.com/facebookresearch/GradientEpisodicMemory) including Meta-Experience Replay (MER) methods from the ICLR 2019 paper (https://openreview.net/pdf?id=B1gTShAct7)
Language:Python14333
posquit0/Awesome-CV
:page_facing_up: Awesome CV is LaTeX template for your outstanding job application
Language:TeX22.9k4.8k
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language:Python3.7k665
jonbarron/website
Language:HTML2.5k2k
sudharsan13296/Hands-On-Meta-Learning-With-Python
Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow
Language:Jupyter Notebook1.2k361

dkkim93

dkkim93's Stars

memesoo99/WebAgent

dkkim93/meta-mapg

abdulhaim/moral_foundations_llms

christophschuhmann/improved-aesthetic-predictor

microsoft/LMOps

stanfordnlp/wge

Farama-Foundation/miniwob-plusplus

baptisteArno/typebot.io

dkkim93/further

wookayin/gpustat

meta-llama/llama

wcarvalho/oo-model

cradol/cradol

3b1b/manim

aletcher/stable-opponent-shaping

google-deepmind/mujoco

tdurieux/anonymous_github

Farama-Foundation/SuperSuit

Farama-Foundation/PettingZoo

marcharper/python-ternary

nikhilbarhate99/PPO-PyTorch

Farama-Foundation/Metaworld

xinleipan/gym-gridworld

higgsfield-ai/higgsfield

mit-satori/mit-satori.github.io

mattriemer/MER

posquit0/Awesome-CV

google-deepmind/dm_control

jonbarron/website

sudharsan13296/Hands-On-Meta-Learning-With-Python