dkkim93's Stars
memesoo99/WebAgent
Computer Vision II Final Project
dkkim93/meta-mapg
Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)
abdulhaim/moral_foundations_llms
christophschuhmann/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
stanfordnlp/wge
Workflow-Guided Exploration: sample-efficient RL agent for web tasks
Farama-Foundation/miniwob-plusplus
MiniWoB++: a web interaction benchmark for reinforcement learning
baptisteArno/typebot.io
💬 Typebot is a powerful chatbot builder that you can self-host.
dkkim93/further
Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)
wookayin/gpustat
📊 A simple command-line utility for querying and monitoring GPU status
meta-llama/llama
Inference code for Llama models
wcarvalho/oo-model
cradol/cradol
Source code for "Context-Specific Representation Abstraction for Deep Option Learning"
3b1b/manim
Animation engine for explanatory math videos
aletcher/stable-opponent-shaping
PyTorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
tdurieux/anonymous_github
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
Farama-Foundation/SuperSuit
A collection of wrappers for Gymnasium and PettingZoo environments (being merged into gymnasium.wrappers and pettingzoo.wrappers)
Farama-Foundation/PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
marcharper/python-ternary
🔺 Ternary plotting library for Python with matplotlib
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
xinleipan/gym-gridworld
Simple grid-world environment compatible with OpenAI Gym
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
mit-satori/mit-satori.github.io
mattriemer/MER
Fork of the GEM project (https://github.com/facebookresearch/GradientEpisodicMemory) including Meta-Experience Replay (MER) methods from the ICLR 2019 paper (https://openreview.net/pdf?id=B1gTShAct7)
posquit0/Awesome-CV
📄 Awesome CV is a LaTeX template for your outstanding job application
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
jonbarron/website
sudharsan13296/Hands-On-Meta-Learning-With-Python
Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with TensorFlow