ziyan-wang98

KCL

ziyan-wang98's Stars

ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Language: Go 104k 605 5.3k 8.3k
karpathy/LLM101n
LLM101n: Let's build a Storyteller
30.6k 2.5k 0 1.7k
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language: Python 21.2k 156 2693.1k
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Language: Python 17.2k 281 111.7k
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook 13.3k 76 3991.3k
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Language: Jupyter Notebook 8.5k 105 1151.2k
NeoVertex1/SuperPrompt
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
5.6k 76 21529
isaac-sim/IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim
Language: Python 2.5k 35 8901k
isaac-sim/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
Language: Python 2.1k 37 211439
eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Language: Python 1.6k 20 36110
RayeRen/acad-homepage.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
Language: SCSS 1.6k 3 373k
PufferAI/PufferLib
Simplifying reinforcement learning for complex game environments
Language: C 1.4k 7 1569
google-deepmind/rlax
Language: Python 1.3k 34 2688
facebookresearch/nle
The NetHack Learning Environment
Language: C 943 30 113113
lafmdp/Awesome-Papers-Autonomous-Agent
A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.
612 12 455
Cledersonbc/tic-tac-toe-minimax
Minimax is an AI algorithm.
Language: Python 434 19 6250
youssefHosni/Awesome-AI-Data-Guided-Projects
A curated list of data science & AI guided projects to start building your portfolio
347 6 0 81
ParisNeo/ollama_proxy_server
A proxy server for multiple ollama instances with Key security
Language: Python 293 7 1045
Farama-Foundation/MAgent2
An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments
Language: C++ 245 3 2642
WindyLab/LLM-RL-Papers
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
237 3 0 10
kying18/tic-tac-toe
Tic-tac-toe AI using minimax
Language: Python 205 10 2139
geochri/AlphaZero_Chess
PyTorch implementation of AlphaZero Chess from scratch
Language: Python 130 4 0 28
michaelnny/alpha_zero
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
Language: Python 89 3 518
WeihaoTan/TWOSOME
Implementation of TWOSOME
Language: Python 55 3 126
luchris429/JaxLife
An Open-Ended Agentic Simulator
Language: Python 31 2 1 2
pickxiguapi/Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
Language: Python 30 2 0 1
mgerstgrasser/super
suPER is a collaborative multi-agent RL algorithm
Language: Python 11 2 1 1
ninell-oldenburg/social-contracts
Language: Python 10 1 0 0
serenabooth/reward-design-perils
Language: Jupyter Notebook 8 1 0 1
WeihaoTan/gym-macro-overcooked
Language: Python 8 1 0 1
