nuwuxian's Stars
nuwuxian/RL-state_mask
cla7aye15I4nd/CAMP
CAMP: Compiler and Allocator-based Heap Memory Protection (USENIX Security 2024) ✨ Please give a star to https://github.com/cla7aye15I4nd/shadowbound next door! 🌟😊
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Cornell-RL/tril
sherdencooper/GPTFuzz
Official repo for GPTFUZZER : Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
Zehui127/1d-swin
The implementation of 1d-swin, an efficient transformer for capturing hierarchical 1-dimentional long range sequence
opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
google-deepmind/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
astooke/rlpyt
Reinforcement Learning in PyTorch
openai/safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
Henrygwb/edge
ManSoSec/Microsoft-Malware-Challenge
Spijkervet/SimCLR
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations by T. Chen et al.
Henrygwb/UnsupervisedLearing
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)