AbnerAI
A PhD student at BNU focusing on the design of intelligent computing models.
Beijing Normal University, Beijing
AbnerAI's Stars
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
chenzomi12/Deep-Reinforcement-Learning
Code for the book "Deep Reinforcement Learning: Principles and Practices" (《深度强化学习:原理与实践》)
maidacundo/MoE-LoRA
Adapt an LLM into a Mixture-of-Experts model using parameter-efficient fine-tuning (LoRA), injecting the LoRA adapters into the FFN.
Xiang-Li-oss/MoDE-CoTD
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
wutaiqiang/MoSLoRA
GCYZSL/MoLA
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
ezelikman/quiet-star
Code for Quiet-STaR
danny-avila/LibreChat
Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.
zchuz/CoT-Reasoning-Survey
[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
HKUNLP/diffusion-of-thoughts
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
amazon-science/auto-cot
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
Alab-NII/chain-of-thought
Research papers about Chain of Thought (CoT)
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
NUS-HPC-AI-Lab/Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
StarLight1212/Generative-models
This project aims to share knowledge and code concerning generative models, including GAN, Diffusion, and VAE.
Fazziekey/Fazziekey
titu1994/neural-architecture-search
Basic implementation of [Neural Architecture Search with Reinforcement Learning](https://arxiv.org/abs/1611.01578).
EasyJailbreak/EasyJailbreak
An easy-to-use Python framework to generate adversarial jailbreak prompts.
yang-song/score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
zhyjSIAT/A-Two-Stage-CycleGAN-VE-BRATS2020
shuyhere/about-super-alignment
Feeling confused about super alignment? Here is a reading list
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
dvlab-research/ControlNeXt
Controllable video and image generation: SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
JShollaj/awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
tingofurro/summac
Codebase, data and models for the SummaC paper in TACL
vectara/hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
PKU-YuanGroup/Hallucination-Attack
Attack to induce hallucinations in LLMs