AbnerAI

A PhD student at BNU focuses on designing intelligent computing models.

Beijing Normal UniversityBeijing

AbnerAI's Stars

PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language:Python899119
chenzomi12/Deep-Reinforcement-Learning
《深度强化学习：原理与实践》，Code of the book <Deep Reinforcement Learning: Principles and Practices>
Language:Jupyter Notebook16777
maidacundo/MoE-LoRA
Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.
Language:Python25
Xiang-Li-oss/MoDE-CoTD
Language:Python2
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Language:Python1.8k103
wutaiqiang/MoSLoRA
Language:Python929
GCYZSL/MoLA
Language:Python1238
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Language:Python8.2k1.1k
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
6.5k362
ezelikman/quiet-star
Code for Quiet-STaR
Language:Python71389
danny-avila/LibreChat
Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.
Language:TypeScript22.1k3.7k
zchuz/CoT-Reasoning-Survey
[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
41014
HKUNLP/diffusion-of-thoughts
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
Language:Python1095
amazon-science/auto-cot
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
Language:Jupyter Notebook1.7k155
Alab-NII/chain-of-thought
Research papers about Chain of Thought (CoT)
504
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Language:Jupyter Notebook2.7k132
NUS-HPC-AI-Lab/Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
Language:Python84446
StarLight1212/Generative-models
This project aim to share the knowledge and code concerning generative models, including: GAN, Diffusion, VAE.
Language:Python10627
Fazziekey/Fazziekey
6
titu1994/neural-architecture-search
Basic implementation of [Neural Architecture Search with Reinforcement Learning](https://arxiv.org/abs/1611.01578).
Language:Python433113
EasyJailbreak/EasyJailbreak
An easy-to-use Python framework to generate adversarial jailbreak prompts.
Language:Python55246
yang-song/score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Language:Jupyter Notebook1.8k330
zhyjSIAT/A-Two-Stage-CycleGAN-VE-BRATS2020
Language:Python61
shuyhere/about-super-alignment
Feeling confused about super alignment? Here is a reading list
421
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
Language:Python1.9k168
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Language:Python1.5k76
JShollaj/awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
1.2k95
tingofurro/summac
Codebase, data and models for the SummaC paper in TACL
Language:Jupyter Notebook8728
vectara/hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
Language:Python1.6k60
PKU-YuanGroup/Hallucination-Attack
Attack to induce LLMs within hallucinations
Language:Python14419

AbnerAI

AbnerAI's Stars

PKU-Alignment/omnisafe

chenzomi12/Deep-Reinforcement-Learning

maidacundo/MoE-LoRA

Xiang-Li-oss/MoDE-CoTD

S-LoRA/S-LoRA

wutaiqiang/MoSLoRA

GCYZSL/MoLA

thu-ml/tianshou

hijkzzz/Awesome-LLM-Strawberry

ezelikman/quiet-star

danny-avila/LibreChat

zchuz/CoT-Reasoning-Survey

HKUNLP/diffusion-of-thoughts

amazon-science/auto-cot

Alab-NII/chain-of-thought

FranxYao/chain-of-thought-hub

NUS-HPC-AI-Lab/Neural-Network-Diffusion

StarLight1212/Generative-models

Fazziekey/Fazziekey

titu1994/neural-architecture-search

EasyJailbreak/EasyJailbreak

yang-song/score_sde_pytorch

zhyjSIAT/A-Two-Stage-CycleGAN-VE-BRATS2020

shuyhere/about-super-alignment

maitrix-org/llm-reasoners

dvlab-research/ControlNeXt

JShollaj/awesome-llm-interpretability

tingofurro/summac

vectara/hallucination-leaderboard

PKU-YuanGroup/Hallucination-Attack