louieworth's Stars
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
0xk1h0/ChatGPT_DAN
ChatGPT DAN, Jailbreaks prompt
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
llSourcell/Doctor-Dignity
Doctor Dignity is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
jiji262/tianya-docs
精心收集的天涯神贴,不带水印,方便阅读
epfLLM/meditron
Meditron is a suite of open-source medical Large Language Models (LLMs).
tdurieux/anonymous_github
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
AI-in-Health/MedLLMsPracticalGuide
A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
andyzoujm/representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
xuhangc/ChatGPT-Academic-Prompt
Use ChatGPT for academic writing
Operations-Research-Science/Ebook-Linear_Programming
ContextualAI/gritlm
Generative Representational Instruction Tuning
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
ebrahimpichka/awesome-optimization
A curated list of mathematical optimization courses, lectures, books, notes, libraries, frameworks and software.
HqWu-HITCS/Awesome-LLM-Survey
An Awesome Collection for LLM Survey
tzxiang/tips-for-scientific-research
A collection of tips for scientific research
ZhengYinan-AIR/FISOR
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
foundation29org/dxgpt_testing
RecursionSheep/OptimizationHW
The homework assignments of the course Introduction to Optimization Theory