nikeshnaik's Stars
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
chengzeyi/stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
google-deepmind/penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Leiay/looped_transformer
databricks/megablocks
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
lyeoni/gpt-pytorch
PyTorch Implementation of OpenAI GPT
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
YavorGIvanov/sam.cpp
slimtoolkit/slim
Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled languages even more) making it secure too! (free and open source)
dokku/dokku
A docker-powered PaaS that helps you build and manage the lifecycle of applications
bloomberg/memray
Memray is a memory profiler for Python
e2b-dev/awesome-ai-sdks
A database of SDKs, frameworks, libraries, and tools for creating, monitoring, debugging and deploying autonomous AI agents
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
sabetAI/BLoRA
batched loras
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
OpenNMT/CTranslate2
Fast inference engine for Transformer models
huggingface/text-generation-inference
Large Language Model Text Generation Inference
apache/submarine
Submarine is Cloud Native Machine Learning Platform.
zetavg/LLaMA-LoRA-Tuner
UI tool for fine-tuning and testing your own LoRA models base on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
guidance-ai/guidance
A guidance language for controlling large language models.
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
common-fate/iamzero
Identity & Access Management simplified and secure.
plasma-umass/scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals