Ayaan-Sharif's Stars
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
anthropics/courses
Anthropic's educational courses
lucidrains/slot-attention
Implementation of Slot Attention from GoogleAI
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
poloclub/transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
exo-explore/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
facebookresearch/pytorch3d
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
soulpage/fullstack-assignment
Next.js and Django repo for assignments
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
StanislasBertrand/RetinaFace-tf2
RetinaFace (RetinaFace: Single-stage Dense Face Localisation in the Wild, published in 2019) reimplemented in TensorFlow 2.0, with pretrained weights available!
RVC-Boss/GPT-SoVITS
Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
davidsandberg/facenet
Face recognition using Tensorflow
lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
NVlabs/DiffiT
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
black-forest-labs/flux
Official inference repo for FLUX.1 models
stas00/ml-engineering
Machine Learning Engineering Open Book
filipstrand/mflux
An MLX port of FLUX based on the Hugging Face Diffusers implementation.
ml-explore/mlx
MLX: An array framework for Apple silicon
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
timesler/facenet-pytorch
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
xiaoachen98/Open-LLaVA-NeXT
An open-source implementation for training LLaVA-NeXT.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
jbhuang0604/awesome-computer-vision
A curated list of awesome computer vision resources
karpathy/LLM101n
LLM101n: Let's build a Storyteller