abliznyuk's Stars
merveenoyan/smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
lllyasviel/Fooocus
Focus on prompting and generating
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
IrinaGoloshchapova/ml_system_design_doc_ru
huggingface/text-generation-inference
Large Language Model Text Generation Inference
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Shiriluz/Word-As-Image
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
openai/transformer-debugger
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
facebookincubator/submitit
Python 3.8+ toolbox for submitting jobs to Slurm
floodsung/Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
HowProgrammingWorks/SelfAssessment
Software engineering self assessment
weaviate/weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
invoke-ai/InvokeAI
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
Zeqiang-Lai/Mini-DALLE3
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
mlzxy/devit
GenImage-Dataset/GenImage
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
georgian-io/LLM-Finetuning-Toolkit
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)