equus71

Wrocław, Poland

equus71's Stars

MC-E/ReVideo
Language:Python2495
reflex-dev/reflex
🕸️ Web apps in pure Python 🐍
Language:Python17.9k993
FoundationVision/Groma
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Language:Python47255
ZhengPeng7/BiRefNet
[arXiv'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Language:Python27425
dora-rs/dora
DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
Language:Rust1.3k67
KellerJordan/cifar10-airbench
94% on CIFAR-10 in 3.29 seconds 💨 96% in 35 seconds
Language:Python1135
Vision-CAIR/MiniGPT4-video
Official code for MiniGPT4-video
Language:Python44046
InstantStyle/InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
Language:Jupyter Notebook1.4k85
yardenfren1996/B-LoRA
Implicit Style-Content Separation using B-LoRA
Language:Jupyter Notebook21910
snap-research/MyVLM
Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries"
Language:Python1336
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Language:Python4k480
bfshi/scaling_on_scales
When do we not need larger vision models?
Language:Python2537
fuxiao0719/GeoWizard
[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
Language:Python53521
Haiyang-W/GiT
🔥 [ECCV2024] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
Language:Python23611
OpenGVLab/VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
Language:Python67349
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Language:Python12.4k2.8k
deepseek-ai/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Language:Python1.9k174
VAST-AI-Research/TripoSR
Language:Python4k467
DLYuanGod/TinyGPT-V
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Language:Python1.2k71
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
Language:Python78538
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language:Python8.7k789
whlzy/FiT
[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model
3407
Vchitect/SEINE
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Language:Python84558
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python10.4k755
TencentARC/PhotoMaker
PhotoMaker
Language:Jupyter Notebook8.6k675
apple/ml-aim
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
Language:Python66341
AIGCDesignGroup/ReplaceAnything
2.3k95
Jiawei-Yang/Denoising-ViT
This is the official code release for our work, Denoising Vision Transformers.
Language:Python2137
yancie-yjr/StreamYOLO
Real-time Object Detection for Streaming Perception, CVPR 2022
Language:Python29840
numba/numba
NumPy aware dynamic Python compiler using LLVM
Language:Python9.6k1.1k

equus71

equus71's Stars

MC-E/ReVideo

reflex-dev/reflex

FoundationVision/Groma

ZhengPeng7/BiRefNet

dora-rs/dora

KellerJordan/cifar10-airbench

Vision-CAIR/MiniGPT4-video

InstantStyle/InstantStyle

yardenfren1996/B-LoRA

snap-research/MyVLM

myshell-ai/MeloTTS

bfshi/scaling_on_scales

fuxiao0719/GeoWizard

Haiyang-W/GiT

OpenGVLab/VideoMamba

PaddlePaddle/PaddleDetection

deepseek-ai/DeepSeek-VL

VAST-AI-Research/TripoSR

DLYuanGod/TinyGPT-V

NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion

karpathy/minbpe

whlzy/FiT

Vchitect/SEINE

InstantID/InstantID

TencentARC/PhotoMaker

apple/ml-aim

AIGCDesignGroup/ReplaceAnything

Jiawei-Yang/Denoising-ViT

yancie-yjr/StreamYOLO

numba/numba