ihollywhy

Research Scientist at ByteDance

ByteDance, Bellevue

ihollywhy's Stars

lllyasviel/Fooocus
Focus on prompting and generating
Language: Python 42.4k 336 1.6k 6.2k
Stability-AI/generative-models
Generative Models by Stability AI
Language: Python 25k 260 3132.8k
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
Language: Python 23.9k 514 2.5k 5.5k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language: Python 23k 190 5242.3k
danielgatis/rembg
Rembg is a tool to remove images background
Language: Python 17.6k 150 5151.9k
state-spaces/mamba
Mamba SSM architecture
Language: Python 13.7k 101 5831.2k
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language: Jupyter Notebook 11.1k 144 3701.1k
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Language: Python 9.8k 80 123716
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
Language: Python 7.2k 66 71554
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance
Language: Python 6.7k 57 720521
NVlabs/stylegan3
Official PyTorch implementation of StyleGAN3
Language: Python 6.5k 55 2581.1k
THUDM/CogVLM
a state-of-the-art-level open visual language model | multimodal pretrained model
Language: Python 6.3k 68 433424
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
Language: Jupyter Notebook 5.9k 76 2211.2k
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Language: Python 5.8k 79 142377
roboflow/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
Language: Jupyter Notebook 5.7k 80 145911
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Language: Python 4.9k 45 494473
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Language: Python 4.4k 61 97230
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Language: Python 4.1k 86 103367
facebookresearch/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
Language: Jupyter Notebook 4.1k 33 114269
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language: Python 2.9k 48 0182
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
Language: Python 2.7k 37 138176
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
Language: Python 2.5k 40 151202
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language: Python 2.3k 37 149189
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Language: Python 2.2k 28 166210
Computer-Vision-in-the-Wild/CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
1.2k 38 658
ermongroup/SDEdit
PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations
Language: Python 1k 23 2892
wpeebles/gangealing
Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)
Language: Python 1k 16 38120
OpenGVLab/VisionLLM
VisionLLM Series
Language: Python 968 44 1531
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
Language: Python 828 15 5339
YifanXu74/MQ-Det
Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)
Language: Python 277 2 6213
