eisneim

Hi there! I'm a Web developer

视频大拍档Shenzhen China

eisneim's Stars

thunlp/Migician
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
Language:Python503
chrischoy/WhisperChain
Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what you said!
Language:Python29015
chuanruihu/Level-Navi-Agent-Search
The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise search operations. The repo includes benchmarks, datasets, and tools for assessing LLM performance in web searches
Language:Python7810
SparkAudio/Spark-TTS
Spark-TTS Inference Code
Language:Python7.4k767
KohakuBlueleaf/PixelOE
Detail-Oriented Pixelization based on Contrast-Aware Outline Expansion.
Language:Python30417
zilliztech/deep-searcher
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Language:Python5.3k500
thu-pacman/chitu
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
Language:Python1.1k71
EvolvingLMMs-Lab/EgoLife
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
Language:Python24817
SesameAILabs/csm
A Conversational Speech Generation Model
Language:Python12.3k1.1k
dvruette/gidd
Code accompanying the paper "Generalized Interpolating Discrete Diffusion"
Language:Python705
gojasper/LBM
LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨
Language:Python33015
kuleshov-group/bd3lms
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
Language:Python48829
svg-project/Sparse-VideoGen
Language:Python1534
ButzYung/SystemAnimatorOnline
XR Animator, AI-based Full Body Motion Capture and Extended Reality (XR) solution, powered by System Animator Online
Language:JavaScript1k93
TrajectoryCrafter/TrajectoryCrafter
Official implementation of TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
Language:Python60827
TencentARC/VideoPainter
Any-length Video Inpainting and Editing with Plug-and-Play Context Control
Language:Python30318
kohya-ss/musubi-tuner
Language:Python52454
TTPlanetPig/Gui_for_musubi-tuner
A GUI for Kohya_ss musubi-tuner for easy use!
Language:Python673
AssafSinger94/dino-tracker
Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)
Language:Python47544
LTH14/fractalgen
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
Language:Python1k54
nv-tlabs/GEN3C
[CVPR 2025] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
49118
YisuiTT/Mobius
Mobius: Text to Seamless Looping Video Generation via Latent Shift
Language:Python1247
lumalabs/imm
Official implementation of Inductive Moment Matching
Language:Python4338
stepfun-ai/Step-Audio
Language:Python4.1k330
DigiRL-agent/digiq
Language:Python873
showlab/PhotoDoodle
Code Implementation of "PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data"
Language:Python36924
huggingface/movie-shot-categorizer
Fine-tune of Florence-2 for shot categorization.
Language:Jupyter Notebook231
ML-GSAI/LLaDA
Official PyTorch implementation for "Large Language Diffusion Models"
Language:Python1.4k106
FoundationVision/UniTok
A Unified Tokenizer for Visual Generation and Understanding
Language:Python2335
IHe-KaiI/CTRL-D
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion.
Language:Python331

eisneim

eisneim's Stars

thunlp/Migician

chrischoy/WhisperChain

chuanruihu/Level-Navi-Agent-Search

SparkAudio/Spark-TTS

KohakuBlueleaf/PixelOE

zilliztech/deep-searcher

thu-pacman/chitu

EvolvingLMMs-Lab/EgoLife

SesameAILabs/csm

dvruette/gidd

gojasper/LBM

kuleshov-group/bd3lms

svg-project/Sparse-VideoGen

ButzYung/SystemAnimatorOnline

TrajectoryCrafter/TrajectoryCrafter

TencentARC/VideoPainter

kohya-ss/musubi-tuner

TTPlanetPig/Gui_for_musubi-tuner

AssafSinger94/dino-tracker

LTH14/fractalgen

nv-tlabs/GEN3C

YisuiTT/Mobius

lumalabs/imm

stepfun-ai/Step-Audio

DigiRL-agent/digiq

showlab/PhotoDoodle

huggingface/movie-shot-categorizer

ML-GSAI/LLaDA

FoundationVision/UniTok

IHe-KaiI/CTRL-D