boboyiyi

Read, Search, Ask

Bytedance, Beijing

boboyiyi's Stars

black-forest-labs/flux
Official inference repo for FLUX.1 models
Language: Python 18.7k 159 01.3k
ExistentialAudio/BlackHole
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
Language: C 15.5k 124 402601
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook 11.1k 64 259944
idealvin/coost
A tiny boost library in C++11.
Language: C++ 4k 133 218563
BadToBest/EchoMimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language: Python 2.9k 43 180339
PeterH0323/Streamer-Sales
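Streamer-Sales 销冠 (Sales Champion): a sales-streamer LLM 🛒🎁 that presents a product based on its given features, framed to stimulate the user's desire to buy. 🚀⭐ Includes a detailed data generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a front end built on the Vue ecosystem 🍍, a back end built with FastAPI 🗝️, and packaging and deployment with Docker Compose 🐋
Language: Python 2.7k 41 28420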
malinkang/weread2notion-pro
Language: Python 2.6k 10 125.1k
PowerHouseMan/ComfyUI-AdvancedLivePortrait
Language: Python 2.1k 17 71175
kijai/ComfyUI-LivePortraitKJ
ComfyUI nodes for LivePortrait
Language: Python 1.7k 21 137138
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Language: Jupyter Notebook 1.7k 26 52100
karpathy/nano-llama31
nanoGPT style version of Llama 3.1
Language: Python 1.3k 23 666
muzishen/IMAGDressing
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.
Language: Python 1.1k 14 4392
OpenTeleVision/TeleVision
[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Language: Python 703 9 4172
facebookresearch/ocean
Ocean is the in-house framework for Computer Vision (CV) and Augmented Reality (AR) applications at Meta. It is platform independent and is mainly implemented in C/C++.
Language: C++ 660 16 2859
warmshao/FasterLivePortrait
Bring portraits to life in real time! ONNX/TensorRT support! Real-time portrait animation!
Language: Python 565 16 9453
IDEA-Research/X-Pose
[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"
Language: Python 559 23 3328
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
Language: Python 479 19 2825
huangyangyi/TeCH
[3DV 2024] Official repo of "TeCH: Text-guided Reconstruction of Lifelike Clothed Humans"
Language: Python 397 30 4225
aim-uofa/MovieDreamer
261 24 38
OpenT2S/LlamaVoice
LlamaVoice is a llama-based large voice generation model, providing inference and training ability.
Language: Python 224 23 312
Francis-Rings/MotionFollower
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion
Language: Python 202 16 616
yanivw12/gs2mesh
[ECCV 2024] Official implementation of the paper "GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views"
Language: Python 185 7 1910
aihacker111/Efficient-Live-Portrait
Fast running Live Portrait with TensorRT and ONNX models
Language: Python 148 6 2314
bornfly-detachment/asymmetric_magvitv2
An open-source implementation of asymmetric magvit_v2, described by its authors as the strongest as of 2024. It provides inference code but excludes VQVAE. It supports joint encoding of images and videos, accommodating arbitrary video lengths and resolutions. The authors state that it surpasses all open-source models in FID and FVD; 4z and 16z models are available on Hugging Face.
Language: Python 135 10 55
guoqincode/Focus-on-Your-Instruction
[CVPR 2024] Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Language: Python 110 16 511
KeyuWu-CS/MonoHair
Code of MonoHair: High-Fidelity Hair Modeling from a Monocular Video
Language: Python 102 18 194
XuanchenLi/Topo4D
[ECCV 2024] Official implementation of Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture
Language: Python 73 12 72
asw91666/TRG-Release
Official PyTorch implementation of "6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry," ECCV 2024
67 16 24
TencentQQGYLab/LinguaLinker
LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement
65 12 32
eccv2024tcan/TCAN
Language: Python 41 8 56
