alfredplpl
Research Scientist. Interests: data science, machine learning, robotics, neuroscience
CyberAgent, incJapan
alfredplpl's Stars
TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
mattyamonaca/starline
Strict coloring machine for line drawings.
chuanyangjin/fast-DiT
Fast Diffusion Models with Transformers
tosiyuki/LLaVA-JP
LLaVA-JP is a Japanese VLM trained by LLaVA method
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
IDEA-Research/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
VOICEVOX/voicevox_engine
無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXの音声合成エンジン
google/imageinwords
Data release for the ImageInWords (IIW) paper.
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
magic-research/PLLaVA
Official repository for the paper PLLaVA
rohitgandikota/sliders
Concept Sliders for Precise Control of Diffusion Models
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
HighCWu/ControlLoRA
ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information
HighCWu/control-lora-v2
ControlLoRA Version 2: A Lightweight Neural Network To Control Stable Diffusion Spatial Information Version 2
pytorch/torchtune
PyTorch native post-training library
meta-llama/llama3
The official Meta Llama 3 GitHub site
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
chaojie/ComfyUI-Open-Sora-Plan
xhedit/quantkit
cli tool to quantize gguf, gptq, awq, hqq and exl2 models
ykdai/BasicPBC
Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"
HaozheLiu-ST/T-GATE
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!