alfredplpl

Research Scientist. Interests: data science, machine learning, robotics, neuroscience

CyberAgent, Inc. · Japan

alfredplpl's Stars

TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
Language: Python · Stars: 2.4k · Forks: 170
mattyamonaca/starline
Strict coloring machine for line drawings.
Language: Python · 1438
chuanyangjin/fast-DiT
Fast Diffusion Models with Transformers
Language: Python · 772100
tosiyuki/LLaVA-JP
LLaVA-JP is a Japanese VLM trained by LLaVA method
Language: Python · 5713
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
Language: Python · Stars: 1.7k · Forks: 202
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language: Python · Stars: 2.1k · Forks: 88
IDEA-Research/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
Language: Python · Stars: 2.3k · Forks: 146
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Language: Python · Stars: 1.7k · Forks: 84
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language: Python · Stars: 6.7k · Forks: 596
VOICEVOX/voicevox_engine
The speech synthesis engine of VOICEVOX, a free, medium-quality text-to-speech software.
Language: Python · Stars: 1.3k · Forks: 205
google/imageinwords
Data release for the ImageInWords (IIW) paper.
Language: JavaScript · 2059
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language: Python · Stars: 8.1k · Forks: 823
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language: TypeScript · Stars: 57.3k · Forks: 8.5k
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Language: Python · Stars: 12.9k · Forks: 889
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python · Stars: 38.3k · Forks: 4.3k
magic-research/PLLaVA
Official repository for the paper PLLaVA
Language: Python · 62546
rohitgandikota/sliders
Concept Sliders for Precise Control of Diffusion Models
Language: Jupyter Notebook · 99079
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Language: Go · Stars: 106k · Forks: 8.5k
HighCWu/ControlLoRA
ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information
Language: Python · 57427
HighCWu/control-lora-v2
ControlLoRA Version 2: A Lightweight Neural Network To Control Stable Diffusion Spatial Information Version 2
Language: Python · 1056
pytorch/torchtune
PyTorch native post-training library
Language: Python · Stars: 4.5k · Forks: 471
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python · Stars: 27.8k · Forks: 3.2k
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Language: Python · Stars: 4.1k · Forks: 643
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Language: Python · Stars: 1.7k · Forks: 120
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language: Python · Stars: 137k · Forks: 27.5k
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python · Stars: 1.3k · Forks: 125
chaojie/ComfyUI-Open-Sora-Plan
Language: Python · 518
xhedit/quantkit
cli tool to quantize gguf, gptq, awq, hqq and exl2 models
Language: Python · 654
ykdai/BasicPBC
Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"
Language: Python · 26625
HaozheLiu-ST/T-GATE
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
Language: Python · 37024
