jasper0314-huang

Ph.D. student @ National Taiwan University

Taiwan, Taipei

jasper0314-huang's Stars

zhihou7/dit_policy_vla
Language:Python10
Psi-Robot/DexGraspVLA
DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping
Language:Python1005
juruobenruo/DexVLA
Language:Python1782
hiyouga/EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Language:Python1.1k56
b09902097/motionmatcher
The implementation of MotionMatcher, a feature-level fine-tuning framework for motion customization.
10
EmbodiedBench/EmbodiedBench
Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.
Language:Python732
OpenMOSS/VLABench
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
Language:Python1511
microsoft/CogACT
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
Language:Python18014
2U1/Qwen2-VL-Finetune
An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.
Language:Python39047
QwenLM/Qwen2.5-VL
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Jupyter Notebook8.4k584
Robot-VLAs/RoboVLMs
Language:Python28711
genforce/freecontrol
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
Language:Python46315
jasper0314-huang/Receler
[ECCV 2024] "Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers" (Official Implementation)
Language:Python34
voxel51/fiftyone-brain
Open source AI/ML capabilities for the FiftyOne ecosystem
Language:Python13810
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language:Python9.3k606
AFeng-x/PixWizard
[ICLR2025]
Language:Python1381
LPengYang/MotionClone
[ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Language:Python45534
microsoft/WindowsAgentArena
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Language:Python60864
MiuLab/VisualDialog
Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models
Language:Python31
Jack24658735/FedLGT
[AAAI 2024] Official Implementation of Language-Guided Transformer for Federated Multi-Label Classification
Language:Python152
TimChou-ntu/GSNeRF
[CVPR 2024] GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding
Language:Python13
jjihwan/FIFO-Diffusion_public
Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)
Language:Python44231
chu0802/SnD
This is an official implementation of our work, Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models, accepted to ECCV'24
Language:Python91
showlab/MotionDirector
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Language:Python87856
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
Language:Python1.9k130
ntucllab/libcll
Complementary-label learning in Pytorch
Language:Python176
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python10.9k1k
diffusion-motion-transfer/diffusion-motion-transfer
Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer""
Language:Python16617
jianzongwu/MotionBooth
[NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
Language:Python1289
agwmon/MuDI
MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)
Language:Jupyter Notebook834

jasper0314-huang

jasper0314-huang's Stars

zhihou7/dit_policy_vla

Psi-Robot/DexGraspVLA

juruobenruo/DexVLA

hiyouga/EasyR1

b09902097/motionmatcher

EmbodiedBench/EmbodiedBench

OpenMOSS/VLABench

microsoft/CogACT

2U1/Qwen2-VL-Finetune

QwenLM/Qwen2.5-VL

Robot-VLAs/RoboVLMs

genforce/freecontrol

jasper0314-huang/Receler

voxel51/fiftyone-brain

voxel51/fiftyone

AFeng-x/PixWizard

LPengYang/MotionClone

microsoft/WindowsAgentArena

MiuLab/VisualDialog

Jack24658735/FedLGT

TimChou-ntu/GSNeRF

jjihwan/FIFO-Diffusion_public

chu0802/SnD

showlab/MotionDirector

NUS-HPC-AI-Lab/VideoSys

ntucllab/libcll

THUDM/CogVideo

diffusion-motion-transfer/diffusion-motion-transfer

jianzongwu/MotionBooth

agwmon/MuDI