Pinned Repositories
auxiliary-deep-generative-models
Implementation of auxiliary deep generative models for semi-supervised learning
Awesome-Face-Forgery-Generation-and-Detection
A curated list of articles and codes related to face forgery generation and detection.
DeepVideoAnalytics
Analyze videos, perform detections, index frames & detected objects, search by examples
DRN-1
Code for "Learning Multiple Tasks with Deep Relationship Networks" NIPS 2017
LLaMA-Adapter-2
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
MobilePose-pytorch
Single Person Pose Estimation for Mobile Device
Multiscale-Super-Spectral
Accurate Spectral Super-resolution from Single RGB Image Using Multi-scale CNN
PointGrow
An autoregressive model for point cloud generation augmented with self-attention
pose-residual-network
Code for 'MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network' paper
pseudo-3d-residual-networks
Pseudo-3D Convolutional Residual Networks for Video Representation Learning
ml-lab's Repositories
ml-lab/LLaMA-Adapter-2
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
ml-lab/Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
ml-lab/sapiens
High-resolution models for human tasks.
ml-lab/AIW
Alice in Wonderland code base for experiments and raw experiments data
ml-lab/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
ml-lab/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
ml-lab/autodistill
Images to inference with no labeling (use foundation models to train supervised models)
ml-lab/BitNet
Official inference framework for 1-bit LLMs
ml-lab/ComfyUI-AnimateAnyone-Evolved
Improved AnimateAnyone implementation that allows you to use the opse image sequence and reference image to generate stylized video
ml-lab/DB-GPT
Interact your data and environment using the local GPT, no data leaks, 100% privately, 100% security
ml-lab/DreamPose
Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"
ml-lab/esm
ml-lab/gala1
ml-lab/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
ml-lab/graphcast
ml-lab/grok-1
Grok open release
ml-lab/guidance
A guidance language for controlling large language models.
ml-lab/HairFastGAN
Official Implementation for "HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach"
ml-lab/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
ml-lab/LivePortrait
Make one portrait alive!
ml-lab/LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
ml-lab/mandala
A powerful and easy to use Python framework for experiment tracking and incremental computing
ml-lab/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
ml-lab/minimal_outer_ellipsoid
Search the smallest ellipsoid that covers a basic semi-algebraic set through convex optimization
ml-lab/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
ml-lab/opendream
An extensible, easy-to-use, and portable diffusion web UI 👨🎨
ml-lab/privateGPT
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
ml-lab/roop
one-click deepfake (face swap)
ml-lab/shap-e
Generate 3D objects conditioned on text or images
ml-lab/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).