Pinned Repositories
allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
BlendArMocap
Real-time motion tracking in Blender using MediaPipe and Rigify
BlenderScripts
Some Blender scripts developed during TSL projects
chatgpt_api_test
Demos utilizing the ChatGPT API
ControllableTalkNet
A web app that lets you play around with TalkNet models
CVPR2022-DaGAN
Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
demo-whisper
This is a Whisper transcription starter template from Banana.dev that allows on-demand serverless GPU inference of the openai/whisper-base model from Hugging Face. Basically your own Whisper API.
dharamshala
DL-Art-School
DLAS - A configuration-driven trainer for generative models
tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
andrewkuo's Repositories
andrewkuo/tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
andrewkuo/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
andrewkuo/BlendArMocap
Real-time motion tracking in Blender using MediaPipe and Rigify
andrewkuo/BlenderScripts
Some Blender scripts developed during TSL projects
andrewkuo/chatgpt_api_test
Demos utilizing the ChatGPT API
andrewkuo/ControllableTalkNet
A web app that lets you play around with TalkNet models
andrewkuo/CVPR2022-DaGAN
Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
andrewkuo/demo-whisper
This is a Whisper transcription starter template from Banana.dev that allows on-demand serverless GPU inference of the openai/whisper-base model from Hugging Face. Basically your own Whisper API.
andrewkuo/dharamshala
andrewkuo/DL-Art-School
DLAS - A configuration-driven trainer for generative models
andrewkuo/gan-control
This package provides a PyTorch implementation of "GAN-Control: Explicitly Controllable GANs", ICCV 2021.
andrewkuo/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
andrewkuo/gpt4-pdf-chatbot-langchain
GPT4 & LangChain Chatbot for large PDF docs
andrewkuo/HN-UnifiedSourceFilterGAN
andrewkuo/inbox-manager-agent
andrewkuo/langchain-supabase-website-chatbot
Build a ChatGPT chatbot for your website using LangChain, Supabase, TypeScript, OpenAI, and Next.js.
andrewkuo/llama-chat
Chat with Meta's LLaMA models at home made easy
andrewkuo/llama-dl
High-speed download of LLaMA, Facebook's 65B parameter GPT model
andrewkuo/llama-int8
Quantized inference code for LLaMA models
andrewkuo/nginx-proxy
Automated nginx proxy for Docker containers using docker-gen
andrewkuo/One-Shot_Free-View_Neural_Talking_Head_Synthesis
Unofficial PyTorch implementation of the paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"
andrewkuo/ShortGPT
The best framework for automating video and short content creation
andrewkuo/sidekick
Open source ETL platform for retrieval augmented generation (RAG). Sync data from your SaaS tools to a vector store, where they can be easily queried by LLM apps
andrewkuo/SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
andrewkuo/StyleAvatar
Code of SIGGRAPH 2023 Conference paper: StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video
andrewkuo/Thin-Plate-Spline-Motion-Model
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
andrewkuo/tts-tortoise-gradio
A Gradio setup for Tortoise TTS.
andrewkuo/UnifiedSourceFilterGAN
andrewkuo/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
andrewkuo/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)