SakurajimaMaiii

Transfer learning, multimodal learning, and medical AI. NLP @aiwaves-cn

@aiwaves-cn Hangzhou, China

SakurajimaMaiii's Stars

langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language:Python88.4k 670 7.1k13.9k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python31.7k 270 1.1k3.8k
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
Language:C++29.3k 481 2.4k3.5k
OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language:Python11.1k 149 7982.1k
BerriAI/litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
Language:Python10.2k 59 2.6k1.1k
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
Language:Python9.7k 162 6352.1k
bentoml/OpenLLM
Run any open-source LLM, such as Llama 2 or Mistral, as an OpenAI-compatible API endpoint in the cloud.
Language:Python9.2k 53 255588
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language:C++7.6k 75 145404
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language:Python6k 71 226895
wgwang/awesome-LLMs-In-China
** large models
4.8k 102 24410
AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Language:Python4.1k 34 435416
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3k 56 3192
openai/weak-to-strong
Language:Python2.5k 34 18296
Zz-ww/SadTalker-Video-Lip-Sync
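This project implements Wav2Lip-style video lip synthesis based on SadTalker. It generates speech-driven lip motion from a video file, and applies a configurable enhancement to the facial region to sharpen the synthesized lip (face) area and improve the clarity of the generated lips. It uses DAIN, a deep learning frame-interpolation algorithm, to fill in frames of the generated video, smoothing the transitions between synthesized lip movements so that they look more fluid, realistic, and natural.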
Language:Python1.7k 36 86292
ytongbai/LVM
Language:Python1.7k 133 1851
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Language:Python1.4k 11 330161
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
Language:Python1.3k 25 71147
numz/sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111
Language:Python1.2k 23 115157
VILA-Lab/ATLAS
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
Language:Python860 20 782
open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos.
Language:Python787 20 3764
kaidic/LDAM-DRW
[NeurIPS 2019] Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss
Language:Python637 14 18115
dvlab-research/LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
Language:Python619 12 9539
kgl-prml/Contrastive-Adaptation-Network-for-Unsupervised-Domain-Adaptation
PyTorch implementation of Contrastive Adaptation Network
Language:Python320 6 1858
LLaMafia/llamafia.github
Language:Python294 21 217
shengliu66/ELR
Official Implementation of Early-Learning Regularization Prevents Memorization of Noisy Labels
Language:Python287 7 2129
google-research/syn-rep-learn
Learning from synthetic data - code and models
Language:Python271 11 512
ZhangYuanhan-AI/visual_prompt_retrieval
[NeurIPS 2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"
Language:Python158 4 106
nghiakvnvsd/wav2lip384
Language:Python147 6 652
test-time-training/mttt
Language:Python51 2 03
Re-Align/AlignTDS
Analyzing LLM alignment via token distribution shift
Language:Python12 2 01
