EricKani's Stars
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
pytorch-labs/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
microsoft/BioGPT
locuslab/TCN
Sequence modeling benchmarks and temporal convolutional networks
bowang-lab/MedSAM
Segment Anything in Medical Images
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
open-mmlab/mmdeploy
OpenMMLab Model Deployment Framework
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
microsoft/i-Code
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
airaria/TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
Sense-X/UniFormer
[ICLR2022] official implementation of UniFormer
jeffhj/LM-reasoning
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
junyuchen245/Transformer_for_medical_image_analysis
A collection of papers about Transformer in the field of medical image analysis.
JunMa11/SegWithDistMap
How Distance Transform Maps Boost Segmentation CNNs: An Empirical Study
iflytek/MiniRBT
MiniRBT (a series of small Chinese pre-trained models)
LynnHo/DCGAN-LSGAN-WGAN-GP-DRAGAN-Pytorch
DCGAN LSGAN WGAN-GP DRAGAN PyTorch
HiLab-git/WORD
[MedIA2022] WORD: A large scale dataset, benchmark and clinical applicable study for abdominal organ segmentation from CT image
speechandlanguageprocessing/ICASSP2022-Depression
Automatic Depression Detection: a GRU/BiLSTM-based Model and an Emotional Audio-Textual Corpus
hustvl/MSG-Transformer
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022)
PingCheng-Wei/DepressionEstimation
Bachelor Thesis - Deep Learning-based Multi-modal Depression Estimation
CMDC-corpus/CMDC-Baseline
The baseline model for the CMDC corpus
bbrister/ctOrganSegmentation
Morphological organ segmentation for CT scans