SakurajimaMaiii
Transfer learning, multimodal learning, and medical AI. NLP @aiwaves-cn
@aiwaves-cn Hangzhou, China
SakurajimaMaiii's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
microsoft/autogen
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
netease-youdao/QAnything
Question and Answer based on Anything.
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
apernet/OpenGFW
OpenGFW is a flexible, easy-to-use, open source implementation of GFW (Great Firewall of China) on Linux
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
openai/weak-to-strong
state-spaces/s4
Structured state space sequence models
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
yuewang-cuhk/awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
IBM/Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
VILA-Lab/ATLAS
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos.
kaidic/LDAM-DRW
[NeurIPS 2019] Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
kgl-prml/Contrastive-Adaptation-Network-for-Unsupervised-Domain-Adaptation
PyTorch implementation of Contrastive Adaptation Network
LLaMafia/llamafia.github
shengliu66/ELR
Official Implementation of Early-Learning Regularization Prevents Memorization of Noisy Labels
google-research/syn-rep-learn
Learning from synthetic data - code and models
nghiakvnvsd/wav2lip384
MIDL-Conference/MIDLLatexTemplate
LaTeX template for the MIDL Conference
Re-Align/AlignTDS
Analyzing LLM Alignment via Token distribution shift
TIDESlab/ITS
[ICASSP 2024] Boosting of Implicit Neural Representation-based Image Denoiser