icemoon-creative's Stars
facebookresearch/detr
End-to-End Object Detection with Transformers
CVI-SZU/Linly
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
facebookresearch/pycls
Codebase for Image Classification Research, written in PyTorch.
mbzuai-oryx/LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
tatsu-lab/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
jia-zhuang/pytorch-multi-gpu-training
整理 pytorch 单机多 GPU 训练方法与原理
yxuansu/PandaGPT
[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
SunzeY/AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
huggingface/huggingface-llama-recipes
Coobiw/MPP-LLaVA
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
HqWu-HITCS/Awesome-LLM-Survey
An Awesome Collection for LLM Survey
TXH-mercury/VALOR
Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
IAAR-Shanghai/CRUD_RAG
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
jasonvanf/llama-trl
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
naver-ai/rope-vit
[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"
Rose-STL-Lab/dyffusion
[NeurIPS 2023] A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting
wei-potato/Train-llm-from-scratch
使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力
IBM/SALMON
Self-Alignment with Principle-Following Reward Models
NVlabs/A-ViT
Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022)
whwu95/MVFNet
【AAAI'2021】MVFNet: Multi-View Fusion Network for Efficient Video Recognition
Purdue-M2/Detect-LAIM-generated-Multimedia-Survey
Lisennlp/distributed_train_pytorch
pytorch分布式训练,支持多机多卡,单机多卡。
xieyuankun/Codecfake
This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".
heliossun/SQ-LLaVA
Visual self-questioning for large vision-language assistant.
JiwanChung/vlis
nctu-eva-lab/AntifakePrompt
This is the official implementation of AntifakePrompt.
GeWu-Lab/TSPM
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
roger-tseng/CodecFake
A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024