jxz542189's Stars
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
amusi/Deep-Learning-Interview-Book
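Deep learning interview handbook (covering mathematics, machine learning, deep learning, computer vision, natural language processing, SLAM, and related areas)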
openvinotoolkit/openvino
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
mli/autocut
Cut videos with a text editor
TimDettmers/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
microsoft/torchscale
Foundation Architecture for (M)LLMs
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
TingFree/NLPer-Arsenal
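A collection of NLP competition strategy implementations, baselines for various tasks, competition experience posts (current, past, and practice competitions), NLP conference dates, commonly followed media accounts, GPU recommendations, and more; continuously updated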
microsoft/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
PaddlePaddle/Research
Novel deep learning research work with PaddlePaddle
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
SHI-Labs/Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
microsoft/VideoX
VideoX: a collection of video cross-modal models
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
hpcaitech/EnergonAI
Large-scale model inference.
facebookresearch/omnivore
Omnivore: A Single Model for Many Visual Modalities
Langboat/Mengzi
Mengzi Pretrained Models
HillZhang1999/MuCGEC
Open-source release of the MuCGEC Chinese error correction dataset and SOTA text correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
facebookresearch/LaViLa
Code release for "Learning Video Representations from Large Language Models"
microsoft/XPretrain
Multi-modality pre-training
CLUEbenchmark/pCLUE
pCLUE: a 1,000,000+ multi-task prompt learning dataset
alibaba/proxima
zdou0830/METER
METER: A Multimodal End-to-end TransformER Framework
hpcaitech/ColossalAI-Examples
Examples of training models with hybrid parallelism using ColossalAI
iflytek/MiniRBT
MiniRBT (a series of small Chinese pre-trained models)
Alibaba-NLP/HiAGM
Hierarchy-Aware Global Model for Hierarchical Text Classification
salesforce/ALPRO
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
xuguohai/X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
mayuelala/SimVTP
SimVTP: This repo is the official implementation of "Simple Video Text Pre-training with Masked Autoencoders"