shiztong

shiztong's Stars

LAION-AI/CLAP
Contrastive Language-Audio Pretraining
Language:Python1.4k133
liucongg/NLPDataSet
记录本人整理的一些数据集
992133
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python7k512
NLPJCL/RAG-Retrieval
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT,Cross Encoder
Language:Python44138
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language:Python43k7.7k
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Language:Python13.4k1.8k
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
Language:Go29.7k2.9k
percent4/embedding_rerank_retrieval
本项目是针对RAG中的Retrieve阶段的召回技术及算法效果所做评估实验。使用主体框架为LlamaIndex.
Language:Jupyter Notebook15618
netease-youdao/BCEmbedding
Netease Youdao's open-source embedding and reranker models for RAG products.
Language:Python1.4k91
OpenGVLab/unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Language:Python28514
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Language:Python855121
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language:Python4.4k452
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language:Python1.3k85
X-PLUG/Youku-mPLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
Language:Python28011
danieljf24/awesome-video-text-retrieval
A curated list of deep learning resources for video-text retrieval.
58566
TXH-mercury/VALOR
Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
Language:Python25715
jpthu17/EMCL
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Language:Python1158
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
17.9k2.6k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12k768
fighting41love/zhvoice
Chinese voice corpus. 中文语音语料，语音更加清晰自然，包含8个开源数据集，3200个说话人，900小时语音，1300万字。
580114
double22a/speech_dataset
The dataset of Speech Recognition
38372
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook9.7k953
PaddlePaddle/PaddleVideo
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.
Language:Python1.5k376
meta-llama/llama
Inference code for Llama models
Language:Python55.8k9.5k
GitHubDaily/ChatGPT-Prompt-Engineering-for-Developers-in-Chinese
《面向开发者的 ChatGPT 提示词工程》非官方版中英双语字幕 Unofficial subtitles of "ChatGPT Prompt Engineering for Developers"
Language:Jupyter Notebook1.7k212
datawhalechina/llm-cookbook
面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版
Language:Jupyter Notebook11.5k1.4k
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
Language:Python11.9k1.1k
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language:Python37k3.2k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python35k4.1k
LC1332/Luotuo-Chinese-LLM
骆驼(Luotuo): Open Sourced Chinese Language Models. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技
Language:Jupyter Notebook3.6k246