guoxinXiong

guoxinXiong's Stars

chatchat-space/Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent app built with Langchain and language models such as ChatGLM, Qwen and Llama
Language: TypeScript, 31.8k stars, 5.5k forks
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Language: Python, 40.6k stars, 5.2k forks
CLUEbenchmark/SuperCLUE
SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese
3k stars, 96 forks
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
Language: Python, 18.3k stars, 1.9k forks
TencentARC/UMT
UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.
Language:Python19118
li-plus/DSNet
DSNet: A Flexible Detect-to-Summarize Network for Video Summarization
Language:Python20850
THUDM/VisualGLM-6B
Chinese and English multimodal conversational language model
Language: Python, 4.1k stars, 416 forks
VincentJYZhang/USTC_Lecture
Course-selection script for USTC graduate academic seminars
Language:Python193
Yutong-Zhou-cv/Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
2.1k stars, 190 forks
danieljf24/awesome-video-text-retrieval
A curated list of deep learning resources for video-text retrieval.
58967
lllyasviel/ControlNet
Let us control diffusion models!
Language: Python, 30.2k stars, 2.7k forks
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language: Python, 7k stars, 717 forks
microsoft/VSE_Gradient
Language:Python4
AAA-Zheng/Image-Text-Matching-Summary
Summary of Related Research on Image-Text Matching
654
QinYang79/DECL
Deep Evidential Learning with Noisy Correspondence for Cross-modal Retrieval (ACM Multimedia 2022, PyTorch code)
Language:Python405
kaixindelele/ChatPaper
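Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow: full-paper summarization, professional translation, polishing, peer review, and review responses with ChatGPT
Language: Python, 18.4k stars, 1.9k forks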
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Language: Python, 1.8k stars, 198 forks
labyrinth7x/Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching
Deep Cross-Modal Projection Learning for Image-Text Matching
Language:Python7321
robi56/video-summarization-resources
Video Summarization Dataset, Papers, Codes
15626
woodfrog/vse_infty
Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)
Language:Python15616
LgQu/DIME
Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21
Language:Python664
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
Language: Jupyter Notebook, 5.8k stars, 1.1k forks
openai/DALL-E
PyTorch package for the discrete VAE used for DALL·E.
Language: Python, 10.8k stars, 1.9k forks
lucidrains/DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch
Language: Python, 5.6k stars, 643 forks
zhjohnchan/awesome-vision-and-language-pretraining
A curated list of vision-and-language pre-training (VLP). :-)
567
haofanwang/awesome-vision-language-modeling
Recent Advances in Vision-Language Pre-training!
272
yuewang-cuhk/awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
1.1k stars, 101 forks
DirtyHarryLYL/Transformer-in-Vision
Recent Transformer-based CV and related works.
1.3k stars, 144 forks
fawazsammani/awesome-vision-language-pretraining
Awesome Vision-Language Pretraining Papers
293
cshizhe/hgr_v2t
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
Language:Python20921