kar9999's Stars
vsislab/Controllable_XGating
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
MarcusNerva/HMN
[CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
Shreyz-max/Video-Captioning
Video Captioning is an encoder decoder mode based on sequence to sequence learning
SpongebBob/Finetune-ChatGLM2-6B
ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
THUDM/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!
Moeinh77/Image-Captioning-with-Beam-Search
Generating image captions using Xception Network and Beam Search in Keras - My Bachelor's thesis project
open-mmlab/Multimodal-GPT
Multimodal-GPT
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
lucidrains/flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
Yushi-Hu/PromptCap
natual language guided image captioning
THUDM/P-tuning-v2
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
vFones/situation-recognition
Situation recognition with Graph Neural Network
zhihuifanqiechaodan/vue3-admin-template
About 🎉 A magical vue3 admin http://vue3.zhihuifanqiechaodan.com
PanJiaChen/vue-element-admin
:tada: A magical vue admin https://panjiachen.github.io/vue-element-admin
Binaryify/NeteaseCloudMusicApi
网易云音乐 Node.js API service
algorithmzuo/algorithmbasic2020
算法和数据结构体系学习班
rock3125/enhanced-subject-verb-object-extraction
Enhanced Subject Word Object Extraction
computationalmedia/semstyle
Code for learning to generate stylized image captions from unaligned text
yishuihanhan/wordninja
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
wubin5/STTN
paper : <Spatial-Temporal Transformer Networks for Traffic Flow Forecasting>
Scarecrow0/SGTR
feymanpriv/DOLG
Pytorch Implementation of DOLG (ICCV 2021)
jd730/STRG
Pytorch Implementation of Videos as Space-Time Region Graphs
cvlab-yonsei/MNAD
An official implementation of "Learning Memory-guided Normality for Anomaly Detection" (CVPR 2020) in PyTorch.
donggong1/memae-anomaly-detection
MemAE for anomaly detection. -- Gong, Dong, et al. "Memorizing Normality to Detect Anomaly: Memory-augmented Deep Autoencoder for Unsupervised Anomaly Detection". ICCV 2019.