kar9999

kar9999's Stars

vsislab/Controllable_XGating
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Language:Python6713
MarcusNerva/HMN
[CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
Language:Python529
Shreyz-max/Video-Captioning
Video Captioning is an encoder decoder mode based on sequence to sequence learning
Language:Python12437
SpongebBob/Finetune-ChatGLM2-6B
ChatGLM2-6B 全参数微调，支持多轮对话的高效微调。
Language:Python39641
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language:Jupyter Notebook4.9k653
THUDM/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Language:Python4.1k422
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language:Python10.6k825
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台，我们欢迎开源爱好者发起任何有意义的pr！
Language:Jupyter Notebook2.6k248
Moeinh77/Image-Captioning-with-Beam-Search
Generating image captions using Xception Network and Beam Search in Keras - My Bachelor's thesis project
Language:Jupyter Notebook214
open-mmlab/Multimodal-GPT
Multimodal-GPT
Language:Python1.5k125
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language:Python3.8k287
lucidrains/flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
Language:Python1.2k59
Yushi-Hu/PromptCap
natual language guided image captioning
Language:Python787
THUDM/P-tuning-v2
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
Language:Python2k202
vFones/situation-recognition
Situation recognition with Graph Neural Network
Language:Python61
zhihuifanqiechaodan/vue3-admin-template
About 🎉 A magical vue3 admin http://vue3.zhihuifanqiechaodan.com
Language:Vue16046
PanJiaChen/vue-element-admin
:tada: A magical vue admin https://panjiachen.github.io/vue-element-admin
Language:Vue88.2k30.5k
Binaryify/NeteaseCloudMusicApi
网易云音乐 Node.js API service
30.3k15.8k
algorithmzuo/algorithmbasic2020
算法和数据结构体系学习班
Language:Java1.3k1k
rock3125/enhanced-subject-verb-object-extraction
Enhanced Subject Word Object Extraction
Language:Python14850
computationalmedia/semstyle
Code for learning to generate stylized image captions from unaligned text
Language:Python6311
yishuihanhan/wordninja
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
Language:Python509
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Language:Python1.8k203
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Language:Python892125
wubin5/STTN
paper : <Spatial-Temporal Transformer Networks for Traffic Flow Forecasting>
Language:Python16044
Scarecrow0/SGTR
Language:Python807
feymanpriv/DOLG
Pytorch Implementation of DOLG (ICCV 2021)
Language:Python5911
jd730/STRG
Pytorch Implementation of Videos as Space-Time Region Graphs
Language:Python265
cvlab-yonsei/MNAD
An official implementation of "Learning Memory-guided Normality for Anomaly Detection" (CVPR 2020) in PyTorch.
Language:Python34180
donggong1/memae-anomaly-detection
MemAE for anomaly detection. -- Gong, Dong, et al. "Memorizing Normality to Detect Anomaly: Memory-augmented Deep Autoencoder for Unsupervised Anomaly Detection". ICCV 2019.
Language:Python469104