RetrainIt's Stars
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
lzcemma/LeMDA
Code Example for Learning Multimodal Data Augmentation in Feature Space
guanyingc/latex_paper_writing_tips
Tips for Writing a Research Paper using LaTeX
ienlie0513/KAHAN
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
MILVLG/prophet
Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
UCSC-VLAA/CLIPA
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
xtekky/gpt4free
The official gpt4free repository | various collection of powerful language models
thunlp/OpenKE
An Open-Source Package for Knowledge Embedding (KE)
yrf1/InfoSurgeon
kx-Huang/ChatGPT-on-WeChat
🤖️ Deploy GPT-4o ChatGPT on your WeChat within 2 steps! 两步在云端部署你的微信ChatGPT聊天机器人!🤖️
LYuhang/Trans-Implementation
Implement of TransE, TransH, KG2E with pytorch
DeepGraphLearning/KnowledgeGraphEmbedding
meta-llama/llama
Inference code for Llama models
multimediaeval/2020-Flood-Related-Multimedia-Task
microsoft/GLIP
Grounded Language-Image Pre-training
TIBHannover/GeoEstimation
This repository contains all necessary meta information, results and source files to reproduce the results in the publication Eric Müller-Budack, Kader Pustu-Iren, Ralph Ewerth: "Geolocation Estimation of Photos using a Hierarchical Model and Scene Classification", In: European Conference on Computer Vision (ECCV), Munich, 2018.
guilk/KAT
Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"
hackerchenzhuo/LaKo
[Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection
PaulCCCCCCH/Multimodal-Categorization-of-Crisis-Events-in-Social-Media
An unofficial implementation of the CVPR 2020 paper Multimodal Categorization of Crisis Events in Social Media
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
ltian678/DUCK-code
Code for DUCK: Rumour Detection on Social Media by Modelling User and Comment Propagation Networks NAACL2022(https://aclanthology.org/2022.naacl-main.364/)
noise-learning/SelfMix
Westlake-AI/openmixup
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
zeke-xie/adaptive-inertia-adai
[ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum".
kkkkkkon/D-MCD
code for Denoised_Maximum_Classifier_Discrepancy_for_Source-Free_Unsupervised_Domain_Adaptation
BIT-DA/SCDA
[ICCV 2021] Code release for "Semantic Concentration for Domain adaptation"
amzn/amazon-weak-ner-needle
Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data
FerdinandZhong/punctuator
A small seq2seq punctuator tool based on DistilBERT
utsnlab/COfEE