Pinned Repositories
Boosted-Multi-View
Multi-View (Multi-Modal) Learning based on boosting ideas (like AdaBoost)
CET6
Summary of past CET-6 exam papers
Classic-CNN
Reimplementations of classic CNN models
CS231n-2023-Assignments
Stanford University CS231n Spring 2023 - Assignment Solutions
e-wardrobe
Database design course project
GrabCut
C++ implementation for 《"GrabCut" — Interactive Foreground Extraction using Iterated Graph Cuts》
Image-Matting
Three DIP Methods for Alpha Matting
mindspore-GAN
GAN based on MindSpore
Sentiment-Analysis
Sentiment Analysis based on DTC & LSTM
ViT-for-Cifar100
PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
xxayt's Repositories
xxayt/GrabCut
C++ implementation for 《"GrabCut" — Interactive Foreground Extraction using Iterated Graph Cuts》
xxayt/CS231n-2023-Assignments
Stanford University CS231n Spring 2023 - Assignment Solutions
xxayt/CET6
Summary of past CET-6 exam papers
xxayt/e-wardrobe
Database design course project
xxayt/mindspore-GAN
GAN based on MindSpore
xxayt/ViT-for-Cifar100
PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
xxayt/Boosted-Multi-View
Multi-View (Multi-Modal) Learning based on boosting ideas (like AdaBoost)
xxayt/MACA
Discussion about the Influence of Multi-Head Attention in Cross-Attention
xxayt/2023BrithdayShow-for-yt
xxayt/Classic-Transformer
xxayt/CLIP4Clip-annotated
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
xxayt/DynamicMLP
Official Codes and Pretrained Models for Dynamic MLP, CVPR2022, https://arxiv.org/abs/2203.03253
xxayt/LXMERT
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
xxayt/MetaFormer
A PyTorch implementation of "MetaFormer: A Unified Meta Framework for Fine-Grained Recognition". A reference PyTorch implementation of “CoAtNet: Marrying Convolution and Attention for All Data Sizes”
xxayt/scnni
Simple Convolution Neural Network Inference Framework
xxayt/ViLBERT
xxayt/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
xxayt/VMMR
Video to Music Moment Retrieval
xxayt/ConvNeXt
Code release for ConvNeXt model
xxayt/drawio
xxayt/MCAN
Deep Modular Co-Attention Networks for Visual Question Answering (VQA)
xxayt/NLP-Interview-Notes
This repository mainly collects interview questions for NLP algorithm engineers
xxayt/nothing
xxayt/PromptSwitch
xxayt/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
xxayt/Swin-Transformer-Object-Detection
This is an official implementation for Swin Transformer on Object Detection and Instance Segmentation. In addition, xzj added the ConvNeXt model
xxayt/TeachCLIP
Source code of our CVPR2024 paper TeachCLIP for Text-to-Video Retrieval
xxayt/UT-CMVMR
xxayt/ViLBERT-Multi-Task
Multi Task Vision and Language
xxayt/xxayt.github.io