Pinned Repositories
AssignmentProblem_with_Hungarian_and_AntColonyAlgorithm
Using Hungarian Algorithm and Ant Colony Algorithm to deal with balanced (m==n) and unbalanced (m<n) assignment problems.
DSRG
Code for A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation
easy-rl
强化学习中文教程,在线阅读地址:https://datawhalechina.github.io/easy-rl/
FireDetection_EfficientDet
use Efficientdet to detect objects in images within own customed dataset.
myFace
包括百度人脸搜索+检测,opencv人脸采集、训练和检测
PASTS
Codes for PASTS: Progress-aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation
SEAT
VLN-GOAT
Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)
VLN-MAGIC
This is the official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-Language Navigation
VoiceRecognition
Use BD API to recognize people's voice
CrystalSixone's Repositories
CrystalSixone/FireDetection_EfficientDet
use Efficientdet to detect objects in images within own customed dataset.
CrystalSixone/AssignmentProblem_with_Hungarian_and_AntColonyAlgorithm
Using Hungarian Algorithm and Ant Colony Algorithm to deal with balanced (m==n) and unbalanced (m<n) assignment problems.
CrystalSixone/myFace
包括百度人脸搜索+检测,opencv人脸采集、训练和检测
CrystalSixone/awesome-vln
A curated list of research papers in Vision-Language Navigation (VLN)
CrystalSixone/CVPR2021-Papers-with-Code
CVPR 2021 论文和开源项目合集
CrystalSixone/Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
CrystalSixone/DEKR
This is an official implementation of our CVPR 2021 paper "Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression" (https://arxiv.org/abs/2104.02300)
CrystalSixone/examples-1
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
CrystalSixone/Fine-Grained-R2R
Code and data of the Fine-Grained R2R Dataset proposed in paper Sub-Instruction Aware Vision-and-Language Navigation
CrystalSixone/HRNet-Image-Classification
Train the HRNet model on ImageNet
CrystalSixone/mmcv
OpenMMLab Computer Vision Foundation
CrystalSixone/mmdetection
OpenMMLab Detection Toolbox and Benchmark
CrystalSixone/mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
CrystalSixone/nndl.github.io
《神经网络与深度学习》 邱锡鹏著 Neural Network and Deep Learning
CrystalSixone/object_relation_transformer
Implementation of the Object Relation Transformer for Image Captioning
CrystalSixone/OpenAI-CLIP
Simple implementation of OpenAI CLIP model in PyTorch.
CrystalSixone/R2R-EnvDrop
PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"
CrystalSixone/Recurrent-VLN-BERT
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
CrystalSixone/Res2Net-PretrainedModels
(ImageNet pretrained models) The official pytorch implemention of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"
CrystalSixone/RL-based-Graph2Seq-for-NQG
Code & data accompanying the ICLR 2020 paper "Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation"
CrystalSixone/seetaFace6Python
简单、快速搞定人脸识别应用,觉得有帮助,给个start吧!
CrystalSixone/self-critical.pytorch
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
CrystalSixone/speaker_follower
Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.
CrystalSixone/tornado
Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
CrystalSixone/Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
CrystalSixone/UDP-Pose
Official code of The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation
CrystalSixone/UniVL
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
CrystalSixone/vln-bert
Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)
CrystalSixone/VLN-CE
Vision-and-Language Navigation in Continuous Environments using Habitat
CrystalSixone/YOLOF