hotelll's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
isl-org/MiDaS
Code for robust monocular depth estimation described in "Ranftl et al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
shibing624/text2vec
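text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.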
fundamentalvision/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
EvanNotFound/hexo-theme-redefine
Simplicity in Speed, Purity in Design. Redefine Your Hexo Journey.
XPoet/hexo-theme-keep
A simple and light theme for Hexo. It makes you more focused on writing.
nv-tlabs/lift-splat-shoot
Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)
ascust/3DMM-Fitting-Pytorch
A 3DMM fitting framework using Pytorch.
NVlabs/Dancing2Music
google-research/mint
Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.
segments-ai/panoptic-segment-anything
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
guicho271828/aaai-template
LaTeX template for various conferences, as well as wise-man's overleaf (overleaf is terrible!)
wkcn/MobulaOP
A Simple & Flexible Cross Framework Operators Toolkit
google-research/soft-dtw-divergences
An implementation of soft-DTW divergences.
HanGuangXin/ByteTrack_ReID
ByteTrack with a ReID module following the paradigm of FairMOT; the tracking strategy is borrowed from FairMOT/JDE.
airockchip/RK3399Pro_npu
tderflinger/vue-audio-tapir
Audio recorder component for Vue.js 3. It lets you record, play, and send audio messages to a server.
tobyperrett/few-shot-action-recognition
Implementations of some few-shot action recognition methods.
svip-lab/WeakSVR
(CVPR 2023) Official implementation of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos"
GGQ1996/action_co_localization
BingSu12/RVSML
Learning Distance for Sequences by Learning a Ground Metric
BingSu12/TAP
Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification
weizheliu/VAVA
Code for the paper "Learning to Align Sequential Actions in the Wild" (CVPR 2022)
FuxiCV/music-to-dance
xuan301/BMMDet_MPDSet
hang1017/amap_demo
A beginner's tutorial on using AMap (Gaode Maps)