hotelll

Sing like there's nobody listening

Shanghai Jiao Tong UniversityShanghai, China

hotelll's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python71.5k 576 08.5k
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook47.7k 308 6715.6k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python35.5k 346 2.8k4.1k
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Language:Python7.9k 183 2901.9k
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Language:Python6.5k 62 140482
isl-org/MiDaS
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
Language:Python4.5k 73 242626
shibing624/text2vec
text2vec, text to vector. 文本向量表征工具，把文本转化为向量矩阵，实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型，开箱即用。
Language:Python4.5k 31 150401
fundamentalvision/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Language:Python3.4k 72 269548
EvanNotFound/hexo-theme-redefine
Simplicity in Speed, Purity in Design. Redefine Your Hexo Journey.
Language:JavaScript1.5k 9 248124
XPoet/hexo-theme-keep
:rainbow: A simple and light theme for Hexo. It makes you more focused on writing.
Language:Stylus1.3k 55 240181
nv-tlabs/lift-splat-shoot
Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)
Language:Python1.1k 17 51221
ascust/3DMM-Fitting-Pytorch
A 3DMM fitting framework using Pytorch.
Language:Python603 13 3195
NVlabs/Dancing2Music
Language:Python532 42 2585
google-research/mint
Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.
Language:Python507 14 6187
segments-ai/panoptic-segment-anything
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
Language:Jupyter Notebook384 8 525
guicho271828/aaai-template
latex template for various conferences, as well as wise-man's overleaf (overleaf is terrible!)
Language:TeX167 4 438
wkcn/MobulaOP
A Simple & Flexible Cross Framework Operators Toolkit
Language:Python164 10 2821
google-research/soft-dtw-divergences
An implementation of soft-DTW divergences.
Language:Python131 9 217
HanGuangXin/ByteTrack_ReID
ByteTrack with ReID module following the paradigm of FairMOT, tracking strategy is borrowed from FairMOT/JDE.
Language:Python107 3 2915
airockchip/RK3399Pro_npu
Language:C++72 2 816
tderflinger/vue-audio-tapir
Audio recorder component for Vue.js 3. It enables to record, play and send audio messages to a server.
Language:Vue51 3 1117
tobyperrett/few-shot-action-recognition
Implementations of some few-shot action recognition methods.
Language:Python42 4 85
svip-lab/WeakSVR
(CVPR 2023) Official implemention of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos"
Language:Python27 2 54
GGQ1996/action_co_localization
Language:Python21 2 03
BingSu12/RVSML
Learning Distance for Sequences by Learning a Ground Metric
Language:MATLAB10 0 07
BingSu12/TAP
Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification
Language:Python10 1 12
weizheliu/VAVA
Code for the paper "Learning to Align Sequential Actions in the Wild"(CVPR 2022)
Language:Python9 2 10
FuxiCV/music-to-dance
8 2 01
xuan301/BMMDet_MPDSet
Language:Python6 1 00
hang1017/amap_demo
高德地图使用初级教程
Language:TypeScript4 1 00

hotelll

hotelll's Stars

openai/whisper

facebookresearch/segment-anything

microsoft/DeepSpeed

nl8590687/ASRT_SpeechRecognition

gaomingqi/Track-Anything

isl-org/MiDaS

shibing624/text2vec

fundamentalvision/BEVFormer

EvanNotFound/hexo-theme-redefine

XPoet/hexo-theme-keep

nv-tlabs/lift-splat-shoot

ascust/3DMM-Fitting-Pytorch

NVlabs/Dancing2Music

google-research/mint

segments-ai/panoptic-segment-anything

guicho271828/aaai-template

wkcn/MobulaOP

google-research/soft-dtw-divergences

HanGuangXin/ByteTrack_ReID

airockchip/RK3399Pro_npu

tderflinger/vue-audio-tapir

tobyperrett/few-shot-action-recognition

svip-lab/WeakSVR

GGQ1996/action_co_localization

BingSu12/RVSML

BingSu12/TAP

weizheliu/VAVA

FuxiCV/music-to-dance

xuan301/BMMDet_MPDSet

hang1017/amap_demo