Idate96

Idate96's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language: Python, 70.9k stars, 8.4k forks
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Language: Python, 6.3k stars, 626 forks
ikostrikov/walk_in_the_park
Language:Python24433
NVlabs/MinVIS
Language:Python26523
facebookresearch/msn
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
Language:Python44933
carbon-language/carbon-lang
Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)
Language: C++, 32.3k stars, 1.5k forks
wjf5203/VNext
Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer (ECCV Oral), and IDOL (ECCV Oral)
Language:Python60253
hkchengrex/MiVOS
[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!
Language:Python46964
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Language: Python, 1.8k stars, 193 forks
MasterBin-IIAU/Unicorn
[ECCV'22 Oral] Towards Grand Unification of Object Tracking
Language:Python95087
lucidrains/mixture-of-experts
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
Language:Python63849
libigl/libigl
Simple MPL-2.0-licensed C++ geometry processing library.
Language: C++, 4.6k stars, 1.1k forks
chaytonmin/Occupancy-MAE
Official implementation of our TIV'23 paper: Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders
Language:Python25118
CodedotAl/gpt-code-clippy
Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57
Language: Python, 3.3k stars, 224 forks
clementchadebec/benchmark_VAE
Unifying Variational Autoencoder (VAE) implementations in Pytorch (NeurIPS 2022)
Language: Python, 1.8k stars, 163 forks
facebookresearch/omnivore
Omnivore: A Single Model for Many Visual Modalities
Language:Python55939
leggedrobotics/open3d_slam
Pointcloud-based graph SLAM written in C++ using open3D library.
Language:C++51251
facebookresearch/detr
End-to-End Object Detection with Transformers
Language: Python, 13.6k stars, 2.4k forks
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language: Python, 8.9k stars, 557 forks
mit-han-lab/bevfusion
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Language: Python, 2.3k stars, 420 forks
facebookresearch/pytorch3d
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
Language: Python, 8.8k stars, 1.3k forks
traveller59/spconv
Spatial Sparse Convolution Library
Language: Python, 1.9k stars, 365 forks
sshaoshuai/PV-RCNN
PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection, CVPR 2020.
17814
ADLab-AutoDrive/BEVFusion
Official PyTorch implementation of "BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework"
Language:Python751102
patrick-kidger/equinox
Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/
Language: Python, 2.1k stars, 142 forks
lucidrains/PaLM-jax
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
Language:Python18512
lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Language: Python, 8.1k stars, 767 forks
facebookresearch/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Language: Python, 6.6k stars, 1.2k forks
implus/UM-MAE
Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
Language:Jupyter Notebook23821
google/ml_collections
ML Collections is a library of Python Collections designed for ML use cases.
Language:Python89340