dsotomay

dsotomay's Stars

facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook48.5k 313 6825.7k
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language:Python31k 390 3.5k 7.6k
facebookresearch/fastText
Library for fast text representation and classification.
Language:HTML26k 846 1.1k 4.7k
facebookresearch/ImageBind
ImageBind: One Embedding Space to Bind Them All
Language:Python8.5k 99 93783
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python6.7k 45 83599
facebookresearch/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Language:Python5.5k 114 657939
facebookresearch/hiplot
HiPlot makes understanding high dimensional data easy
Language:TypeScript2.8k 28 89145
facebookresearch/consistent_depth
We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.
Language:Python1.6k 55 69236
facebookresearch/TimeSformer
The official PyTorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
Language:Python1.6k 30 129218
facebookresearch/localrf
An algorithm for reconstructing the radiance field of a large-scale scene from a single casually captured video.
Language:Python977 20 4660
facebookresearch/hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
Language:Python940 17 3747
facebookresearch/av_hubert
A self-supervised learning framework for audio-visual speech
Language:Python865 15 111138
facebookresearch/vizseq
An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)
Language:Python444 16 1658
facebookresearch/muavic
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation
Language:Python374 13 2532
facebookresearch/LLM-QAT
Code repo for the paper "LLM-QAT: Data-Free Quantization Aware Training for Large Language Models"
Language:Python265 5 3025
facebookresearch/online-dt
Online Decision Transformer
Language:Python246 5 835
facebookresearch/PyTouch
PyTouch is a machine learning library for tactile touch sensing.
Language:Python241 14 1339
facebookresearch/vocoder-benchmark
A repository for benchmarking neural vocoders by their quality and speed.
Language:Python208 19 627
facebookresearch/CT2Hair
This is the official implementation of CT2Hair: High-fidelity 3D Hair Modeling Using Computed Tomography.
Language:Python201 13 810
facebookresearch/dynamic_stereo
[CVPR 2023] DynamicStereo: Consistent Dynamic Depth from Stereo Videos.
Language:Jupyter Notebook193 8 179
facebookresearch/iopath
A Python library that provides a common I/O interface across different storage backends.
Language:Python138 13 1224
facebookresearch/impact-driven-exploration
impact-driven-exploration
Language:Python130 9 727
facebookresearch/adversarially-motivated-intrinsic-goals
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
Language:Python61 14 48
facebookresearch/controllable_agent
The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning or fine-tuning. Training is reward-free and based on the Forward-Backward representation.
Language:Python59 8 45
facebookresearch/SyncMatch
Self-supervised Correspondence Estimation via Multiview Registration
Language:Python57 4 03
facebookresearch/HierVL
[CVPR 2023] HierVL: Learning Hierarchical Video-Language Embeddings
Language:Python45 3 133
facebookresearch/novel-view-acoustic-synthesis
Code for the Novel View Acoustic Synthesis paper
Language:Python44 5 2
facebookresearch/AdaTT
PyTorch open-source library for the paper "AdaTT: Adaptive Task-to-Task Fusion Network for Multitask Learning in Recommendations"
Language:Python42 1 05
facebookresearch/HairMSNN
This is the official implementation of our EGSR 2023 paper, Accelerating Hair Rendering by Learning High-Order Scattered Radiance.
Language:Cuda42 8 24
facebookresearch/agenthive
AgentHive provides the primitives and helpers for seamless use of RoboHive within TorchRL.
Language:Python33 14 134