dsotomay's Stars
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
facebookresearch/fastText
Library for fast text representation and classification.
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
facebookresearch/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
facebookresearch/hiplot
HiPlot makes understanding high dimensional data easy
facebookresearch/consistent_depth
We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.
facebookresearch/TimeSformer
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
facebookresearch/localrf
An algorithm for reconstructing the radiance field of a large-scale scene from a single casually captured video.
facebookresearch/hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
facebookresearch/av_hubert
A self-supervised learning framework for audio-visual speech
facebookresearch/vizseq
An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)
facebookresearch/muavic
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation
facebookresearch/LLM-QAT
Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"
facebookresearch/online-dt
Online Decision Transformer
facebookresearch/PyTouch
PyTouch is a machine learning library for tactile touch sensing.
facebookresearch/vocoder-benchmark
A repository for benchmarking neural vocoders by their quality and speed.
facebookresearch/CT2Hair
This is the official implementation of CT2Hair High-fidelity 3D Hair Modeling Using Computed Tomography.
facebookresearch/dynamic_stereo
[CVPR 2023] DynamicStereo: Consistent Dynamic Depth from Stereo Videos.
facebookresearch/iopath
A python library that provides common I/O interface across different storage backends.
facebookresearch/impact-driven-exploration
impact-driven-exploration
facebookresearch/adversarially-motivated-intrinsic-goals
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
facebookresearch/controllable_agent
The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning or fine-tuning. Training is reward-free and based on the Forward-Backward representation.
facebookresearch/SyncMatch
Self-supervised Correspondence Estimation via Multiview Registration
facebookresearch/HierVL
[CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings
facebookresearch/novel-view-acoustic-synthesis
Code for Novel View Acoustic Synthesis paper
facebookresearch/AdaTT
pytorch open-source library for the paper "AdaTT Adaptive Task-to-Task Fusion Network for Multitask Learning in Recommendations"
facebookresearch/HairMSNN
This is the official implementation of our EGSR 2023 paper, Accelerating Hair Rendering by Learning High-Order Scattered Radiance.
facebookresearch/agenthive
AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL.