Idate96's Stars
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
getcursor/cursor
The AI Code Editor
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
NVIDIA/warp
A Python framework for high performance GPU simulation and graphics
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
GoldenCheetah/GoldenCheetah
Performance Software for Cyclists, Runners, Triathletes and Coaches
salesforce/CodeTF
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
facebookresearch/mbrl-lib
Library for Model Based RL
NVlabs/FasterViT
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
nv-tlabs/NKSR
[CVPR 2023 Highlight] Neural Kernel Surface Reconstruction
tonyzhaozh/act
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
RobertTLange/gymnax
RL Environments in JAX 🌍
instadeepai/jumanji
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
salesforce/CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
facebookresearch/eai-vc
The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).
lucidrains/recurrent-memory-transformer-pytorch
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch
mlcommons/algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements in both training algorithms and models.
linuxserver/docker-rdesktop
lucidrains/memformer
Implementation of Memformer, a Memory-augmented Transformer, in Pytorch
vwxyzjn/cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL