deep-saket's Stars
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
YvanYin/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
RaymondWang987/NVDS
The official repository of the ICCV2023 paper "Neural Video Depth Stabilizer" (NVDS).
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
niconielsen32/PointClouds
shikras/shikra
TransformerOptimus/SuperAGI
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Deci-AI/super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
arjish/meta-meta-classification
Code for "Meta-Meta Classification for One-Shot Learning"
wbw520/MTUNet
MTUNet: Few-shot Image Classification with Visual Explanations (CVPRW 2021)
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
sicara/easy-few-shot-learning
Ready-to-use code and tutorial notebooks to boost your way into few-shot learning for image classification.
towhee-io/towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
VQAssessment/DOVER
[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.
murtazahassan/OpenCV-Python-Tutorials-and-Projects
An easy-to-follow course on OpenCV using Python for beginners.