agoryuno's Stars
atamsingh/comp2401
C Programming with Christine Landerau
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
agoryuno/adet_layers
Extracts the compiled portion of the DeepSolo model's code
agoryuno/dconfig
Lightweight version of Detectron2's config package, stripped of all superfluous requirements
leedrake5/Russia-Ukraine
Equipment Loss Tracking
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
agoryuno/DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Text Spotting"
agoryuno/deepsolo-onnx
An ONNX exporter fot the DeepSolo scene text recognition model
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
deepchecks/deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
MichalBusta/E2E-MLT
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
ankush-me/SynthText
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
ViTAE-Transformer/DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting"
ymy-k/DPText-DETR
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
D641593/MixNet
adaptech-cz/Tesseract4Android
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
roboflow/supervision
We write your reusable computer vision tools. 💜
SqueezeAILab/SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
atong01/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
agoryuno/autobrowser
Firefox in a docker container with a control API
MishaLaskin/vqvae
A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)
agoryuno/gpt_monkey
A Flask service to allow API access to ChatGPT in a browser
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
berenslab/pubmed-landscape
The landscape of biomedical research
heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.