agoryuno

agoryuno's Stars

atamsingh/comp2401
C Programming with Christine Landerau
Language:C1811
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
Language:Python2.3k225
agoryuno/adet_layers
Extracts the compiled portion of the DeepSolo model's code
Language:Cuda1
agoryuno/dconfig
Lightweight version of Detectron2's config package, stripped of all superfluous requirements
Language:Python1
leedrake5/Russia-Ukraine
Equipment Loss Tracking
Language:R62725
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
Language:Python1.2k50
agoryuno/DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Text Spotting"
Language:Python1
agoryuno/deepsolo-onnx
An ONNX exporter fot the DeepSolo scene text recognition model
Language:Python3
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook35.6k4.2k
deepchecks/deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
Language:Python3.6k250
MichalBusta/E2E-MLT
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
Language:C++29184
ankush-me/SynthText
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
Language:Python2k621
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Language:Python7.6k756
ViTAE-Transformer/DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting"
Language:Python24133
ymy-k/DPText-DETR
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
Language:Python17022
D641593/MixNet
Language:Python639
adaptech-cz/Tesseract4Android
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
Language:C725114
roboflow/supervision
We write your reusable computer vision tools. 💜
Language:Python22.9k1.7k
SqueezeAILab/SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Language:Python63342
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python13.6k1.3k
atong01/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
Language:Python1.1k85
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
Language:Python2.5k198
agoryuno/autobrowser
Firefox in a docker container with a control API
Language:JavaScript1
MishaLaskin/vqvae
A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)
Language:Jupyter Notebook61474
agoryuno/gpt_monkey
A Flask service to allow API access to ChatGPT in a browser
Language:JavaScript1
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Jupyter Notebook13k1.8k
berenslab/pubmed-landscape
The landscape of biomedical research
Language:Jupyter Notebook11310
heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
Language:Python46767
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook47k5.6k
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Language:C++69.7k7.6k