sokazaki's Stars
public-apis/public-apis
A collective list of free APIs
30-seconds/30-seconds-of-code
Short code snippets for all your development needs
openai/openai-cookbook
Examples and guides for using the OpenAI API
TanStack/query
🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
invoke-ai/InvokeAI
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry-leading WebUI and serves as the foundation for multiple commercial products.
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
stanfordnlp/stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
facebookincubator/AITemplate
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
microsoft/torchscale
Foundation Architecture for (M)LLMs
MubertAI/Mubert-Text-to-Music
A simple notebook demonstrating prompt-based music generation via Mubert API
princeton-vl/DROID-SLAM
linkedin/greykite
A flexible, intuitive and fast forecasting library
vahidk/EffectivePyTorch
PyTorch tutorials and best practices.
google/vizier
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
autonomousvision/unimatch
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
NVIDIA-Merlin/NVTabular
NVTabular is a feature engineering and preprocessing library for tabular data, designed to quickly and easily manipulate terabyte-scale datasets used to train deep-learning-based recommender systems.
microsoft/FocalNet
[NeurIPS 2022] Official code for "Focal Modulation Networks"
castorini/daam
Diffusion attentive attribution maps for interpreting Stable Diffusion.
hanoonaR/object-centric-ovd
[NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection".
open-mmlab/mmeval
A unified evaluation library for multiple machine learning libraries
ai-forever/Kandinsky-2.0
Kandinsky 2.0 — multilingual text2image latent diffusion model
deepmind/perception_test
kakaobrain/tcl
Official implementation of TCL (CVPR 2023)
Ben93kie/SeaDronesSee
Vision Benchmark for Maritime Search and Rescue