minienglish1

minienglish1's Stars

google-research/google-research
Google Research
Language:Jupyter Notebook34.5k 755 1.3k8k
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python21.1k 210 3922.2k
qarmin/czkawka
Multi functional app to find duplicates, empty folders, similar images etc.
Language:Rust20.7k 126 859668
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language:Python9k 63 1.5k575
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Language:Python4.5k 64 144390
philz1337x/clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
Language:Python4k 31 47413
layerdiffusion/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
Language:Python3.7k 37 90324
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Language:Jupyter Notebook3k 28 160278
KichangKim/DeepDanbooru
AI based multi-label girl image classification system, implemented by using TensorFlow.
Language:Python2.7k 38 95260
Bionus/imgbrd-grabber
Very customizable imageboard/booru downloader with powerful filenaming features.
Language:HTML2.6k 101 3.1k220
toriato/stable-diffusion-webui-wd14-tagger
Labeling extension for Automatic1111's Web UI
Language:Python1.3k 9 91236
ermig1979/AntiDupl
A program to search similar and defect pictures on the disk
Language:C#1.3k 40 17194
derrian-distro/LoRA_Easy_Training_Scripts
A UI made in Pyside6 to make training LoRA/LoCon and other LoRA type models in sd-scripts easy
Language:Python1.1k 13 226103
kevinhendricks/KindleUnpack
python based software to unpack Amazon / Kindlegen generated ebooks
Language:Python981 35 30106
jiayev/GPT4V-Image-Captioner
Language:Python797 13 5358
pythongosssss/ComfyUI-WD14-Tagger
A ComfyUI extension allowing for the interrogation of booru tags from images.
Language:Python684 8 7373
mosaicml/diffusion
Language:Python681 16 5971
picobyte/stable-diffusion-webui-wd14-tagger
Labeling extension for Automatic1111's Web UI
Language:Python610 3 8975
snap-research/Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Language:Python543 11 4720
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
Language:Python280 7 1525
mihirp1998/AlignProp
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Language:Python247 6 158
HybridShivam/Pokemon
The highest quality Pokemon Images and Assets.
Language:Python182 6 1053
johnoneil/MangaTextDetection
Experiments in text localization and detection in raw manga scans. Mostly using OpenCV python API.
Language:Python167 20 250
djghosh13/geneval
GenEval: An object-focused framework for evaluating text-to-image alignment
Language:HTML130 1 97
pedrovgs/DeepPanel
Finding a panel inside a comic page is the hardest thing I've ever done in computer science!
Language:Python122 6 311
thu-ml/low-bit-optimizers
Low-bit optimizers for PyTorch
Language:Python121 6 68
SmilingWolf/SW-CV-ModelZoo
Repo for my Tensorflow/Keras CV experiments. Mostly revolving around the Danbooru20xx dataset
Language:Python118 6 78
joshua-stone/DerPyBooru
Python bindings for Derpibooru's API
Language:Python36 13 1415
LexCybermac/smlr
A Simple Image Clustering Script using CLIP and Hierarchial Clustering
Language:Python34 1 13
KutsuyaYuki/WD14Tagger
Automatically tag images with booru tags
Language:Python18 1 23