zer0int

I 💙 CLIP ~ 🤓🫶🤖

Pinned Repositories

CLIP-Direct-Ascent-Synthesis
Like a CLIP + VQGAN. Except without a VQGAN.
Language:Python6 1 01
CLIP-fine-tune
Fine-tuning code for CLIP models
Language:Python259 4 1617
CLIP-fine-tune-registers-gated
Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!
Language:Python47 1 01
CLIP-Interrogator-LongCLIP-hallucinwords
CLIP Interrogator, fully in HuggingFace Transformers 🤗, with LongCLIP & CLIP's own words and / or *your* own words!
Language:Python17 2 01
CLIP-ViT-visualization
What do CLIP Vision Transformers learn? Feature Visualization can show you!
Language:Python14 1 11
CLIP-XAI-GUI
CLIP GUI - XAI app ~ explainable (and guessable) AI with ViT & ResNet models
Language:Python21 1 22
ComfyUI-HunyuanVideo-Nyan
Text Encoders finally matter 🤖🎥 - scale CLIP & LLM influence! + a Nerdy Transformer Shuffle node
Language:Python68 4 153
ComfyUI-Long-CLIP
ComfyUI implementation of Long-CLIP, including Flux node: LongCLIPTextEncodeFlux
Language:Python57 1 01
ComfyUI-Nuke-a-Text-Encoder
For SDXL, SD1.5, Flux. Nuke T5 and let CLIP guide Flux.1 on its own! Or let let random guide Flux.1! Or load a CLIP crazy opinion embedding about your image and let that guide the AI!
Language:Python24 2 21
Long-CLIP
Scripts for use with LongCLIP, including fine-tuning Long-CLIP
Language:Python62 2 02

zer0int's Repositories

zer0int/CLIP-fine-tune
Fine-tuning code for CLIP models
Language:Python259 4 1617
zer0int/ComfyUI-HunyuanVideo-Nyan
Text Encoders finally matter 🤖🎥 - scale CLIP & LLM influence! + a Nerdy Transformer Shuffle node
Language:Python68 4 153
zer0int/Long-CLIP
Scripts for use with LongCLIP, including fine-tuning Long-CLIP
Language:Python62 2 02
zer0int/ComfyUI-Long-CLIP
ComfyUI implementation of Long-CLIP, including Flux node: LongCLIPTextEncodeFlux
Language:Python57 1 01
zer0int/CLIP-fine-tune-registers-gated
Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!
Language:Python47 1 01
zer0int/ComfyUI-Nuke-a-Text-Encoder
For SDXL, SD1.5, Flux. Nuke T5 and let CLIP guide Flux.1 on its own! Or let let random guide Flux.1! Or load a CLIP crazy opinion embedding about your image and let that guide the AI!
Language:Python24 2 21
zer0int/CLIP-Interrogator-LongCLIP-hallucinwords
CLIP Interrogator, fully in HuggingFace Transformers 🤗, with LongCLIP & CLIP's own words and / or *your* own words!
Language:Python17 2 01
zer0int/CLIP-SAE-finetune
Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.
Language:Python17 2 05
zer0int/ComfyUI-LLama3-Layer-Shuffle-Prompting
Shuffle LLama-3.2 layers and have it prompt an image. Node works with any model - Flux, SD3, SDXL...
Language:Python10 2 02
zer0int/ComfyUI-CLIP-Flux-Layer-Shuffle
Comfy Nodes (and a CLI script) for shuffling around layers in transformer models, creating a curious confusion.
Language:Python9 1 00
zer0int/CLIP-Direct-Ascent-Synthesis
Like a CLIP + VQGAN. Except without a VQGAN.
Language:Python6 1 01
zer0int/CLIP-gradient-ascent-embeddings
Use CLIP to create matching texts + embeddings for given images; useful for XAI, adversarial training
Language:Python6 1 0
zer0int/bitsandbytes-windows
Windows bitsandbytes 0.46.0 for Python 3.10, CUDA 12.8
5 1 0
zer0int/ComfyUI-GPT2-Layer-Shuffle-Prompting
Shuffle GPT-2's layers and have it prompt an image. Node works with any model - Flux, SD3, SDXL...
Language:Python5 1 0
zer0int/CLIP-Layer-Deck-Shuffle
Experimental removal / shuffling of layers in CLIP ViT + Text Transformer
Language:Python4 2 01
zer0int/ComfyUI-workflows
Workflows to implement fine-tuned CLIP Text Encoders with ComfyUI / SD, SDXL, SD3
4 2 0
zer0int/CLIP-tokenizer
A simple CLIP tokenizer encode / decode script. For when you need to know CLIP's tokenization and / or token IDs for some reason.
Language:Python3 1 0
zer0int/CLIPInversion
What do we learn from inverting CLIP models? And what does a CLIP 'see' in an image?
Language:Python3 1 00
zer0int/CLIP-attention-entropy
A small script for CLIP attn entropy plots
Language:Python2 1 0
zer0int/CLIP-test-time-registers
Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"
Language:Jupyter Notebook2
zer0int/GPT-OSS-20B-Windows-16GB-RTX4090
No Hopper architecture (RTX 5090, etc.) required! <16 GB VRAM, Windows.
Language:Python2 0 0
zer0int/Inf-CLIP
Geometric Parametrization GmP-Inf-CLIP modification of: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training scheme.
Language:Python2 0 0
zer0int/OpenVision
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning. + A closer look in PyTorch!
Language:Python2
zer0int/CLIP-HeadHunter
Head-Hunter: A Visual Bias Explorer. Attention Head Max Visualization to find, rank, and visualize heads; map bias; see what a CLIP 'sees'.
Language:Python1
zer0int/CLIP-ResNet-classic-DeepDream
Classic original Inception style DeepDream, but with CLIP ResNet. And CLIP ViT for comparison.
Language:Python1
zer0int/CLIP-vs-human-cosine-similarity-game
How much do YOU align to an AI / CLIP?
Language:Python1
zer0int/LLMorse
Talk Morse code to multimodal LLM using your voice. Beep-boop!
Language:Python1
zer0int/plop-for-CLIP
PLoP applied to CLIP
Language:Python1
zer0int/ComfyUI-HunyuanVideoWrapper
Language:Python0 0
zer0int/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python