gokayfem

Turkey

Pinned Repositories

awesome
😎 Awesome lists about all kinds of interesting topics
00
awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
Language:Markdown180 9 212
CogVLM2
第二代 CogVLM多模态预训练对话模型
Language:Python00
ComfyUI-Depth-Visualization
Depth map applied Image viewer inside ComfyUI
Language:JavaScript49 4 25
ComfyUI-Dream-Interpreter
Dream Interpreter inside ComfyUI
Language:JavaScript66 3 09
ComfyUI-Texture-Simple
Visualize your textures inside ComfyUI
Language:JavaScript30 2 14
ComfyUI_VLM_nodes
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Language:Python298 7 8823
dspy-ollama-colab
dspy with ollama and llamacpp on google colab
Language:Jupyter Notebook1 1 00
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language:Python2 0 00
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python1 0 00

gokayfem's Repositories

gokayfem/ComfyUI_VLM_nodes
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Language:Python298 7 8823
gokayfem/awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
Language:Markdown180 9 212
gokayfem/ComfyUI-Dream-Interpreter
Dream Interpreter inside ComfyUI
Language:JavaScript66 3 09
gokayfem/ComfyUI-Depth-Visualization
Depth map applied Image viewer inside ComfyUI
Language:JavaScript49 4 25
gokayfem/ComfyUI-Texture-Simple
Visualize your textures inside ComfyUI
Language:JavaScript30 2 14
gokayfem/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language:Python2 0 00
gokayfem/dspy-ollama-colab
dspy with ollama and llamacpp on google colab
Language:Jupyter Notebook1 1 00
gokayfem/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python1 0 00
gokayfem/awesome
😎 Awesome lists about all kinds of interesting topics
00
gokayfem/CogVLM2
第二代 CogVLM多模态预训练对话模型
Language:Python00
gokayfem/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Language:Python0 1 00
gokayfem/flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
Language:Cuda0 0 00
gokayfem/lectures
Material for cuda-mode lectures
Language:Jupyter Notebook0 0 00
gokayfem/graph_websearch_agent
Websearch agent built on the LangGraph framework
gokayfem/img2txt-comfyui-nodes
Implements some of the most popular img2txt models on HF into ComfyUI nodes. Uses questions/conditional-prompts to get descriptions that are suited for being fed back into a txt2img node.
gokayfem/Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
Language:Python0 0
gokayfem/siglip
Projects based on SigLIP (Zhai et. al, 2023) and Hugging Face transformers integration 🤗
gokayfem/Vitron
A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

gokayfem

Pinned Repositories

awesome

awesome-vlm-architectures

CogVLM2

ComfyUI-Depth-Visualization

ComfyUI-Dream-Interpreter

ComfyUI-Texture-Simple

ComfyUI_VLM_nodes

dspy-ollama-colab

HunyuanDiT

Video-LLaMA

gokayfem's Repositories

gokayfem/ComfyUI_VLM_nodes

gokayfem/awesome-vlm-architectures

gokayfem/ComfyUI-Dream-Interpreter

gokayfem/ComfyUI-Depth-Visualization

gokayfem/ComfyUI-Texture-Simple

gokayfem/HunyuanDiT

gokayfem/dspy-ollama-colab

gokayfem/Video-LLaMA

gokayfem/awesome

gokayfem/CogVLM2

gokayfem/DeepSeek-VL

gokayfem/flash-attention-minimal

gokayfem/lectures

gokayfem/graph_websearch_agent

gokayfem/img2txt-comfyui-nodes

gokayfem/Reka-Torch

gokayfem/siglip

gokayfem/Vitron