Pinned Repositories
CRAFT
Re-implementation of CRAFT's data pre-processing for text detection
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
dotieuthien
Event-Info-Extraction-From-Flyers
Extract key information (i.e., time and location) about an event from a poster/flyer using digital image processing techniques.
genetic-algorithm-for-assignment
Project for Algorithm course
graph-matching-using-quadratic-programming
[ICPR2020] Solving graph matching using the principal eigenvector
linear-regression
Final grade prediction (Kaggle challenge: https://www.kaggle.com/uciml/student-alcohol-consumption)
llama.cpp
Port of Facebook's LLaMA model in C/C++
paddleocr.cpp
PaddleOCR + OnnxRuntime
perspective-transformation
Perspective Transformation for Indoor Image Aesthetic Enhancement
dotieuthien's Repositories
dotieuthien/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
dotieuthien/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
dotieuthien/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
dotieuthien/AnyText
dotieuthien/ComfyUI
The most powerful and modular stable diffusion GUI with a graph/nodes interface.
dotieuthien/ComfyUI-GGUF
GGUF Quantization support for native ComfyUI models
dotieuthien/ComfyUI-IDM-VTON
ComfyUI adaptation of IDM-VTON for virtual try-on.
dotieuthien/ComfyUI-OOTDiffusion
ComfyUI custom node that simply integrates OOTDiffusion.
dotieuthien/consistencydecoder
Consistency Distilled Diff VAE
dotieuthien/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
dotieuthien/Fooocus
Focus on prompting and generating
dotieuthien/headshots-starter
dotieuthien/InstantStyle
dotieuthien/krita-ai-diffusion
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
dotieuthien/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
dotieuthien/llm.c
LLM training in simple, raw C/CUDA
dotieuthien/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
dotieuthien/minRF
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
dotieuthien/nebuly
The user analytics platform for LLMs
dotieuthien/Open-Sora
Building your own video generation model like OpenAI's Sora
dotieuthien/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model), but we have only limited resources. We sincerely hope the whole open-source community can contribute to this project.
dotieuthien/OpenCOLE
dotieuthien/parler-tts
Inference and training library for high-quality TTS models.
dotieuthien/sd-scripts
dotieuthien/speech-to-speech
dotieuthien/stable-diffusion-webui
Stable Diffusion web UI
dotieuthien/stable-diffusion-webui-forge
dotieuthien/stable-diffusion.cpp
Stable Diffusion in pure C/C++
dotieuthien/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
dotieuthien/TinyChatEngine
TinyChatEngine: On-Device LLM Inference Library