G7b9
As a passionate coding enthusiast, I, Erich Bouch, excel in Python. I eagerly participate in competitions and aspire to make an impact in the tech world.
Pinned Repositories
-
ACE_plus
albumentations
Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
alpaca-lora
微调 Lora,Instruct-tune LLaMA on consumer hardware
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
AnomalyGPT
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
awesome-computer-vision
A curated list of awesome computer vision resources
LMChineseChess
这是一个中国象棋阿尔法猪
SOLOv2.tensorRT
SOLOv2 on onnx & tensorRT
G7b9's Repositories
G7b9/ACE_plus
G7b9/ChinaTextbook
所有小初高、大学PDF教材。
G7b9/Comfy-WaveSpeed
[WIP] The all in one inference optimization solution for ComfyUI, universal, flexible, and fast.
G7b9/ComfyUI-Crystools
A powerful set of tools for ComfyUI
G7b9/ComfyUI-HunyuanVideoWrapper
G7b9/ComfyUI-LatentSyncWrapper
This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audio input.
G7b9/ComfyUI-TeaCache
G7b9/ComfyUI-WanVideoStartEndFrames
Start and end frames video generation nodes based on the modified Kijai version Wan2.1 nodes
G7b9/ComfyUI_CosyVoice
ComfyUI for CosyVoice
G7b9/ComfyUI_InfiniteYou
An implementation for InfiniteYou
G7b9/Comfyui_Redux_Advanced
Redux style adds more controls
G7b9/ComfyUI_Sonic
Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI
G7b9/ComfyUI_YuE
YuE is a groundbreaking series of open-source foundation models designed for music generation, specifically for transforming lyrics into full songs (lyrics2song). you can use it in comfyUI
G7b9/deep-research
My own open source implementation of OpenAI's new Deep Research agent. Get the same capability without paying $200. You can even tweak the behavior of the agent with adjustable breadth and depth. Run it for 5 min or 5 hours, it'll auto adjust.
G7b9/deep-research-web-ui
(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.
G7b9/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
G7b9/ganloss-latent-space
有趣的80后程序员的工作流分享
G7b9/InfiniteYou
🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
G7b9/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
G7b9/Magic-1-For-1
G7b9/MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
G7b9/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
G7b9/OpenDeepResearcher
G7b9/OpenManus
No fortress, purely open ground. OpenManus is Coming.
G7b9/Qwen2.5-Omni
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
G7b9/rgthree-comfy
Making ComfyUI more comfortable!
G7b9/sd-scripts
G7b9/SkyReels-A1
SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers
G7b9/UI-TARS-desktop
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.
G7b9/yolov12
YOLOv12: Attention-Centric Real-Time Object Detectors