mkygogo

mkygogo's Stars

googlesamples/arcore-depth-lab
ARCore Depth Lab is a set of Depth API samples that provides assets using depth for advanced geometry-aware features in AR interaction and rendering. (UIST 2020)
Language:C#817154
icosa-foundation/open-blocks
Open Blocks is the open source, community led evolution of Google Blocks!
Language:C#886
Siccity/xNode
Unity Node Editor: Lets you view and edit node graphs inside Unity
Language:C#3.5k609
cemuka/UnityRuntimeNodeEditor
Unity runtime node editor using with Unity UI.
Language:C#43264
angristan/openvpn-install
Set up your own OpenVPN server on Debian, Ubuntu, Fedora, CentOS, Arch Linux and more
Language:Shell14.4k3.1k
BinNong/meet-libai
李白 :bust_in_silhouette: 作为唐代杰出诗人，其诗歌作品在**文学史上具有重要地位。近年来，随着数字技术和人工智能的快速发展，传统文化普及推广的形式也面临着创新与变革。国内外对于李白诗歌的研究虽已相当深入，但在数字化、智能化普及方面仍存在不足。因此，本项目旨在通过构建李白知识图谱，结合大模型训练出专业的AI智能体，以生成式对话应用的形式，推动李白文化的普及与推广。
Language:Python1.6k202
roboflow/supervision
We write your reusable computer vision tools. 💜
Language:Python26.4k2k
UnblockNeteaseMusic/server
Revive unavailable songs for Netease Cloud Music (Refactored & Enhanced version)
Language:JavaScript6.8k658
BasedHardware/OpenGlass
Turn any glasses into AI-powered smart glasses
Language:C3.6k461
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Language:Python10.6k1.1k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python35.6k3.9k
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language:Python11.3k917
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
Language:Python135k10.3k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook14.8k1.2k
modelscope/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Language:Python4.4k497
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python9.3k944
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Language:Python10.1k862
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python2.4k497
Fictionarry/ER-NeRF
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
Language:Python1.2k141
lipku/LiveTalking
Real time interactive streaming digital human
Language:Python5.1k760
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python7.4k570
livekit/agents
A powerful framework for building realtime voice AI agents 🤖🎙️📹
Language:Python5.4k733
agno-agi/agno
A lightweight library for building Multimodal Agents. Give LLMs superpowers like memory, knowledge, tools and reasoning.
Language:Python23.8k3k
fishaudio/fish-speech
SOTA Open Source TTS
Language:Python20.4k1.6k
unslothai/unsloth
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Language:Python36.5k2.8k
yolain/ComfyUI-Easy-Use
In order to make it easier to use the ComfyUI, I have made some optimizations and integrations to some commonly used nodes.
Language:Python1.5k105
ejoy/vaststars
A game demo for Ant engine
Language:Lua57864
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language:Python73.3k8k
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook15.6k1.5k
aceway/weixin-dyh-ai
一个支持将微信订阅号接入AI的后台管理系统。
Language:Python1508