mkygogo's Stars
googlesamples/arcore-depth-lab
ARCore Depth Lab is a set of Depth API samples that provides assets using depth for advanced geometry-aware features in AR interaction and rendering. (UIST 2020)
icosa-foundation/open-blocks
Open Blocks is the open source, community led evolution of Google Blocks!
Siccity/xNode
Unity Node Editor: Lets you view and edit node graphs inside Unity
cemuka/UnityRuntimeNodeEditor
Unity runtime node editor using with Unity UI.
angristan/openvpn-install
Set up your own OpenVPN server on Debian, Ubuntu, Fedora, CentOS, Arch Linux and more
BinNong/meet-libai
李白 :bust_in_silhouette: 作为唐代杰出诗人,其诗歌作品在**文学史上具有重要地位。近年来,随着数字技术和人工智能的快速发展,传统文化普及推广的形式也面临着创新与变革。国内外对于李白诗歌的研究虽已相当深入,但在数字化、智能化普及方面仍存在不足。因此,本项目旨在通过构建李白知识图谱,结合大模型训练出专业的AI智能体,以生成式对话应用的形式,推动李白文化的普及与推广。
roboflow/supervision
We write your reusable computer vision tools. 💜
UnblockNeteaseMusic/server
Revive unavailable songs for Netease Cloud Music (Refactored & Enhanced version)
BasedHardware/OpenGlass
Turn any glasses into AI-powered smart glasses
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
2noise/ChatTTS
A generative speech model for daily dialogue.
guoyww/AnimateDiff
Official implementation of AnimateDiff.
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
modelscope/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Fictionarry/ER-NeRF
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
lipku/LiveTalking
Real time interactive streaming digital human
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
livekit/agents
A powerful framework for building realtime voice AI agents 🤖🎙️📹
agno-agi/agno
A lightweight library for building Multimodal Agents. Give LLMs superpowers like memory, knowledge, tools and reasoning.
fishaudio/fish-speech
SOTA Open Source TTS
unslothai/unsloth
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥
yolain/ComfyUI-Easy-Use
In order to make it easier to use the ComfyUI, I have made some optimizations and integrations to some commonly used nodes.
ejoy/vaststars
A game demo for Ant engine
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
KindXiaoming/pykan
Kolmogorov Arnold Networks
aceway/weixin-dyh-ai
一个支持将微信订阅号接入AI的后台管理系统。