StevenNeal's Stars
cocos/cocos-engine
Cocos simplifies game creation and distribution with Cocos Creator, a free, open-source, cross-platform game engine. Empowering millions of developers to create high-performance, engaging 2D/3D games and instant web entertainment.
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
gpt-omni/mini-omni
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
jingyaogong/minimind
[Large model] Train a small 26M-parameter GPT completely from scratch in 3 hours; inference and training both run on a personal GPU!
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
wa-lang/wa
The Wa Programming Language
fufankeji/MateGen
Next-Generation Interactive Intelligent Programming Assistant
NexaAI/Awesome-LLMs-on-device
Awesome LLMs on Device: A Comprehensive Survey
megvii-research/megactor
520CCC/AIGenerateCode
Android - Google Play publishing - reskinned clone apps - junk code - obfuscation
om-ai-lab/OmAgent
A multimodal agent framework for solving complex tasks [EMNLP'2024]
Vchitect/Vchitect-2.0
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
guanchuwang/redis-bench
juggleim/im-server
A high-performance IM server.
MingXiangL/DEVIL
Evaluating dynamics capability of T2V generation models with DEVIL protocols.
geek-fun/dockit
Elasticsearch and OpenSearch GUI client for Mac, Windows, and Linux
bio-mlhui/LGRNet
Official code of the MICCAI'24 early-accepted paper "LGRNet: Local-Global Reciprocal Network for Uterine Fibroid Segmentation in Ultrasound Videos"
conallwang/MeGA
The official implementation of "MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing".
ozhang3/asrt
ASRT (Async Runtime): an open-source task scheduling library written in modern C++ and tailored for embedded Linux systems.
DefaultRui/BEV-Scene-Graph
[ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation
mengjiac2023/pfedrec_secretflow
CaptainDra/MaydayDIYPi
An experimental Raspberry Pi project related to Mayday (五月天)
zhangbaijin/From-Redundancy-to-Relevance
Code for the paper "An Information Flow Perspective for Exploring Large Vision Language Models on Reasoning Tasks"
tanbryan/ai-mv-generator
A multi-agent system for generating music videos with scrolling subtitles from lyrics. The project analyzes the lyrics, builds detailed prompts from that analysis to generate story-like images, and produces an image-to-image music video using the OpenAI API with the GPT-4o and DALL-E 3 models.
QianFox/FoxCMS
LeeJarvis996/edsr_project
Fushen-Zhang/ODKL
This repo presents a probabilistic, online forecasting model. It proposes a deep kernel that integrates deep soft Spiking Neural Networks into the Gaussian kernel and applies that kernel to sparse Gaussian Process regression.
whanxueyu/cyberpunk-ui
Planning to build a cyberpunk-style component library
Sma1lboy/WeChatJobInfoBot