yaochengrong's Stars
mybigday/whisper.rn
React Native binding of whisper.cpp.
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
m-schuetz/compute_rasterizer
Rendering Point Clouds with Compute Shaders
AlexKashi/AlphaHoldem
A Deep Reinforcement Learning Approach to Texas Holdem
BoomingTech/Piccolo
Piccolo (formerly Pilot) – mini game engine for games104
jbush001/NyuziProcessor
GPGPU microprocessor architecture
jac99/FootAndBall
FootAndBall: Integrated player and ball detector
jfzhang95/pytorch-video-recognition
PyTorch implementations of C3D, R3D, and R2Plus1D models for video activity recognition.
hsharma35/dnnweaver2
Open Source Specialized Computing Stack for Accelerating Deep Neural Networks.
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
ZipCPU/eth10g
10Gb Ethernet Switch
ShichenLiu/SoftRas
Project page of paper "Soft Rasterizer: A Differentiable Renderer for Image-based 3D Reasoning"
google/CFU-Playground
Want a faster ML processor? Do it yourself! -- A framework for playing with custom opcodes to accelerate TensorFlow Lite for Microcontrollers (TFLM). Online tutorial: https://google.github.io/CFU-Playground/ For reference docs, see the link below.
GPUPeople/cuRE
SoccerNet/sn-gamestate
[CVPRW'24] SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
RunyiYang/SUNDAE
The implementation of SUNDAE: Spectrally Pruned Gaussian Fields with Neural Compensation
NVlabs/RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
2471023025/RALM_Survey
This is a repository of RALM surveys containing a summary of state-of-the-art RAG and other technologies
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
CompVis/stable-diffusion
A latent text-to-image diffusion model
chuanyangjin/fast-DiT
Fast Diffusion Models with Transformers
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.