cahya-wirawan
System engineer, currently working on NLP, CV and Speech Recognition for fun and curiosity
Vienna, Austria
cahya-wirawan's Stars
Vaibhavs10/optimise-my-whisper
OpenMOSE/RWKV-LM-State-4bit-Orpo
State tuning of RWKV v6 with Orpo can be performed with 4-bit quantization. Every model can be trained with Orpo on a single 24GB GPU!
JL-er/RWKV-PEFT
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
harrisonvanderbyl/rwkv-v5-state-tune
Profluent-AI/OpenCRISPR
AI-generated gene editing systems
theodorblackbird/lina-speech
lina-speech: linear-attention-based text-to-speech
joaomdmoura/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
MatthiasLienhard/flowkey_dl
helper to create sheet music from flowkey songs
Audiveris/audiveris
Latest generation of Audiveris OMR engine
SHI-Labs/Smooth-Diffusion
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models (arXiv 2023 / CVPR 2024)
huggingface/parler-tts
Inference and training library for high-quality TTS models.
jacoblee93/fully-local-pdf-chatbot
Yes, it's another chat over documents implementation... but this one is entirely local!
explodinggradients/ragas
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
AudiogenAI/agc
Audiogen Codec
deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
neobundy/Deep-Dive-Into-AI-With-MLX-PyTorch
"Deep Dive into AI with MLX and PyTorch" is an educational initiative designed to help anyone interested in AI, specifically in machine learning and deep learning, using Apple's MLX and Meta's PyTorch frameworks.
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
argmaxinc/WhisperKit
Swift native on-device speech recognition with Whisper for Apple Silicon
collabora/WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
OpenAccess-AI-Collective/axolotl
Go ahead and axolotl questions
radames/LLM-automator
Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.
SciPhi-AI/synthesizer
A multi-purpose LLM framework for RAG and data creation.
dubverse-ai/MahaTTS