Pinned Repositories
speech-model-hub
A simple interface for serving speech models trained on Bangla datasets
envy-rs
Generates example files for your configurations
etl-pyspark-airflow
Example of an ETL pipeline with PySpark and Airflow
IReLU-Demo
A Simple Way to Initialize Recurrent Networks of ReLU
spell-magic
Transformer Based Seq2Seq Model for Bangla Spell Correction
texy
Texy: A conservative text processing library
tutorials
Random tutorials on random topics
mdmmn378's Repositories
mdmmn378/AppFlowy
Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.
mdmmn378/berty
Berty is a secure peer-to-peer messaging app that works with or without internet access, cellular data or trust in the network
mdmmn378/browser-use
Make websites accessible for AI agents
mdmmn378/celery-beat-rs
An alternative language agnostic implementation of Celery Beat
mdmmn378/datafusion
Apache DataFusion SQL Query Engine
mdmmn378/EfficientWord-Net
OneShot Learning-based hotword detection.
mdmmn378/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
mdmmn378/flux
Official inference repo for FLUX.1 models
mdmmn378/gitpod
The developer platform for on-demand cloud development environments to create software faster and more securely.
mdmmn378/hyperswitch
An open source payments switch written in Rust to make payments fast, reliable and affordable
mdmmn378/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
mdmmn378/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
mdmmn378/localsend
An open-source cross-platform alternative to AirDrop
mdmmn378/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
mdmmn378/metavoice-src
Foundational model for human-like, expressive TTS
mdmmn378/mitmproxy
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
mdmmn378/OpenVoice
Instant voice cloning by MIT and MyShell.
mdmmn378/penpot
Penpot: The open-source design tool for design and code collaboration
mdmmn378/phonemizer
Simple text to phones converter for multiple languages
mdmmn378/PhotoMaker
PhotoMaker [CVPR 2024]
mdmmn378/rudolfs
A high-performance, caching Git LFS server with an AWS S3 and local storage back-end.
mdmmn378/rust-sdks
LiveKit realtime and server SDKs for Rust
mdmmn378/SalesGPT
Context-aware AI Sales Agent to automate sales outreach.
mdmmn378/sherpa-onnx
Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
mdmmn378/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
mdmmn378/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
mdmmn378/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
mdmmn378/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
mdmmn378/uv
An extremely fast Python package and project manager, written in Rust.
mdmmn378/wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit