Pinned Repositories
2048-android
The Android port of the 2048 game (for offline play)
actor-platform
Actor Messaging platform
AI-Auto-Video-Generator
An AI-powered storytelling video generator that takes user input as a story prompt, generates a story using OpenAI's GPT-3, creates images using OpenAI's DALL-E, adds voiceover using ElevenLabs API, and combines the elements into a video.
AICoverGen
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
AllTrans
Completely translate Android apps in place, like Google Translate in Chrome
amoc
amoc is a simple tool for running massively parallel XMPP tests
Duolingo-Clone
Duolingo Android App Clone
munzuruleee
munzuruleee's Repositories
munzuruleee/AI-Auto-Video-Generator
An AI-powered storytelling video generator that takes user input as a story prompt, generates a story using OpenAI's GPT-3, creates images using OpenAI's DALL-E, adds voiceover using ElevenLabs API, and combines the elements into a video.
munzuruleee/AICoverGen
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
munzuruleee/android_audio_mixer
A simple Android library for processing audio and mixing multiple audio tracks in parallel or sequentially, built with Android native media APIs (MediaExtractor, MediaCodec, MediaFormat, MediaMuxer) and Java.
munzuruleee/Audio-to-Voice-Dataset
Create an LJSpeech-structured voice dataset from WAV input
munzuruleee/BanglaASR
A fine-tuned Bangla ASR model trained on the Bangla Mozilla Common Voice dataset
munzuruleee/banglanmt
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
munzuruleee/bark
🔊 Text-Prompted Generative Audio Model
munzuruleee/book
GPT4 & LangChain Chatbot for large PDF docs
munzuruleee/comprehensive-bangla-tts
Aims to build a multilingual TTS pipeline, with a main focus on releasing the first high-performing COQUI🐸TTS (Text-to-Speech) based neural voice cloning systems for Bangla, supporting different SOTA models for Bangla as well as a multilingual (Arabic+Bengali) code-mixed TTS pipeline.
munzuruleee/Contextual-Spell-Checker-For-Bangla
Automatic Context-Sensitive Spelling Correction for Bangla Text Using BERT and Levenshtein Distance
munzuruleee/crontab
munzuruleee/face_up
New faceswap approach for image and video domains
munzuruleee/Gramformer
A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
munzuruleee/IndicTrans2
Translation models for 22 scheduled languages of India
munzuruleee/IndicWav2Vec
Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2
munzuruleee/meshenger-android
P2P Audio/Video calls over local networks. No server or Internet access needed.
munzuruleee/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
munzuruleee/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
munzuruleee/shottower
An open source, self-hosted implementation of the Shotstack API backend
munzuruleee/Show-1
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
munzuruleee/speech-assistant-openai-realtime-api-node
munzuruleee/StableCascade
munzuruleee/TalkingHead
Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.
munzuruleee/translate
Effortless Real-Time Sign Language Translation
munzuruleee/vid
This is a TikTok UI clone that replicates the TikTok feed and its elements, including smooth video scrolling. Explore an interactive interface similar to TikTok and enjoy a seamless browsing experience.
munzuruleee/video-editor-tutorial
How to build a video editor in JavaScript
munzuruleee/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
munzuruleee/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
munzuruleee/wav2lip-hq
Extension of Wav2Lip repository for processing high-quality videos.
munzuruleee/whisper.cpp
Port of OpenAI's Whisper model in C/C++