saiful9379
An Enthusiastic AI | Deep Learning | ML Researcher
https://hishab.coGulshan-2 House 4A, Rd 96, Dhaka 1212
Pinned Repositories
Bangla_LLAMA
This model is finetune bangla LLAMA(LLM) model. We train this model using LORA approach
Bangla_Translation
Transformer Based Bangla Machine Translation.Here we have used 195775 number of sentence pair Bangla to English sentence.
Bangla_TTS
Bangla TTS Inference pipeline using Vit TTS
BanglaASR
Fine-tune Bangla ASR model which was trained Bangla Mozilla Common Voice Dataset
BanglaNER
Bangla Name Entity Recognition (NER) is extract human name from input bangla string or text
BN-LDA-YOLO8
Bangla Layout Document Analysis Model[Yolov8]
google_street_view_data
Google street view data crawling
pdf_parser
This report use for pdf parser purposes. it can extract text, image, cruve as well as table information.
pybangla
Text Normalizer module use for Bangla as well as English digit convert to textual format, Normalize Date and Extract Date
resume_ai
Introducing Smart Resume AI: Revolutionizing Resume Sorting and Job Matching
saiful9379's Repositories
saiful9379/pybangla
Text Normalizer module use for Bangla as well as English digit convert to textual format, Normalize Date and Extract Date
saiful9379/BanglaASR
Fine-tune Bangla ASR model which was trained Bangla Mozilla Common Voice Dataset
saiful9379/Bangla_TTS
Bangla TTS Inference pipeline using Vit TTS
saiful9379/resume_ai
Introducing Smart Resume AI: Revolutionizing Resume Sorting and Job Matching
saiful9379/Bangla_LLAMA
This model is finetune bangla LLAMA(LLM) model. We train this model using LORA approach
saiful9379/MRZ_DETECTION
MRZ Detection Using Yolov7
saiful9379/BanglaNER
Bangla Name Entity Recognition (NER) is extract human name from input bangla string or text
saiful9379/AudioRecoder
Recording Audio file using pyaudio module
saiful9379/langchain
🦜🔗 Build context-aware reasoning applications
saiful9379/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
saiful9379/NeMo
NeMo: a toolkit for conversational AI
saiful9379/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
saiful9379/saiful9379
About Me
saiful9379/xtts-finetune-tests
In this repository I will be running various experiments on finetune different parts for xtts
saiful9379/BanglaNER_BERT
Bert Bangla NER
saiful9379/dataspeech
saiful9379/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
saiful9379/jtubespeech
saiful9379/langchain-tutorials
A set of LangChain Tutorials from my youtube channel
saiful9379/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
saiful9379/LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
saiful9379/Moisture-Detection
Moisture detection project
saiful9379/NeMo-text-processing
NeMo text processing for ASR and TTS
saiful9379/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
saiful9379/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
saiful9379/Train_Hifigan_XTTS
This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.
saiful9379/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
saiful9379/wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
saiful9379/WebRTC-Audio-Streaming
Real Time Audio Streaming using WebRTC.
saiful9379/xtts-api-server
A simple FastAPI Server to run XTTSv2