Pinned Repositories
Conda-Jupyter-Docker
Create conda environment and launch jupyter notebook in Anaconda docker container
dtw-compare-audio-files
Compute the MFCCs and measure (dis)similarity between two audio files using DTW
EuroSat-Satellite-CNN-and-ResNet
Classifying custom image datasets by creating Convolutional Neural Networks and Residual Networks from scratch with PyTorch
Image-Caption-Generation
InceptionV3-Multi-layer GRU based automatic image captioning with Keras and TensorFlow frameworks
IMECA
Automatic image captioning on Android-based mobile application with CNN and multi-layer GRU encoder-decoder model
Question-Answering-BERT
Extractive Question-Answering with BERT on SQuAD v2.0 (Stanford Question Answering Dataset) using NVIDIA PyTorch Lightning
Speaker-Verification
Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, SpeakerNet, TitaNet-L).
Speech-Datasets-for-ASR
Download speech datasets (English and non-English) for Automatic Speech Recognition
Turkish-Text-to-Speech
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
YOLO-Darknet-Video-and-Image-Detection-Traffic-Signs
YOLO Darknet: Traffic sign detection on image and video
Rumeysakeskin's Repositories
Rumeysakeskin/Turkish-Text-to-Speech
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Rumeysakeskin/Speaker-Verification
Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, SpeakerNet, TitaNet-L).
Rumeysakeskin/EuroSat-Satellite-CNN-and-ResNet
Classifying custom image datasets by creating Convolutional Neural Networks and Residual Networks from scratch with PyTorch
Rumeysakeskin/Image-Caption-Generation
InceptionV3-Multi-layer GRU based automatic image captioning with Keras and TensorFlow frameworks
Rumeysakeskin/IMECA
Automatic image captioning on Android-based mobile application with CNN and multi-layer GRU encoder-decoder model
Rumeysakeskin/Speech-Datasets-for-ASR
Download speech datasets (English and non-English) for Automatic Speech Recognition
Rumeysakeskin/YOLO-Darknet-Video-and-Image-Detection-Traffic-Signs
YOLO Darknet: Traffic sign detection on image and video
Rumeysakeskin/Conda-Jupyter-Docker
Create conda environment and launch jupyter notebook in Anaconda docker container
Rumeysakeskin/dtw-compare-audio-files
Compute the MFCCs and measure (dis)similarity between two audio files using DTW
Rumeysakeskin/Question-Answering-BERT
Extractive Question-Answering with BERT on SQuAD v2.0 (Stanford Question Answering Dataset) using NVIDIA PyTorch Lightning
Rumeysakeskin/Archiconda3-for-ARM64-Jetson-TX1-TX2
Create light-weight conda environment for ARM64 devices
Rumeysakeskin/ASR-Quantization
Post-training quantization on Nvidia Nemo ASR model
Rumeysakeskin/Image-Captioning
Image captioning with a benchmark of CNN-based encoder and GRU-based inject-type (init-inject, pre-inject, par-inject) and merge decoder architectures
Rumeysakeskin/mms-turkish-tts
Turkish text to speech model that the part of Facebook's Massively Multilingual Speech
Rumeysakeskin/Image-Classification-InceptionV3
Transfer learning using Inception V3 for custom image classification dataset with TensorFlow and Keras
Rumeysakeskin/Custom-Object-Detection-PyTorch
Custom object detection on a video dataset using PyTorch Faster RCNN
Rumeysakeskin/Rumeysakeskin
Rumeysakeskin/Speech-Emotion-Recognition-Turkish-and-more
Advanced Speech Emotion Recognition, based on ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets and 14 languages (Emotions: Disgust, Neutral, Kind, Anger, Surprise, Joy)
Rumeysakeskin/Free-Offline-Multilingual-Translators
Fast and accurate multilingual translations, all available offline for enhanced privacy and accessibility.
Rumeysakeskin/KenLM
Determining the probability of a sequence of words in Turkish using the KenLM language model with Python
Rumeysakeskin/NGC-docker
NVIDIA GPU Cloud setup and building NVIDIA Containers for Jetson and JetPack
Rumeysakeskin/TextPrepR
Package for cleaning and preprocessing text data is supported for all languages (some functions) and English (all fuctions).
Rumeysakeskin/jetson-voice
ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT
Rumeysakeskin/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)