Pinned Repositories
ISSAI_SAIDA_Kazakh_ASR
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
kaz-image-captioning
ExpansionNet v2 model trained on the COCO dataset with captions translated into Kazakh
Kazakh_TTS
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.
KazEmoTTS
An open-source Kazakh Emotional Text-to-Speech Dataset
KazNERD
An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.
SpeakingFaces
A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer interaction.
TFW
TFW: Annotated Thermal Faces in the Wild Dataset
thermal-facial-landmarks-detection
SF-TL54: Thermal Facial Landmark Dataset with Visual Pairs.
TurkicASR
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
TurkicTTS
A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek.
ISSAI's Repositories
IS2AI/Kazakh_TTS
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.
IS2AI/SpeakingFaces
A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer interaction.
IS2AI/ISSAI_SAIDA_Kazakh_ASR
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
IS2AI/kaz-image-captioning
ExpansionNet v2 model trained on the COCO dataset with captions translated into Kazakh
IS2AI/KazNERD
An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.
IS2AI/TFW
TFW: Annotated Thermal Faces in the Wild Dataset
IS2AI/Chest-X-ray-module
Leveraging the recent advances in machine learning and availability of public medical imaging datasets, we created a Free Online X-Ray Diagnostic Tool using deep learning that can determine the X-ray type and visualize the pathology.
IS2AI/tutorial_indoor_localization_WiFine
In this tutorial, we will load, preprocess a simplified version of the WiFine dataset. The data will be used to train a location prediction model based (a random forest regressor and a multilayer perceptron)
IS2AI/Kazakh_ASR
IS2AI/MultilingualASR
IS2AI/trimodal_person_verification
This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"
IS2AI/COVID-19-Simulator
Covid Epidemic Simulator
IS2AI/Uzbek_ASR
IS2AI/IMUWiFine
IS2AI/Shear-Design-Optimization-of-RC-Column
Deep Neural Network model for the automatic design of rectangular reinforced concrete columns under axial load, biaxial bending and shear forces.
IS2AI/Particle-Based-COVID19-Simulator
Particle-based COVID-19 Simulator with Contact Tracing and Testing
IS2AI/tutorial_COVID-19_epidemic_simulator
The workshop materials for Epidemic simulator and indoor Wi-Fi localization projects.
IS2AI/WiFine
A finer-level sequential dataset of WiFi received signal strengths (RSS) and corresponding (x, y, z) positions.
IS2AI/CLTL_Turkic_ASR
Automatic Speech Recognition for Turkic Languages Using Cross-Lingual Transfer Learning from Kazakh
IS2AI/AD_classifier
IS2AI/cargoxray
It is a dataset of X-ray images of cargo transport. The dataset includes images of railcars and trucks with trailers.
IS2AI/Deep_Fault_Tolerant_Control
Implementation of deep fault tolerant control for inverted pendulum with reaction wheels
IS2AI/ExoMem-AR-Memory
ExoMem: Augmented Reality based human memory enhancement system using AI
IS2AI/gym-viewshed
Gym custom environment for ArcGIS Viewshed 2 analysis
IS2AI/PTZ-Control
Driver for controlling the PTZ with Python script
IS2AI/RL_PTZ_Coverage
Reinforcement learning algorithms for PTZ (pan-tilt-zoom) system with surveillance camera
IS2AI/visual_assistant
A visual assistant system for blind people.