ISSAI

Institute of Smart Systems and Artificial Intelligence

Kazakhstan

Pinned Repositories

ISSAI_SAIDA_Kazakh_ASR
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
Language:Shell52 6 76
kaz-image-captioning
ExpansionNet v2 model trained on the COCO dataset with captions translated into Kazakh
Language:Jupyter Notebook32 1 21
Kazakh_TTS
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.
Language:Shell137 15 1225
KazEmoTTS
An open-source Kazakh Emotional Text-to-Speech Dataset
Language:Python31 3 24
KazNERD
An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.
Language:Python29 2 25
SpeakingFaces
A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer interaction.
Language:Python84 4 59
TFW
TFW: Annotated Thermal Faces in the Wild Dataset
Language:Jupyter Notebook24 1 34
thermal-facial-landmarks-detection
SF-TL54: Thermal Facial Landmark Dataset with Visual Pairs.
Language:Jupyter Notebook47 2 57
TurkicASR
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
Language:Python72 6 39
TurkicTTS
A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek.
Language:Python72 4 37

ISSAI's Repositories

IS2AI/Kazakh_TTS
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.
Language:Shell137 15 1225
IS2AI/SpeakingFaces
A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer interaction.
Language:Python84 4 59
IS2AI/TurkicASR
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
Language:Python72 6 39
IS2AI/KazEmoTTS
An open-source Kazakh Emotional Text-to-Speech Dataset
Language:Python31 3 24
IS2AI/KazNERD
An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.
Language:Python29 2 25
IS2AI/OpenThermalPose
An Open-Source Annotated Thermal Human Pose Dataset
23 0 23
IS2AI/Central-Asian-Food-Dataset
42 food classes from Kazakh National and Central Asian cuisine
Language:Python20 0 0
IS2AI/faces-in-event-streams
This repo contains code and instructions for the detection of faces in event streams
Language:Python15 2 81
IS2AI/Kazakh_ASR
Language:Shell13 4 20
IS2AI/IMUWiFine
Language:Python12 1 12
IS2AI/Soyle
Language:Python11 1 0
IS2AI/Kazakh-Speech-Commands-Dataset
Kazakh Speech Commands Dataset
Language:Jupyter Notebook9 1 0
IS2AI/KazQAD
An open-source Kazakh Question Answering Dataset
8 5 00
IS2AI/KazLLM_Benchmark
Language:Python6 1 0
IS2AI/AnyFacePP
Language:Python4 0 01
IS2AI/visual_assistant
A visual assistant system for blind people.
Language:Python3 2 0
IS2AI/city-identification
This repo contains dataset and models for city classification
Language:Python2 2 00
IS2AI/city-sustainability-indexes
This repo contains code and models for detecting city sustainability indexes
Language:Python2 1 00
IS2AI/Common-Objects-in-Hemispherical-Images-Dataset
39 classes of objects sampled from the MS COCO dataset captured with a hemispherical/fisheye camera
Language:Python2 0 0
IS2AI/TatarTTS
TatarTTS: An Open-Source Text-to-Speech Synthesis Dataset for the Tatar Language
2 1 21
IS2AI/Central_Asian_Food_Scenes_Dataset
This is the repository for the Central Asian Food Scenes Dataset
1
IS2AI/talk-llm
Talk with ChatGPT
Language:Jupyter Notebook0 0 00
IS2AI/construction-sites-detection
This repo contains code and dataset for training and testing ml model which implements instance segmentation of construction sites
Language:Python
IS2AI/Global-Gastronomic-Culinary-Dataset
Language:Python
IS2AI/Keyword-MLP-LangID
Language:Jupyter Notebook
IS2AI/MMHA-28
MMHA-28: Human Action Recognition Across RGB, Depth, Thermal, and Event Modalities
Language:Python
IS2AI/Multilingual-Speech-Command-Recognition
Language:Jupyter Notebook
IS2AI/multispectral-motion-analysis
IS2AI/oylan_car_demo
Language:TypeScript
IS2AI/TatarSCR
An Open-Source Speech Commands Dataset for the Tatar Language
Language:Jupyter Notebook0 0