DataoceanAI
Dataocean AI is one of the world’s leading AI and machine learning training data providers – a one-stop service for TTS, ASR, NLP, CV, transcription, and lexicons.
@DataoceanAI
Pinned Repositories
CNVSRC2023Baseline
Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)
DataoceanAI
Datasets-Best-Sellers
Take a look at our best-selling datasets! There are multiple categories to choose from, including speech, text, image, and multimodal datasets.
Datasets-New-Arrivals
Explore our newly released datasets! We will keep updating them each month, so stay tuned!
Dolphin
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
ICME2025-Audio-Encoder-Challenge
Off_the_shelf_Datasets
DataoceanAI's Repositories
DataoceanAI/Dolphin
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
DataoceanAI/CNVSRC2023Baseline
Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)
DataoceanAI/Dolphin-Dataset
Here is the list of training datasets for the Dolphin model. Contact Dataocean AI if you need high-quality datasets.
DataoceanAI/ICME2025-Audio-Encoder-Challenge
DataoceanAI/Off_the_shelf_Datasets
DataoceanAI/DataoceanAI
DataoceanAI/Datasets-Best-Sellers
Take a look at our best-selling datasets! There are multiple categories to choose from, including speech, text, image, and multimodal datasets.
DataoceanAI/Datasets-New-Arrivals
Explore our newly released datasets! We will keep updating them each month, so stay tuned!