Nexdata has off-the-shelf 15,000 hours of 8kHz conversational speech datasets covering 100+ countries including English, German, French, Spanish, Italian, Portuguese, Korean, Japanese, Hindi, Russia and etc. 1. Specifications Format : 8kHz, 8bit, u-law/a-law pcm, mono channel; Environment : quiet indoor environment, without echo; Recording content : No preset linguistic data๏ผdozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed; Demographics : Speakers are evenly distributed across all age groups, covering children, teenagers, middle-aged, elderly, etc. Annotation : annotating for the transcription text, speaker identification, gender and noise symbols; Device : Telephony recording system; Language : 100+ Languages; Application scenarios : speech recognition; voiceprint recognition; Accuracy rate : the word accuracy rate is not less than 98% 2. About Nexdata Nexdata owns off-the-shelf 200,000 hours of speech recognition data, 800TB of image/video data, about 2 billion pieces of NLP data. These ready-to-go datasets support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/?source=Datarade or contact us via info@nexdata.ai.