The data is collected from native English speakers in 40 countries, covering a variety of pronunciation habits and characteristics. The script is designed by linguists and covers a wide range of topics, including generic, interactive, in-car, and home scenarios. The text is manually proofread to ensure high accuracy.

1. Specifications

Format: 16 kHz, 16-bit, uncompressed WAV, mono channel.
Recording environment: quiet indoor environment, low background noise, no echo.
Recording content (read speech): generic category; human-machine interaction category; smart home command and control category; in-car command and control category; numbers.
Demographics: speakers are evenly distributed across all age groups, including children, teenagers, middle-aged adults, and the elderly.
Device: Android mobile phone, iPhone.
Language: American English, British English, Canadian English, Australian English, French English, German English, Spanish English, Italian English, Portuguese English, Russian English, Chinese English, Indian English, Japanese English, Korean English, Singaporean English, and others.
Application scenarios: speech recognition; voiceprint recognition.

2. About Nexdata

Nexdata owns 200,000 hours of off-the-shelf speech recognition data, 800 TB of image/video data, and about 2 billion pieces of NLP data. These ready-to-go datasets support instant delivery and can quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/?source=Datarade or contact us via info@nexdata.ai.
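
As a rough illustration of the audio format listed under Specifications (16 kHz, 16-bit, uncompressed WAV, mono), the following Python sketch checks a single file against those values using only the standard-library wave module. The function name check_wav_spec and the file name in the usage note are hypothetical and are not part of any Nexdata tooling; the dataset's actual file layout is not described here.

```python
import wave

# Values taken from the Specifications section of this listing.
EXPECTED_SAMPLE_RATE_HZ = 16_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples
EXPECTED_CHANNELS = 1            # mono


def check_wav_spec(path):
    """Return a list of mismatches against the listed format.

    An empty list means the file matches. This is a hypothetical helper
    written for illustration. Note that wave.open() raises wave.Error
    for files that are not uncompressed PCM WAV.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_SAMPLE_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {wav.getsampwidth() * 8} bits")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
        if wav.getcomptype() != "NONE":
            problems.append(f"compression type is {wav.getcomptype()}")
    return problems
```

For example, check_wav_spec("sample.wav") (an assumed file name) returns an empty list when the recording matches the listed format, and otherwise one short message per mismatch.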