The British English Speech Synthesis Corpus contains the recordings of 1 voice talent (female, professional broadcaster, 32 years old, born and and raised in London). It is recorded in a professional studio over 2 channels (Shure SM15 microphone and Electroglottography (EGG) sensor). The script contains 10,146 (approx.) utterances in total. There are 31 categories in this corpus, which contains the most common language phenomenon in British English. Speech rate consistency, energy consistency and tone color consistency are strictly controlled during recording process. Each utterance wave is stored in Windows uncompressed PCM format. Manual Prosody labeling, Stress Labeling, Phone Boundary Labeling were adopted based on the real speech. A pronunciation lexicon is available with a phone set. All manually checked. Each utterance was carefully proofread by linguists, to make sure the phonetic symbol is consistent with the real pronunciation. Misread utterances have been re-recorded."