Measuring Speech Intelligibility with Romanian Synthetic Unpredictable Sentences in Normal Hearing

Oana Astefanei; Sebastian Cozma; Cristian Martu; Roxana Serban; Corina Butnaru; Petronela Moraru; Gabriela Musat; Luminita Radulescu

doi:10.3390/audiolres14060085

Measuring Speech Intelligibility with Romanian Synthetic Unpredictable Sentences in Normal Hearing

Audiol Res. 2024 Dec 1;14(6):1028-1044. doi: 10.3390/audiolres14060085.

Authors

Oana Astefanei^{1

2}, Sebastian Cozma^{1

2

3}, Cristian Martu^{2

3}, Roxana Serban^{2

3}, Corina Butnaru^{2

3}, Petronela Moraru^{1

3}, Gabriela Musat⁴, Luminita Radulescu^{1

2

3}

Affiliations

¹ Doctoral School, Grigore T Popa University of Medicine and Pharmacy, 700115 Iasi, Romania.
² ENT Clinic Department, Clinical Rehabilitation Hospital, 700661 Iasi, Romania.
³ Department of Otorhinolaryngology, Faculty of Medicine, Grigore T Popa University of Medicine and Pharmacy, 700115 Iasi, Romania.
⁴ Department of Otorhinolaryngology, "Carol Davila" University of Medicine and Pharmacy, 050474 Bucharest, Romania.

Abstract

Background/objectives: Understanding speech in background noise is a challenging task for listeners with normal hearing and even more so for individuals with hearing impairments. The primary objective of this study was to develop Romanian speech material in noise to assess speech perception in diverse auditory populations, including individuals with normal hearing and those with various types of hearing loss. The goal was to create a versatile tool that can be used in different configurations and expanded for future studies examining auditory performance across various populations and rehabilitation methods.

Methods: This study outlines the development of Romanian speech material for speech-in-noise testing, initially presented to normal-hearing listeners to establish baseline data. The material consisted of unpredictable sentences, each with a fixed syntactic structure, generated using speech synthesis from all Romanian phonemes. A total of 50 words were selected and organized into 15 lists, each containing 10 sentences, with five words per sentence. Two evaluation methods were applied in two sessions to 20 normal-hearing volunteers. The first method was an adaptive speech-in-noise recognition test designed to assess the speech recognition threshold (SRT) by adjusting the signal-to-noise ratio (SNR) based on individual performance. The intelligibility of the lists was further assessed at the sentence level to evaluate the training effect. The second method was used to obtain normative data for the SRT, defined as the SNR at which a subject correctly recognizes 50% of the speech material, as well as for the slope, which refers to the steepness of the psychometric function derived from threshold recognition scores measured at three fixed SNRs (-10 dB, -7 dB, and -4 dB) during the measurement phase.

Results: The adaptive method showed that the training effect was established after two lists and remained consistent across both sessions. During the measurement phase, the fixed SNR method yielded a mean SRT50 of -7.38 dB with a slope of 11.39%. These results provide reliable and comparable data, supporting the validity of the material for both general population testing and future clinical applications.

Conclusions: This study demonstrates that the newly developed Romanian speech material is effective for evaluating speech recognition abilities in noise. The training phase successfully mitigated initial unfamiliarity with the material, ensuring that the results reflect realistic auditory performance. The obtained SRT and slope values provide valuable normative data for future auditory assessments. Due to its flexible design, the material can be further developed and extended to accommodate various auditory rehabilitation methods and diverse populations in future studies.

Keywords: Romanian speech; sentence in noise; speech recognition threshold; synthetic speech; text-to-speech system.

Grants and funding

This research received no external funding.