Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), pp. 4813-4816, Las Vegas, March 31-April 4, 2008. doi: 10.1109/ICASSP.2008.4518734
Abstract: We describe an experiment in which listeners were asked to detect two specific forms of stress in talkers' recorded voices heard via six different simulated communication systems. Both task-induced stress and dramatized urgency were used. The communication systems included low-rate digital speech coding combined with bit errors, packet loss, and packet loss concealment. Twenty-four listeners participated in a total of 11,520 detection trials. A parallel investigation of word intelligibility in sentence context used 576 trials. Intelligibility results showed wide variance across communication systems, whereas stress detection results showed less variance. More specifically, we found that listener detection of dramatized talker urgency was 4.7 times more robust to communication system degradations than word intelligibility in sentence context.
Keywords: stress; speech coding; degradation; testing; databases; robustness; automatic speech recognition; context; helicopters; oral communication
For technical information concerning this report, contact:
Stephen D. Voran
Institute for Telecommunication Sciences
Disclaimer: Certain commercial equipment, components, and software may be identified in this report to specify adequately the technical aspects of the reported results. In no case does such identification imply recommendation or endorsement by the National Telecommunications and Information Administration, nor does it imply that the equipment or software identified is necessarily the best available for the particular application or uses.