Recent Publications
-
Stephen D. Voran and Jaden Pieper, “A Powerful, Fixed-Size Modulation Spectrum Representation for Perceptually Consistent Speech Evaluation,” Technical Memorandum NTIA TM-24-274, September 2024
We develop the wideband fixed-size modulation spectra (FMS) and show that they contain the necessary information to perform perceptually consistent evaluation of speech. We compare FMS with the already established frame-based modula-tion spectra as r...
-
Institute for Telecommunication Sciences, “ISART 2022: Proceedings of the 19th International Symposium on Advanced Radio Technologies: Evolving Spectrum-Sharing Regulation through Data-, Science-, and Technology-Driven Analysis and Decision-making,” Conference Proceedings NITA SP-24-573, August 2024
The topic of the 2022 International Symposium on Advanced Radio Technologies™ (ISART 2022), which took place fully virtually June 13, 14, 15, and 16, 2022, was “Evolving Spectrum-Sharing Regulation through Data-, Science-, and Technology-Driven Analy...
-
Jaden Pieper and Stephen D. Voran, “AlignNet: Learning Dataset Score Alignment Functions To Enable Better Training of Speech Quality Estimators,” Conference Paper, July 2024
We develop two complementary advances for training no-reference (NR) speech quality estimators with independent datasets. Multi-dataset finetuning (MDF) pretrains an NR estimator on a single dataset and then finetunes it on multiple datasets at once,...
-
Jaden Pieper, Stephen D. Voran, and Kenneth R. Tilley, “Improving Speech Audio for Prerecorded and Live Online Conference Sessions,” Special Publication NTIA SP 24-572, July 2024
Learn how to identify impaired speech and avoid its causes. Listen to audio clips that demonstrate six useful tips for online conference and meeting organizers, as well as contributors who plan to speak during a live online event or submit a pre-reco...