Approximate Bayesian Inference for Robust Speech Processing

Show simple item record

dc.contributor.author Wa Maina, Ciira
dc.date.accessioned 2022-11-25T07:11:12Z
dc.date.available 2022-11-25T07:11:12Z
dc.date.issued 2011-06
dc.identifier.uri http://repository.dkut.ac.ke:8080/xmlui/handle/123456789/7764
dc.description.abstract Speech processing applications such as speech enhancement and speaker identification rely on the estimation of relevant parameters from the speech signal. These parameters must often be estimated from noisy observations since speech signals are rarely obtained in ‘clean’ acoustic environments in the real world. As a result, the parameter estimation algorithms we employ must be robust to environmental factors such as additive noise and reverberation. In this work we derive and evaluate approximate Bayesian algorithms for the following speech processing tasks: 1) speech enhancement 2) speaker identification 3) speaker verification and 4) voice activity detection. Building on previous work in the field of statistical model based speech enhancement, we derive speech enhancement algorithms that rely on speaker dependent priors over linear prediction parameters. These speaker dependent priors allow us to handle speech enhancement and speaker identification in a joint framework. Furthermore, we show how these priors allow voice activity detection to be performed in a robust manner. We also develop algorithms in the log spectral domain with applications in robust speaker verification. The use of speaker dependent priors in the log spectral domain is shown to improve equal error rates in noisy environments and to compensate for mismatch between training and testing conditions. en_US
dc.language.iso en en_US
dc.title Approximate Bayesian Inference for Robust Speech Processing en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account