Alternating minimisation for glottal inverse filtering

Show full item record



Permalink

http://hdl.handle.net/10138/195078

Citation

Bleyer , I R , Lybeck , L , Auvinen , H , Airaksinen , M , Alku , P & Siltanen , S 2017 , ' Alternating minimisation for glottal inverse filtering ' , Inverse Problems , vol. 33 , no. 6 , 065005 . https://doi.org/10.1088/1361-6420/aa6eb8

Title: Alternating minimisation for glottal inverse filtering
Author: Bleyer, Ismael Rodrigo; Lybeck, Lasse; Auvinen, Harri; Airaksinen, Manu; Alku, Paavo; Siltanen, Samuli
Contributor organization: Department of Mathematics and Statistics
Mikko Samuli Siltanen / Principal Investigator
Date: 2017-06
Language: eng
Number of pages: 19
Belongs to series: Inverse Problems
ISSN: 0266-5611
DOI: https://doi.org/10.1088/1361-6420/aa6eb8
URI: http://hdl.handle.net/10138/195078
Abstract: A new method is proposed for solving the glottal inverse filtering (GIF) problem. The goal of GIF is to separate an acoustical speech signal into two parts: the glottal airflow excitation and the vocal tract filter. To recover such information one has to deal with a blind deconvolution problem. This ill-posed inverse problem is solved under a deterministic setting, considering unknowns on both sides of the underlying operator equation. A stable reconstruction is obtained using a double regularization strategy, alternating between fixing either the glottal source signal or the vocal tract filter. This enables not only splitting the nonlinear and nonconvex problem into two linear and convex problems, but also allows the use of the best parameters and constraints to recover each variable at a time. This new technique, called alternating minimization glottal inverse filtering (AM-GIF), is compared with two other approaches: Markov chain Monte Carlo glottal inverse filtering (MCMC-GIF), and iterative adaptive inverse filtering (IAIF), using synthetic speech signals. The recent MCMC-GIF has good reconstruction quality but high computational cost. The state-of-the-art IAIF method is computationally fast but its accuracy deteriorates, particularly for speech signals of high fundamental frequency (F0). The results show the competitive performance of the new method: With high F0, the reconstruction quality is better than that of IAIF and close to MCMC-GIF while reducing the computational complexity by two orders of magnitude.
Subject: ill-posed problems
glottal inverse filtering
double regularisation
alternating minimisation
glottal airflow
wavelets
deterministic
VOICE GENERATION PROBLEM
VOCAL-TRACT
SCATTERING
SHAPE
REGULARIZATION
QUALITY
MODEL
111 Mathematics
114 Physical sciences
Peer reviewed: Yes
Rights: cc_by
Usage restriction: openAccess
Self-archived version: publishedVersion


Files in this item

Total number of downloads: Loading...

Files Size Format View
Bleyer_2017_Inverse_Problems_33_065005.pdf 2.051Mb PDF View/Open

This item appears in the following Collection(s)

Show full item record