The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. ex. Some numerals are expressed as "XNUMX".
In this paper, we propose a peak-weighted cepstral lifter (PWL) for enhancing the spectral peaks of an all-pole model spectrum in the cepstral domain. The design parameter of the PWL is the degree of pole enhancement, that is, of pole shifting toward the unit circle. The optimal pole shifting factor is chosen by considering the sensitivity to spectral resonance peaks, the variability of cepstral variances, and the recognition accuracy. Next, we generalize the PWL so that the optimal shifting factor is determined adaptively on a frame-by-frame basis. Compared with other cepstral lifters, a speech recognizer employing the frame-adaptive PWL provides better recognition performance.
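The link between pole shifting and cepstral weighting mentioned in the abstract can be illustrated with a standard identity for minimum-phase all-pole models: for 1/A(z) with A(z) = prod(1 - p_i z^-1), the cepstral coefficients are c_n = (1/n) sum_i p_i^n, so scaling every pole radius by a factor gamma multiplies c_n by gamma^n. The sketch below checks this numerically. It is not the PWL defined in the paper, whose exact weighting is not given on this page; the pole values, the factor `gamma`, and the function names are hypothetical choices made for illustration only.

```python
import cmath
import math


def allpole_cepstrum(poles, n_coeffs):
    """Cepstral coefficients c_1..c_N of 1/A(z), A(z) = prod(1 - p_i z^-1).

    Uses c_n = (1/n) * sum_i p_i^n, valid when every |p_i| < 1.
    For conjugate pole pairs the imaginary parts cancel, so only the
    real part is kept.
    """
    return [sum(p ** n for p in poles).real / n for n in range(1, n_coeffs + 1)]


def exponential_lifter(cepstrum, gamma):
    """Weight c_n by gamma^n (n starts at 1)."""
    return [gamma ** n * c for n, c in enumerate(cepstrum, start=1)]


# Hypothetical example: two resonances given as conjugate pole pairs.
poles = [
    cmath.rect(0.90, 0.3 * math.pi), cmath.rect(0.90, -0.3 * math.pi),
    cmath.rect(0.70, 0.7 * math.pi), cmath.rect(0.70, -0.7 * math.pi),
]
gamma = 1.05  # assumed shifting factor; gamma * max|p_i| must stay below 1

c = allpole_cepstrum(poles, 12)

# Route 1: weight the cepstrum of the original model.
c_liftered = exponential_lifter(c, gamma)

# Route 2: move the poles toward the unit circle, then take the cepstrum.
c_shifted = allpole_cepstrum([gamma * p for p in poles], 12)

same = all(math.isclose(a, b, rel_tol=1e-9, abs_tol=1e-12)
           for a, b in zip(c_liftered, c_shifted))
print("liftering matches pole shifting:", same)
```

Both routes give the same coefficients, which is why a pole shifting factor can be applied directly in the cepstral domain without refitting the all-pole model.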
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Hong Kook KIM, Hwang Soo LEE, "Spectral Peak-Weighted Liftering of Cepstral Coefficients for Speech Recognition" in IEICE TRANSACTIONS on Information,
vol. E83-D, no. 7, pp. 1540-1549, July 2000.
Abstract: In this paper, we propose a peak-weighted cepstral lifter (PWL) for enhancing the spectral peaks of an all-pole model spectrum in the cepstral domain. The design parameter of the PWL is the degree of pole enhancement or pole shifting toward the unit circle. The optimal pole shifting factor is chosen by considering the sensitivity to spectral resonance peaks, the variability of cepstral variances, and the recognition accuracy. Next, we generalize the PWL so that the optimal shifting factor is adaptively determined on a frame-by-frame basis. Compared with other cepstral lifters, a speech recognizer employing the frame-adaptive PWL provides better recognition performance.
URL: https://global.ieice.org/en_transactions/information/10.1587/e83-d_7_1540/_p
@ARTICLE{e83-d_7_1540,
author={Hong Kook KIM and Hwang Soo LEE},
journal={IEICE TRANSACTIONS on Information},
title={Spectral Peak-Weighted Liftering of Cepstral Coefficients for Speech Recognition},
year={2000},
volume={E83-D},
number={7},
pages={1540-1549},
abstract={In this paper, we propose a peak-weighted cepstral lifter (PWL) for enhancing the spectral peaks of an all-pole model spectrum in the cepstral domain. The design parameter of the PWL is the degree of pole enhancement or pole shifting toward the unit circle. The optimal pole shifting factor is chosen by considering the sensitivity to spectral resonance peaks, the variability of cepstral variances, and the recognition accuracy. Next, we generalize the PWL so that the optimal shifting factor is adaptively determined on a frame-by-frame basis. Compared with other cepstral lifters, a speech recognizer employing the frame-adaptive PWL provides better recognition performance.},
keywords={},
doi={},
ISSN={},
month={July},
}
TY - JOUR
TI - Spectral Peak-Weighted Liftering of Cepstral Coefficients for Speech Recognition
T2 - IEICE TRANSACTIONS on Information
SP - 1540
EP - 1549
AU - Hong Kook KIM
AU - Hwang Soo LEE
PY - 2000
DO -
JO - IEICE TRANSACTIONS on Information
SN -
VL - E83-D
IS - 7
JA - IEICE TRANSACTIONS on Information
Y1 - July 2000
AB - In this paper, we propose a peak-weighted cepstral lifter (PWL) for enhancing the spectral peaks of an all-pole model spectrum in the cepstral domain. The design parameter of the PWL is the degree of pole enhancement or pole shifting toward the unit circle. The optimal pole shifting factor is chosen by considering the sensitivity to spectral resonance peaks, the variability of cepstral variances, and the recognition accuracy. Next, we generalize the PWL so that the optimal shifting factor is adaptively determined on a frame-by-frame basis. Compared with other cepstral lifters, a speech recognizer employing the frame-adaptive PWL provides better recognition performance.
ER -