The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. ex. Some numerals are expressed as "XNUMX".
Copyrights notice
The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. Copyrights notice
Este artigo propõe um método de mixagem de voz de baixa complexidade com codecs de voz baseado em codificação preditiva para conferências multimídia. O método proposto aplica uma técnica de gerenciamento de estado de filtro (FSM) a um método de mistura parcial, a fim de evitar inconsistência dos estados de filtro dos codificadores. A inconsistência é criada pela troca dos codificadores quando os alto-falantes a serem mixados são trocados. Os resultados das avaliações subjetivas da qualidade da fala mostram que o método proposto evita a inconsistência e alcança uma qualidade de fala significativamente maior do que o método convencional de mixagem parcial sem o FSM e quase a mesma qualidade de fala que o método de mixagem completo. Os resultados da avaliação da complexidade mostram que o método proposto atinge uma complexidade muito menor do que o método de mistura completa.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copiar
Hironori ITO, Kazunori OZAWA, "Low Complexity Speech Mixing with Speech Codecs Based on Predictive Coding for Multimedia Conferences" in IEICE TRANSACTIONS on Communications,
vol. E92-B, no. 7, pp. 2477-2483, July 2009, doi: 10.1587/transcom.E92.B.2477.
Abstract: This paper proposes a method of low complexity speech mixing with speech codecs based on predictive coding for multimedia conferences. The proposed method applies a filter state management (FSM) technique to a partial mixing method in order to avoid inconsistency of the filter states of encoders. The inconsistency is created by switching of the encoders when the speakers to be mixed are switched. The results of subjective evaluations of speech quality show that the proposed method avoids the inconsistency, and achieves significantly higher speech quality than the conventional partial mixing method without the FSM and almost the same speech quality as the full mixing method. The complexity evaluation results show that the proposed method achieves much lower complexity than the full mixing method.
URL: https://global.ieice.org/en_transactions/communications/10.1587/transcom.E92.B.2477/_p
Copiar
@ARTICLE{e92-b_7_2477,
author={Hironori ITO, Kazunori OZAWA, },
journal={IEICE TRANSACTIONS on Communications},
title={Low Complexity Speech Mixing with Speech Codecs Based on Predictive Coding for Multimedia Conferences},
year={2009},
volume={E92-B},
number={7},
pages={2477-2483},
abstract={This paper proposes a method of low complexity speech mixing with speech codecs based on predictive coding for multimedia conferences. The proposed method applies a filter state management (FSM) technique to a partial mixing method in order to avoid inconsistency of the filter states of encoders. The inconsistency is created by switching of the encoders when the speakers to be mixed are switched. The results of subjective evaluations of speech quality show that the proposed method avoids the inconsistency, and achieves significantly higher speech quality than the conventional partial mixing method without the FSM and almost the same speech quality as the full mixing method. The complexity evaluation results show that the proposed method achieves much lower complexity than the full mixing method.},
keywords={},
doi={10.1587/transcom.E92.B.2477},
ISSN={1745-1345},
month={July},}
Copiar
TY - JOUR
TI - Low Complexity Speech Mixing with Speech Codecs Based on Predictive Coding for Multimedia Conferences
T2 - IEICE TRANSACTIONS on Communications
SP - 2477
EP - 2483
AU - Hironori ITO
AU - Kazunori OZAWA
PY - 2009
DO - 10.1587/transcom.E92.B.2477
JO - IEICE TRANSACTIONS on Communications
SN - 1745-1345
VL - E92-B
IS - 7
JA - IEICE TRANSACTIONS on Communications
Y1 - July 2009
AB - This paper proposes a method of low complexity speech mixing with speech codecs based on predictive coding for multimedia conferences. The proposed method applies a filter state management (FSM) technique to a partial mixing method in order to avoid inconsistency of the filter states of encoders. The inconsistency is created by switching of the encoders when the speakers to be mixed are switched. The results of subjective evaluations of speech quality show that the proposed method avoids the inconsistency, and achieves significantly higher speech quality than the conventional partial mixing method without the FSM and almost the same speech quality as the full mixing method. The complexity evaluation results show that the proposed method achieves much lower complexity than the full mixing method.
ER -