情感语音特征对语料库依赖性的统计分析

doi:10.3969/j.issn.1006-1355-2011.04.031

›› 2011, Vol. 31 ›› Issue (4): 132-136.DOI: 10.3969/j.issn.1006-1355-2011.04.031

• 6.信号处理与故障诊断 • 上一篇下一篇

情感语音特征对语料库依赖性的统计分析

孙颖,张雪英

（太原理工大学信息工程学院，太原 030024 ）

收稿日期:2011-01-04 修回日期:2011-03-24 出版日期:2011-08-18 发布日期:2011-08-18
通讯作者: 孙颖

Statistical Analysis for Database Dependence in Classification of Emotional Speech by using Different Features Extraction Approaches

SUN Ying,ZHANG Xue-ying

（ College of Information Engineering, TYUT, Taiyuan 030024, China ）

Received:2011-01-04 Revised:2011-03-24 Online:2011-08-18 Published:2011-08-18
Contact: SUN Ying

摘要/Abstract

摘要： 简述线性预测倒谱系数（LPCC）、Teager能量算子（TEO）、梅尔频率倒谱系数（MFCC）和过零峰值幅度（ZCPA）特征提取方法，并将这四种方法应用于情感识别。设计两种实验，第一种是使用TYUT和Berlin语料库的单语言实验，这种实验证明，以上四种特征在单一的语料库单一语言条件下均能够有效地表征语音的情感特征，其中MFCC特征对情感的识别率最高。第二种实验是混合语料库的单一语言实验。之前大多数关于情感特征的研究都是基于某一种语料库中某种特定语言的，但在实际中，说话人的背景环境总是多种多样。因此，对特征的混合语料库研究是有现实意义的。第二种实验证明这四种特征都是语料库依赖性的，其中 ZCPA特征的识别率下降最少。

关键词: 声学, 信号处理, 情感语音识别, 语料库依赖性, 情感特征, 混合语料库

Abstract: Four approaches of feature extraction: the Linear Predictive Cepstral Coefficient (LPCC), the Teager Energy Operator (TEO), the Mel-Frequency Cepstral Coefficient (MFCC) and the Zero Crossings with Peak Amplitudes (ZCPA) are described in this paper. And these approaches are applied to emotional speech recognition. Two kinds of experiments are carried out. The first one is a kind of single language experiments with TYUT database and Berlin database. Its results show that these four approaches can represent speech emotion effectively by using single language of single database. MFCC has the best result of the four approaches. The second kind experiment is merge-database of single language. Most previous work on emotional feature extraction is based on a special language of single speech database. But in practice, the environment of the speaker is various. So the study of emotional feature extraction based on merge-database is significative. Experiments of the second kind indicate that the four features are all database dependent. ZCPA features are of the least database dependence of the four approaches.

Key words: acoustics, signal analysis, emotional speech recognition, database dependence, emotional features, merge-database

中图分类号:

TN912.34

孙颖;张雪英. 情感语音特征对语料库依赖性的统计分析[J]. , 2011, 31(4): 132-136.

SUN Ying;ZHANG Xue-ying. Statistical Analysis for Database Dependence in Classification of Emotional Speech by using Different Features Extraction Approaches[J]. , 2011, 31(4): 132-136.

100

HTML			PDF

最新录用	在线预览	正式出版	最新录用	在线预览	正式出版
0	0	0	0	0	100

	来源	本网站

	次数	100
	比例	100%

摘要

297

最新录用	在线预览	正式出版

0	0	297

	来源	本网站

	次数	297
	比例	100%

[1]	郭栋, 熊义周, 周益, 周仪, 饶文毅. 预选档位策略下DCT敲击声品质评价[J]. 噪声与振动控制, 2022, 42(3): 138-143.
[2]	黄泽好, 刘琳, 刘子谦, 张杨, 陈家乐, 严生辉. 变速器啸叫噪声扩展工况传递路径分析与优化[J]. 噪声与振动控制, 2022, 42(3): 144-149.
[3]	张超, 张劲松, 李帅, 徐巍, 周明刚. 某型内燃机车驾驶室阻尼优化降噪分析[J]. 噪声与振动控制, 2022, 42(3): 150-154.
[4]	张宇, 李进, 吴鸿飞, 刘术成. 电动车电池包壳体辐射噪声性能研究[J]. 噪声与振动控制, 2022, 42(3): 155-160.
[5]	李健, 周虹希, 王瑞乾, 唐昭. 车轮降噪效果评价中衰减时间取值与适用性分析[J]. 噪声与振动控制, 2022, 42(3): 161-167.
[6]	胡传俊, 张军, 李虹, 焦明, 刘锋, 刘波. 瞬态扭矩下轮端粘滑异响分析与控制[J]. 噪声与振动控制, 2022, 42(3): 177-181.
[7]	顾佶智, 师蔚, 胡定玉, 廖爱华, 丁亚琦. 强背景噪声下基于谱峭度-波束形成轴承故障特征提取[J]. 噪声与振动控制, 2022, 42(3): 110-115.
[8]	陈士斌, 王谛, 吴健, 于洋, 贺义. 基于信号分离技术的波束形成算法研究[J]. 噪声与振动控制, 2022, 42(3): 116-121.
[9]	姚金忠, 曹继学. 中华鲟保育车间噪声频谱特性分析及降噪措施应用[J]. 噪声与振动控制, 2022, 42(3): 192-195.
[10]	陈亘. 环境背景噪声对飞机噪声监测结果的影响[J]. 噪声与振动控制, 2022, 42(3): 237-240.
[11]	吴佳康, 柳政卿, 王秋成. 复合微穿孔板吸声结构声学性能预测[J]. 噪声与振动控制, 2022, 42(3): 203-208.
[12]	黄永虎, 吕梦圆, 张红丽, 李浩, . 基于复合弹性质量块薄膜声学超材料低频隔声性能研究[J]. 噪声与振动控制, 2022, 42(3): 209-214.
[13]	王前选, 陈志民, 吴玲玲, 刘成沛, 杨艺, 卢钊明. 散射体特征对隔声超材料性能的影响[J]. 噪声与振动控制, 2022, 42(3): 215-219.
[14]	赵鹏瑜, 杨兴林, 马恒, 吴维维. 切向流作用下单层声衬噪声阻尼性能的数值研究[J]. 噪声与振动控制, 2022, 42(3): 229-236.
[15]	朱从云, 张仁琪, 丁国芳, 黄其柏. 具有分流对冲的穿孔管吸声方法[J]. 噪声与振动控制, 2022, 42(3): 13-18.

情感语音特征对语料库依赖性的统计分析

Statistical Analysis for Database Dependence in Classification of Emotional Speech by using Different Features Extraction Approaches

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐 0

Metrics