您的位置: 专家智库 > >

国家自然科学基金(90820011)

作品数:8 被引量:64H指数:3
相关作者:刘文举徐波倪崇嘉李鹏廖逢钗更多>>
相关机构:中国科学院自动化研究所山东财政学院三明学院更多>>
发文基金:国家自然科学基金国家重点基础研究发展计划国家高技术研究发展计划更多>>
相关领域:电子电信自动化与计算机技术交通运输工程机械工程更多>>

文献类型

  • 6篇期刊文章
  • 1篇会议论文

领域

  • 5篇电子电信
  • 3篇自动化与计算...
  • 1篇机械工程
  • 1篇交通运输工程

主题

  • 3篇语音
  • 3篇语音识别
  • 2篇声学模型
  • 2篇声源
  • 2篇声源定位
  • 2篇连续语音
  • 2篇连续语音识别
  • 2篇MUSIC算...
  • 2篇大词汇量
  • 1篇短时傅里叶变...
  • 1篇短时傅立叶变...
  • 1篇信号
  • 1篇信号子空间
  • 1篇信息处理
  • 1篇音变
  • 1篇音节
  • 1篇语音识别系统
  • 1篇声学建模
  • 1篇搜索
  • 1篇搜索技术

机构

  • 3篇中国科学院自...
  • 1篇河南理工大学
  • 1篇山东财政学院
  • 1篇三明学院

作者

  • 3篇刘文举
  • 1篇廖逢钗
  • 1篇杨占磊
  • 1篇倪崇嘉
  • 1篇徐波
  • 1篇晁浩
  • 1篇李鹏

传媒

  • 3篇Chines...
  • 1篇声学学报
  • 1篇计算机应用
  • 1篇中文信息学报

年份

  • 2篇2013
  • 3篇2012
  • 2篇2009
8 条 记 录,以下是 1-7
排序方式:
汉语大词汇量连续语音识别系统研究进展被引量:44
2009年
大词汇量连续语音识别(LVCSR)技术近年来发展迅速,并在许多领域得到了广泛的应用,国内外许多大公司加大了对语音识别技术的研究,不少商业化的语音识别系统已经面世,并得到较为广泛的使用。该文综述了近年来大词汇量连续语音识别技术的研究进展,描述了汉语大词汇量连续语音识别系统,主要是基于统计方法的语音识别系统的框架与设计方法,对语音识别系统的一些关键技术和原理进行了分析,并对近年来国内外对语音识别研究发展动向进行了讨论。
倪崇嘉刘文举徐波
关键词:中文信息处理语音识别模型自适应搜索技术
Integrating induced probability into decoding for large vocabulary continuous speech recognition被引量:2
2012年
This paper integrates location information of frames into conventional acoustic model(AM)and language model(LM)likelihoods,in order to distinguish potential path candidates more precisely at decoding stage.This paper proposes an induced probability,which represents location information of frames within the whole acoustic space.By integrating the induced probability,the decoder is directed to search within the most promising regions of acoustic space.Promising paths are enhanced and unlikely paths are weakened.Experiments conducted on Chinese Putonghua show that the character error rate is reduced by 10.95%relatively without increasing decoding complexity significantly.Finally,pruning analysis shows that integrating location information of frames into traditional decoding framework is helpful for improving system performance.
YANG Zhanlei LIU Wenju CHAO Hao
关键词:连续语音识别大词汇量声学模型
A signal subspace dimension estimator based on F-norm with application to subspace-based multi-channel speech enhancement被引量:2
2012年
Although the signal subspace approach has been studied extensively for speech enhancement,no good solution has been found to identify signal subspace dimension in multichannel situation.This paper presents a signal subspace dimension estimator based on F-norm of correlation matrix,with which subspace-based multi-channel speech enhancement is robust to adverse acoustic environments such as room reverberation and low input signal to noise ratio (SNR).Experiments demonstrate the presented method leads to more noise reduction and less speech distortion comparing with traditional methods.
LI Chao LIU Wenju
关键词:信号子空间维数估计
Auditory filter based broadband MUSIC algorithm for sound source localization被引量:7
2013年
Based on the analysis of the shortcomings of broadband MUSIC algorithm with short-time Fourier transform(SF-MUSIC) for sound source localization,a broadband MUSIC algorithm with auditory filter(AF-MUSIC) was proposed.The proposed algorithm first employs auditory filter bank to decompose the signals received on the microphone array,and then locates the sound source with MUSIC algorithm over every frequency channel.At last,by combining with the subinterval frequency estimation,the final localization result is gained.Evaluations on the proposed algorithm prove that comparing with the SF-MUSIC algorithm,the AF-MUSIC algorithm decreases the average error of the estimation results with 2.5479 degree in different source conditions.The accuracy of sound source DOA estimation is enhanced effectively.
LIAO FengchaiLI PengLIU Wenju
关键词:MUSIC算法声源定位滤波器组短时傅立叶变换频率估计
采用听觉滤波器的宽带MUSIC声源定位方法被引量:6
2012年
在分析了采用短时傅里叶变换的宽带MUSIC声源定位算法(SF-MUSIC)存在问题的基础上,提出了一种采用听觉滤波器的宽带MUSIC声源定位算法(AF-MUSIC)。该算法使用听觉滤波器组对传声器阵列接收到的信号进行不等带宽分解后,在各个频率通道上使用MUSIC算法进行声源定位,并结合子区间频数估计法得出最终定位结果。对算法进行的实验评估表明,在不同声源类型条件下,相比SF-MUSIC算法,AF-MUSIC算法的平均估计误差减少2.5479°,有效地提高了声源波达方向估计的精度。
廖逢钗李鹏刘文举
关键词:MUSIC算法声源定位宽带短时傅里叶变换
Mandarin Pitch Accent Prediction Using Hierarchical Model Based Ensemble Machine Learning
In this study, we combine the Mandarin characteristics with Mandarin acoustic attribute and text information a...
Chongjia Ni 1
汉语语音识别中基于音节的声学模型改进算法被引量:1
2013年
针对汉语语音识别中协同发音现象引起的语音信号的易变性,提出一种基于音节的声学建模方法。首先建立基于音节的声学模型以解决音节内部声韵母之间的音变现象,并提出以音节内双音子模型来初始化基于音节声学模型的参数以缓解训练数据稀疏的问题;然后引入音节之间的过渡模型来处理音节之间的协同发音问题。在"863-test"测试集上进行的汉语连续语音识别实验显示汉语字的相对错误率下降了12.13%,表明了基于音节的声学模型和音节间过渡模型相结合在解决汉语协同发音问题上的有效性。
晁浩杨占磊刘文举
关键词:语音识别协同发音音变声学建模
共1页<1>
聚类工具0