公共文化服务平台

基于PLDA的“一对多”下的说话人确认方法研究: 近年来,概率线性鉴别分析（Probabilistic Linear Discriminant Analysis,PLDA）因其优异的性能而得到学者们的广泛关注。然而,各主流单位的PLDA研究都是基于NISTSRE 201...; 许云飞黄厚军金怡珠李桂莲周若华; 关键词：说话人识别

基于SVM的合成语音检测: 斯超向量分类技术引入到了合成语音检测系统中，利用svm二分类起进行合成语音检测在保证检测速度的情况下可以达到一个较为优秀的效果。但由于训练数据标注问题，对性能还是一定影响。并且在更大的训练数据规模下，svm也很难进一步进...; 杨朔计哲郭耀辉颜永红; 关键词：自动化检测信号识别

多特征融合的维哈口语短文本分类: 少数民族语言如维吾尔语、哈萨克语、乌兹别克语、柯尔克孜语等使用基本相同的字符,并且共用大量词汇,难以通过判断特殊字符来区分文种。而来自社交网络的口语文本因为长度短、噪声多以及不合语法,是相近语言识别的难题。我们提出了一种...; 何峻青赵学敏颜永红; 关键词：文字信息处理少数民族语言文种识别; 文献传递

Acoustic characteristics of stop consonants in fluent reading Chinese Putonghua speech of adult stutterers被引量：1: 2013年; This study investigated whether adults who stutter and normal adult speakers differ in the production of stop consonants in fluent reading Chinese Putonghua speech.Voice onset time(VOT) was measured and the spectral moments at the stop burst were calculated for the stutterers(both before and after the speech therapy) and also for the nonstutterers. The statistical results showed that there were no significant differences in VOT between the nonstutterers and stutterers either prior to or after therapy,although the mean VOT of the stutterers was slightly greater than that of the nonstutterers.The results also indicated that both the obstruction place and the subsequent syllabic final exhibited an influence to a greater extent on VOT for the stutterers.In the spectral domain,the spectral mean of the stuttering participants before therapy was significantly different from that of the normal participants, whereas the group difference became insignificant after the therapy session.The smaller spectral mean for the stutterers might be interpreted as a more posterior occlusion in the oral cavity when producing alveolars and velars.In addition,productions of the stutterers scattered with a wider range in the space of spectral moments.Furthermore,the smaller main effect of syllabic finals on the mean spectral frequency of the burst suggested that the stutterers exhibited weaker anticipatory coarticulation than the nonstutterers.; FENG YongqiangYAN QianGAO XinglongPAN FupingXING LiliLIN ChunlanPAN Jielin; 关键词：声学特性成年正常成人

A forced alignment approach to detect Chinese repetitive stuttering: 2013年; A forced alignment based algorithms to detect Chinese repetitive stuttering is studied. According to the features of repetitions in Chinese stuttered speech,improvement solutions are provided based on the previous research findings.First,a multi-span looping forced alignment decoding networks is designed to detect multi-syllable repetitions in Chinese stuttered speech.Second,branch penalty factor is added in the networks to adjust decoding trend using recursive search in order to reduce the error from the complexity of the decoding networks. Finally,we re-judge the detected stutters by calculating confidence to improve the reliability of the detection result.The experimental results show that compared to previous algorithm,the proposed algorithm can improve system performance significantly,about 18%average detection error rate relatively.; ZHANG JunboYAN QianGAO XinglongPAN FupingFENG YongqiangXING LiliLIN ChunlanPAN Jielin; 关键词：惩罚因子

基于NMF和FCRF的单通道语音分离被引量：1: 2017年; 近年来,非负矩阵分解(non-negative matrix factorization,NMF)被广泛应用于单通道语音分离问题。然而,标准的NMF算法假设语音的相邻帧之间是相互独立的,不能表征语音信号的时间连续性信息。为此,该文提出了一种基于NMF和因子条件随机场(factorial conditional random field,FCRF)的语音分离算法,首先将NMF和k均值聚类结合对纯净语音的频谱结构以及时间连续性进行建模,然后利用得到的模型训练FCRF模型,进而对混合语音信号进行分离。结果表明:该算法相比没有考虑语音时间连续特性的基于NMF的算法如激活集牛顿算法(active-set Newton algorithm,ASNA),在客观指标上有明显提高。; 李煦屠明吴超国雁萌纳跃跃付强颜永红; 关键词：非负矩阵分解 K均值聚类

基于DNN的声学模型自适应实验被引量：5: 2015年; 声学模型自适应算法研究目的是缓解由测试数据和训练数据不匹配而引起的识别性能下降问题.基于深度神经网络(DNN)模型框架的自适应技术中,重训练是最直接的方法,但极容易出现过拟合现象,尤其是自适应数据稀疏的情况下.文章针对领域相关的自动语音识别任务,对典型的两种声学模型自适应算法进行了尝试,实验了基于线性变换网络的自适应方法和基于相对熵正则化准则的自适应方法,并对两种算法进行了详尽的系统性能比较.结果表明,在不同的自适应数据量下,相对熵正则化自适应方法均能表现出较好的性能.; 张宇计哲万辛张震葛凤培颜永红; 关键词：语音识别

两扬声器配置下的串声消除系统参数优化设置被引量：1: 2014年; 针对三维声音两扬声器重放中基本上独立研究逆滤波器的设计或扬声器的配置等因素对串声消除系统(CCS)性能的影响,提出了采用频域最小均方(LS)估计逼近方法,系统考察这些因素之间的关联以及对串声消除性能优化的作用,并通过折中考虑CCS的运算效率及系统性能获得了一组最优参数。实验采用通道分离度(CS)和性能误差(PE)两个指标对串声消除效果进行综合评价,仿真结果表明,该组最优参数能获得很好的串声消除效果。; 许春冬李军锋裘嫄夏日升颜永红; 关键词：三维声音逆滤波

多领域系统融合在语音云系统中的应用: <正>0引言近年来,各大IT公司推出了自己的语音云系统,语音识别技术被大量运用到人们的日常生活中。通过云系统的强大计算能力,用户可利用语音通过移动终端打开手机应用,编辑短信、电子邮件,拨打电话和搜索网页等。各式各样的功能...; 陈梦喆张晴晴颜永红; 文献传递

利用二重打分方法的激活词语音识别: <正>0引言语音被认为是人与人之间交流最自然的方式之一,自动语音识别(ASR)也是一种重要的人机交互方式。几十年来,众多学者做了大量与语音识别相关的工作,其中的一个方向就是激活词语音识别,也可以称为激活词检测:向机器发出...; 邢安昊黎塔颜永红; 文献传递

渝B2-20050021-1　渝公网安备 50019002500403号　违法和不良信息举报中心　互联网出版许可证　新出网证(渝)字10号

国家自然科学基金(61271426)

文献类型

领域

主题

机构

作者

传媒

年份

用户反馈

国家自然科学基金(61271426)

文献类型

领域

主题

机构

作者

传媒

年份

用户登录

用户反馈