A TWO-STAGE ALGORITHM FOR ENHANCEMENT OF REVERBERANT SPEECH

时间:2025-03-09

Room reverberation causes two perceptual distortions on clean speech: Coloration and long-term reverberation. These two effects correspond to two physical variables: Signal-toreverberant energy ratio (SRR) and reverberation time, respectively. Based on thi

ATWO-STAGEALGORITHMFORENHANCEMENTOF

REVERBERANTSPEECH

MingyangWuandDeLiangWang

DepartmentofComputerScienceandEngineering

andCenterforCognitiveScienceTheOhioStateUniversityColumbus,OH43210-1277,USA

Email:mwu@,dwang@cse.ohio-state.edu

ABSTRACT

Roomreverberationcausestwoperceptualdistortionsoncleanspeech:Colorationandlong-termreverberation.Thesetwoeffectscorrespondtotwophysicalvariables:Signal-to-reverberantenergyratio(SRR)andreverberationtime,respectively.Basedonthisobservation,weproposeatwo-stagealgorithmthatenhancesreverberantspeechfromone-microphonerecordings.Inthefirststage,aninversefilterisestimatedtoreducecolorationeffectsorincreaseSRR.Thesecondstageemploysspectralsubtractiontominimizetheinfluenceoflong-termreverberation.Theproposedalgorithmsignificantlyimprovesthequalityofreverberantspeech.Acomparisonwitharecentone-microphoneenhancementalgorithmshowsthatoursystemproducessignificantlybetterresults.

1.INTRODUCTION

Amaincauseofspeechdegradationinpracticallyalllisteningsituationsisroomreverberation.Althoughapersonwithnormalhearingislittleaffectedbyroomreverberationtoaconsiderabledegree,hearing-impairedlistenerssufferfromreverberationeffectsdisproportionally[12].Also,reverberationcausessignificantperformancedecrementforcurrentautomaticspeechrecognition(ASR)andspeakerrecognitionsystems.Consequently,aneffectivereverberantspeechenhancementsystemcanbeusedforimprovingintelligenthearingaidsdesignandisessentialformanyspeechtechnologyapplications.

Inthisarticlewestudyone-microphonereverberantspeechenhancement.Thisismotivatedbythefollowingtwoconsiderations.First,aone-microphonesolutionishighlydesirableformanyreal-worldapplicationssuchashand-freeaudiocommunicationandaudioinformationretrieval.Second,moderatelyreverberantspeechishighlyintelligibleinmonaurallisteningconditions.Hencehowtoachievethismonauralcapabilityremainsafundamentalscientificquestion.

Anumberofreverberantspeechenhancementalgorithmshavebeendesignedutilizingmorethanonemicrophone.Forexample,microphone-arraybasedmethods[6],suchasbeamformingtechniques,attempttosuppressthesoundenergycomingfromdirectionsotherthanthatofthedirectsourceandthereforeenhancetargetspeech.AspointedoutbyKoenigetal.[10],thereverberationtailsoftheimpulseresponses,characterizingthereverberationprocessinaroomwithmultiplemicrophonesandonespeaker,areuncorrelated.Several

«

algorithmsareproposedtoreducethereverberationeffectsbyremovingtheincoherentpartsofreceivedsignals.Blinddeconvolutionalgorithmsaimtoreconstructtheinversefilterswithoutthepriorknowledgeofroomimpulseresponses(forexample,see[8]).BrandsteinandGriebel[5]utilizetheextremaofwaveletcoefficientstoreconstructthelinearprediction(LP)residualoforiginalspeech.

Reverberantspeechenhancementusingonemicrophoneissignificantlymorechallengingthanthatusingmultiplemicrophones.Nonetheless,anumberofone-microphonealgorithmshavebeenproposed.Beesetal.[3]employsacepstrum-basedmethodtoestimatethecepstrumofreverberationimpulseresponse,anditsinverseisthenusedtodereverberatethesignal.Severaldereverberationalgorithms(forexample,see[2])aremotivatedbytheeffectsofreverberationonModulationTransferFunction(MTF).YegnanarayanaandMurthy[16]observedthatLPresidualofvoicedcleanspeechhasdampedsinusoidalpatternswithineachglottalcycle,whilethatofreverberantspeechissmearedandresemblesGaussiannoise.Withthisobservation,LPresidualofcleanspeechisestimatedandthentheenhancedspeechisresynthesized.NakataniandMiyoshi[13]proposedasystemcapableofblinddereverberationbyemployingtheharmonicstructureofspeech.Goodresultsareobtainedbutthisalgorithmrequiresalargeamountofreverberantspeechproducedusingthesameroomimpulseresponsefunction.Despitethesestudies,existingreverberantspeechenhancementalgorithms,however,donotreachaperformanceleveldemandedbymanypracticalapplications.

2.BACKGROUND

Reverberationcausesanoticeablechangeinspeechquality.BerkleyandAllen[4]identifiedthattwophysicalvariables,reverberationtimeT60andspectraldeviation,areimportantforreverberantspeechquality.Considertheimpulseresponseasacombinationofthreeparts,thedirect,early,andlatereflections.Whilelatereflectionssmearthespeechspectraandreducetheintelligibilityandqualityofspeechsignals,earlyreflectionscauseanotherdistortionofspeechsignalcalledcoloration;thenon-flatfrequencyresponseoftheearlyreflectionsdistortsthespeechspectrum.Thecolorationcanbecharacterizedbyaspectraldeviationdefinedasthestandarddeviationofroomfrequencyresponse.Increasingeitherspectraldeviationorreverberationtimeresultsindecreasedreverberantspeechquality.Moreover,Jetzt[9]showsthatspectraldeviationis

…… 此处隐藏:2784字,全部文档内容请下载后查看。喜欢就下载吧 ……
A TWO-STAGE ALGORITHM FOR ENHANCEMENT OF REVERBERANT SPEECH.doc 将本文的Word文档下载到电脑

精彩图片

热门精选

大家正在看

× 游客快捷下载通道(下载后可以自由复制和排版)

限时特价:7 元/份 原价:20元

支付方式:

开通VIP包月会员 特价:29元/月

注:下载文档有可能“只有目录或者内容不全”等情况,请下载之前注意辨别,如果您已付费且无法下载或内容有问题,请联系我们协助你处理。
微信:fanwen365 QQ:370150219