A TWO-STAGE ALGORITHM FOR ENHANCEMENT OF REVERBERANT SPEECH

Date: 2026-04-23

Mingyang Wu and DeLiang Wang

Department of Computer Science and Engineering

and Center for Cognitive Science, The Ohio State University, Columbus, OH 43210-1277, USA

Email: mwu@, dwang@cse.ohio-state.edu

ABSTRACT

Room reverberation causes two perceptual distortions on clean speech: coloration and long-term reverberation. These two effects correspond to two physical variables: signal-to-reverberant energy ratio (SRR) and reverberation time, respectively. Based on this observation, we propose a two-stage algorithm that enhances reverberant speech from one-microphone recordings. In the first stage, an inverse filter is estimated to reduce coloration effects or increase SRR. The second stage employs spectral subtraction to minimize the influence of long-term reverberation. The proposed algorithm significantly improves the quality of reverberant speech. A comparison with a recent one-microphone enhancement algorithm shows that our system produces significantly better results.

1. INTRODUCTION

A main cause of speech degradation in practically all listening situations is room reverberation. Although a person with normal hearing can tolerate room reverberation to a considerable degree, hearing-impaired listeners suffer from reverberation effects disproportionately [12]. Reverberation also causes significant performance degradation in current automatic speech recognition (ASR) and speaker recognition systems. Consequently, an effective reverberant speech enhancement system can be used to improve the design of intelligent hearing aids and is essential for many speech technology applications.

In this article we study one-microphone reverberant speech enhancement. This is motivated by the following two considerations. First, a one-microphone solution is highly desirable for many real-world applications such as hands-free audio communication and audio information retrieval. Second, moderately reverberant speech is highly intelligible in monaural listening conditions. Hence, how to achieve this monaural capability remains a fundamental scientific question.

A number of reverberant speech enhancement algorithms have been designed utilizing more than one microphone. For example, microphone-array based methods [6], such as beamforming techniques, attempt to suppress the sound energy coming from directions other than that of the direct source and therefore enhance target speech. As pointed out by Koenig et al. [10], the reverberation tails of the impulse responses, characterizing the reverberation process in a room with multiple microphones and one speaker, are uncorrelated. Several algorithms have been proposed to reduce the reverberation effects by removing the incoherent parts of received signals. Blind deconvolution algorithms aim to reconstruct the inverse filters without prior knowledge of room impulse responses (for example, see [8]). Brandstein and Griebel [5] utilize the extrema of wavelet coefficients to reconstruct the linear prediction (LP) residual of original speech.

Reverberant speech enhancement using one microphone is significantly more challenging than that using multiple microphones. Nonetheless, a number of one-microphone algorithms have been proposed. Bees et al. [3] employ a cepstrum-based method to estimate the cepstrum of the reverberation impulse response, and its inverse is then used to dereverberate the signal. Several dereverberation algorithms (for example, see [2]) are motivated by the effects of reverberation on the Modulation Transfer Function (MTF). Yegnanarayana and Murthy [16] observed that the LP residual of voiced clean speech has damped sinusoidal patterns within each glottal cycle, while that of reverberant speech is smeared and resembles Gaussian noise. With this observation, the LP residual of clean speech is estimated and then the enhanced speech is resynthesized. Nakatani and Miyoshi [13] proposed a system capable of blind dereverberation by employing the harmonic structure of speech. Good results are obtained, but this algorithm requires a large amount of reverberant speech produced using the same room impulse response function. Despite these studies, existing reverberant speech enhancement algorithms do not reach the performance level demanded by many practical applications.

2. BACKGROUND

Reverberation causes a noticeable change in speech quality. Berkley and Allen [4] identified that two physical variables, reverberation time T60 and spectral deviation, are important for reverberant speech quality. Consider the impulse response as a combination of three parts: the direct sound and the early and late reflections. While late reflections smear the speech spectra and reduce the intelligibility and quality of speech signals, early reflections cause another distortion of the speech signal called coloration; the non-flat frequency response of the early reflections distorts the speech spectrum. The coloration can be characterized by a spectral deviation defined as the standard deviation of the room frequency response. Increasing either spectral deviation or reverberation time results in decreased reverberant speech quality. Moreover, Jetzt [9] shows that spectral deviation is
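The two physical variables named in this section, reverberation time T60 and spectral deviation, can be illustrated numerically. The following is a minimal sketch and not part of the proposed algorithm. It assumes a toy impulse response (a unit direct path plus exponentially decaying white noise), measures spectral deviation as the standard deviation of the log-magnitude response in dB, and estimates T60 by Schroeder backward integration. The function names, the 0.1 tail gain, the dB scale, and the -5 dB to -25 dB fitting range are all illustrative assumptions.

```python
import numpy as np

def synthetic_rir(t60, fs=16000, length_s=1.0, seed=0):
    """Toy room impulse response (assumption): direct path + decaying noise tail."""
    rng = np.random.default_rng(seed)
    n = int(length_s * fs)
    t = np.arange(n) / fs
    # Amplitude envelope that falls by 60 dB after t60 seconds.
    decay = 10.0 ** (-3.0 * t / t60)
    h = 0.1 * rng.standard_normal(n) * decay  # 0.1 is an arbitrary tail gain
    h[0] = 1.0  # direct sound
    return h

def spectral_deviation_db(h):
    """Standard deviation of the log-magnitude frequency response, in dB."""
    mag = np.abs(np.fft.rfft(h))
    return float(np.std(20.0 * np.log10(mag + 1e-12)))

def estimate_t60(h, fs):
    """Estimate T60 from the energy decay curve (Schroeder backward integration)."""
    energy = np.cumsum(h[::-1] ** 2)[::-1]          # energy remaining after each sample
    edc_db = 10.0 * np.log10(energy / energy[0])    # decay curve, 0 dB at t = 0
    idx = np.where((edc_db <= -5.0) & (edc_db >= -25.0))[0]
    slope, _ = np.polyfit(idx / fs, edc_db[idx], 1)  # decay rate in dB per second
    return -60.0 / slope

if __name__ == "__main__":
    fs = 16000
    h = synthetic_rir(t60=0.3, fs=fs)
    print("estimated T60 (s):", estimate_t60(h, fs))
    print("spectral deviation (dB):", spectral_deviation_db(h))
```

A pure direct path (a single unit impulse) has a flat frequency response, so its spectral deviation under this measure is zero; adding reflections makes the response non-flat and the deviation positive.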
