Mandarin Pronunciation Modeling Based on CASS Corpus

(整期优先)网络出版时间:2002-03-13
/ 1
Thepronunciationvariabilityisanimportantissuethatmustbefacedwithwhendevelopingpracticalautomaticspontaneousspeechrecognitionsystems.Inthispaper,thefactorsthatmayaffecttherecognitionperformanceareanalyzed,inculdingthosespecifictotheChineselanguage.BystudyintheINITIAL/FINAL(IF)characteristicsofChineselanguageanddevelopingtheBayesianequation,theconceptsofgeneralizedINITIAL/FINAL(GIF)andgeneralizedsyllable(GS),theGIFmodelingandtheIF-GIFmodeling,aswellasthecontext-dependentpronunciationweighting,areproposedbasedonawellphoneticallytranscribedseeddatabase.Byusingthesemethods,theChinesesylableerrorrate(SER)isreducedby6.3%and4.2%comparedwiththeGIFmodelingandIFmodelingrespectivelywhenthelanguagemodel,suchassyllableorwordN-gram,isnotused.Theeffectivenessofthesemethodsisalsoprovedwhenmoredatawithoutthephonetictranscriptionareusedtorefinetheacousticmodelusingtheproposediterativeforced-alignmentbasedtranscribing(IFABT)method,achievinga5.7%SERreduction.