Ngày tải lên: 31/03/2014, 20:20
... Using Chunk Based Partial Parsing of Spontaneous Speech in Unrestricted Domains for Reducing Word Error Rate in Speech Recognition Klaus Zechner and Alex Waibel Language Technologies Institute ... putting efforts into further training. ã alternative language models: An idea for im- provement here is to integrate skipped words into the LM (similar to the modeling of noise in speech) . ... 1996). Since we cannot build on semantic knowledge for constructing parsers in the way it is done for lim- ited domains when attempting to parse spontaneous speech in unrestricted domains, we...
Ngày tải lên: 23/03/2014, 19:20
Vietnamese Speech Recognition and Synthesis in Embedded System Using T-Engine
... pitch marking for prosodic modification of speech using td-psola', 2006 Vietnamese Speech Recognition and Synthesis in Embedded System Using T-Engine Trinh Van Loan, La The Vinh Department ... layout III. PROPOSED METHOD FOR SPEECH RECOGNITION IN T-ENGINE Fig.2. Speech recognition in T-Engine The UDA1342 audio codec in T-Engine provides a minimal sampling frequency (SF) of 44100Hz. ... of speech engines. Finally, we demonstrate a human-computer interaction software in T-Engine embedded system. I. INTRODUCTION In this paper, we are concerned with the combination of speech recognition...
Ngày tải lên: 12/04/2013, 16:05
Tài liệu Báo cáo khoa học: "A Method for Correcting Errors in Speech Recognition Using the Statistical Features of Character Co-occurrence" pptx
... errors. In integrating recognition and translation into a speech translation system, the development of the following processes is therefore important: (1) detection of errors in speech recognition ... the string including errors from the String- Database (the former string is referred to as the Similar-String, and the latter as the Error-String). Finally, the correction is made using the ... error-block in the Error-String, am found in the Similar- String, take out the string (denoted C) between A and B in 1 For detecting errors in Japanese sentences, the method using the probability...
Ngày tải lên: 20/02/2014, 18:20
Tài liệu Báo cáo khoa học: "Learning Sub-Word Units for Open Vocabulary Speech Recognition" doc
... each word in the corpus. Resulting segments form the sub- word lexicon. 1 Learning input includes a list of words to segment taken from raw text, a mapping between words and classes (side information ... proposed sub-words are not simply modeling the training OOVs (named-entities) better than the base- line sub-words, but also describe better novel unex- pected words. Furthermore, including context ... segmentations for the word ANJANI with pronunciation: AA,N,JH,AA,N,IY Algorithm 1 Training Input: Lexicon L from training text W , Dictionary D, Mapping M, L2S pronunciations, Annealing temp T . Initialization: Assign...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "A Finite-Slate Parser for Use in Speech Recognition" pdf
... of/d/before/y /in didyou (5b) Reduction of unstressed/u/to schwa in) ,~u (5c) Flapping of intervocalic /t/ in hit. it (5d) Reduction of schwa and devoicing of/u /in to (5e) Reduc:ion of geminate/t /in ... Have Thought, AJCL, 8: I, [982. 19. Klatt, D., Word Verification in a Speech Understanding System, in P,. R, eddy (ed.), Speech Recognition, Invited Papers Presented at the 1974 [EEE Symposium, ... of finite-state parsing techniques at the phonetic level in order to exploit certain classes or" contextual constraints. -In the second section, the parsing framework is extended in order...
Ngày tải lên: 21/02/2014, 20:20
Báo cáo khoa học: "Improving On-line Handwritten Recognition using Translation Models in Multimodal Interactive Machine Translation" docx
... Linguistics Improving On-line Handwritten Recognition using Translation Models in Multimodal Interactive Machine Translation Vicent Alabau, Alberto Sanchis, Francisco Casacuberta Institut Tecnol ` ogic d’Inform ` atica Universitat ... discriminative training of direct translation models. In International Conference on Acoustics, Speech, and Signal Process- ing (ICASSP’98), pages 189–192, Seattle, Washing- ton, USA, May. [Rabiner1989] ... translation. In Proceedings of the 2010 International Conference on Multimodal Inter- faces (ICMI-MLMI’10), pages 46:1–4, Beijing, China, Nov. [Barrachina et al.2009] S. Barrachina, O. Bender, F....
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "Word Sense Disambiguation using lexical cohesion in the context" ppt
... of individual word or sense linkages. An interesting question is whether these results will be borne out in other datasets. In the forthcoming work we will inves- tigate their validity in ... (2004). Word Association Thesau- rus as a Resource for Building Wordnet. In GWC 2004. Sussna, M. (1993). Word Sense Disambiguation for Free-Text Indexing Using a Massive Semantic Network. In CKIM'93. ... word we retrieve its corresponding words in each word list and calculate the similarity be- tween the target and these words including the context words. As a result we transform the original...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "FACTORIZATION OF LANGUAGE CONSTRAINTS IN SPEECH RECOGNITION" pptx
... constraints in a post- processing mode using multiple word and string hypotheses generated from the speech decoder as input. When testing on the DARPA resource management task using the word- pair ... decoding, is kept within reasonable bounds without a loss in the performance. In this paper we propose an approach in which speech recognition is still performed in an integrated fashion using ... decoding is also transformed into an FSN in terms of phonetic units. The transformation is obtained by expanding all the non-terminals into the corresponding vocabulary words and each word in...
Ngày tải lên: 08/03/2014, 07:20
Báo cáo khoa học: "Correcting errors in speech recognition with articulatory dynamics" pot
... component takes into account var- ious physiological aspects of human speech production, including intergestural and in- terarticulator co-ordination and timing (Nam and Saltzman, 2003; Goldstein and ... vocal tract during speech. In EMA, the speaker is placed within a low-amplitude electromagnetic field pro- duced within a cube of a known geometry. Tiny sensors within this field induce small electric ... sets for 5-cross validation. MOCHA and TORGO data are never combined in a single training set due to dif- fering EMA recording rates. In all cases, models are database-dependent (i.e., all TORGO...
Ngày tải lên: 16/03/2014, 23:20
Báo cáo khoa học: "Dependencies between Student State and Speech Recognition Problems in Spoken Tutoring Dialogues" ppt
... Detecting Certainness in Spoken Tutorial Dia- logues. In Proc. of Interspeech. D. Litman and K. Forbes-Riley. 2004. Annotating Student Emotional States in Spoken Tutoring Dia- logues. In Proc. ... observed in human-human conversations through a noisy speech channel (Skantze, 2005). Correctness/certainty–SRP interactions We also find an interesting interaction between correctness/certainty ... its success. In our case, if we look at affect (FAH) or attitude (CERT) in isolation we find many interactions; in contrast, combining them offers little insight. 6 Results – insights &...
Ngày tải lên: 17/03/2014, 04:20
Báo cáo khoa học: "WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings" doc
... for word prediction in conversational speech. In IWCS 6, Sixth International Workshop on Computational Se- mantics, Tilburg, Netherlands. H Schmid. 1994. Probabilistic part-of -speech tagging using ... Laprun. 2005. The rich transcription 2005 spring meeting recognition evaluation. In Rich Transcription 2005 Spring Meeting Recognition Evaluation Workshop, Edinburgh, UK. Jay J. Jiang and David W. ... not using the whole WordNet and its similarities, but defining word- trigger pairs that are used for rescor- ing. 5 Acknowledgements This work was supported by the European Union 6th FP IST Integrated...
Ngày tải lên: 17/03/2014, 04:20
Báo cáo khoa học: "Using Speech to Reply to SMS Messages While Driving: An In-Car Simulator User Study" docx
... examined two independent variables: SMS Reply Approach, consisting of voice search and dictation, and Driving Condition, consisting of no driving, easy driving and difficult driving. We included ... automo- biles than using dictation, users may have difficulties verifying whether SMS re- sponse templates match their intended meaning, especially while driving. Using a high-fidelity driving simulator, ... search in increasingly difficult driv- ing conditions. Although the two ap- proaches did not differ in terms of driving performance measures, users made about six times more errors on average using...
Ngày tải lên: 23/03/2014, 16:20
Báo cáo khoa học: "Investigating Pitch Accent Recognition in Non-native Speech" potx
... significantly larger in native speech, recognition remains robust to train- ing with native speech and testing on non-native speech, without significant drops in accuracy. This result argues that binary pitch ... some training sets. How- ever, crucially none of the differences between native-based or near-native training and within- group training reach significance for the binary pitch accent recognition ... recognition using within-group training data. Furthermore, there is no significant drop in accuracy when models trained on native or near-native speech are employed for classification of non-native speech. The...
Ngày tải lên: 23/03/2014, 17:20
Báo cáo khoa học: "LEXICAL ACCESS IN CONNECTED SPEECH RECOGNITION" pptx
... distinguish word. initial/I/ in/ 17/fzom word- inlernal /I/ in /hid/? In this paper, I shall argue for a model which splits the lexical access process into a pre-lexical phonological parsing ... are intended to select the correct word from the cohort. The bulk of engineering systems for speech recognition have finessed the issues of lexical access and word recognition by attempting ... warranted because the resulting system will be considerably more robust in the face of inacct~rate or indeterminate input concerning the nature of the weak syllables in the input utterance. CONCLUSION...
Ngày tải lên: 24/03/2014, 02:20
Báo cáo khoa học: "Practical Issues in Compiling Typed Unification Grammars for Speech Recognition" ppt
... Linguistics, 1993. O. Gauthron and N. Colineau. SETHIVoice: Cgf control by speech- recognition/ interpretation. In I/ITSEC ’99 (Interservice/Industry Training, Simu- lation and Education Conference), ... statistics generated using both techniques. The recognition results were obtained on a test set of 250 utter- ances. Recognition accuracy is measured in word error rate, and recognition speed is measured in multiples ... bug in the description of Moore’s algo- rithm that occurs in his paper, that the set of “retained non- terminals” needs to be extended to include any nonterminals that occur either in the non-initial...
Ngày tải lên: 31/03/2014, 04:20
(ebook) crc press - pattern recognition in speech and language processing 2003
Ngày tải lên: 28/04/2014, 09:51
techniques for noise robustness in automatic speech recognition
Ngày tải lên: 03/05/2014, 20:50