download speech recognition macros for xp

Báo cáo khoa học: "AN AUTOMATIC SPEECH RECOGNITION SYSTEM FOR ITALIAN LANGUAGE" doc

Báo cáo khoa học: "AN AUTOMATIC SPEECH RECOGNITION SYSTEM FOR ITALIAN LANGUAGE" doc

Ngày tải lên : 01/04/2014, 00:20
... dependent on any W. Then, the recognition task can be decomposed in these problems: 1. perform an acoustic processing able to extract from the speech signal an information A representative of ... vector. Therefore, the acoustic information ,4 is formed by a sequence of labels al, a2,"" , with a considerable reduction in the amount of data needed to represent the speech signal. ... AN AUTOMATIC SPEECH RECOGNITION SYSTEM FOR T! IE ITALIAN LANGUAGE Paolo D'Orta, Marco Ferretti, Alessandro Martelli,...
  • 4
  • 308
  • 0
Tài liệu Báo cáo khoa học: "Learning Sub-Word Units for Open Vocabulary Speech Recognition" doc

Tài liệu Báo cáo khoa học: "Learning Sub-Word Units for Open Vocabulary Speech Recognition" doc

Ngày tải lên : 20/02/2014, 04:20
... hybrid sys- tem for open vocabulary speech recognition. Rather than relying on the text alone, we also utilize side information: a mapping of words to classes so we can optimize learning for a specific ... gradient takes the usual form, where we en- courage the expected segmentation from the current model given the correct labels to equal the expected segmentation and expected labels. The next ... = 40 itera- tions for learning and 200 samples to compute the expectations in Eq. 5. The sampler was initialized by sampling for 500 iterations with deterministic an- nealing for a temperature...
  • 10
  • 441
  • 0
Tài liệu Báo cáo khoa học: "Improving Automatic Speech Recognition for Lectures through Transformation-based Rules Learned from Minimal Data" ppt

Tài liệu Báo cáo khoa học: "Improving Automatic Speech Recognition for Lectures through Transformation-based Rules Learned from Minimal Data" ppt

Ngày tải lên : 20/02/2014, 07:20
... 55.16 Table 5: Experimental evaluation: WER values for instructor K using the WSJ-5K language model. hours 4 for a threshold of 2 when training over tran- scripts for one third of a lecture. Therefore, ... 40.08 43.31 41.52 Table 3: Experimental evaluation: WER values for instructor R using the WEB language models. As for how the transcripts improve, words with lower information content (e.g., a ... tran- scripts for lectures. Section 3 describes our exper- imental setup, and Section 4 analyses its results. 2 Transformation-Based Learning Brill’s tagger introduced the concept of Transformation-Based...
  • 9
  • 427
  • 0
Tài liệu Báo cáo khoa học: "Discriminative Syntactic Language Modeling for Speech Recognition" pdf

Tài liệu Báo cáo khoa học: "Discriminative Syntactic Language Modeling for Speech Recognition" pdf

Ngày tải lên : 20/02/2014, 15:20
... that we used in our experiments. Section 4 describes experiments using the approach. 2 Background 2.1 Previous Work Techniques for exploiting stochastic context-free grammars for language modeling ... are a first step in examining the potential utility of syntactic features for discriminative language modeling for speech recognition. We tried two possible sets of features derived from the full ... Using a stochastic context-free grammar as a lan- guage model for speech recognition. In Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Process- ing, pages 189–192. John Lafferty,...
  • 8
  • 409
  • 0
Tài liệu Báo cáo khoa học: "A Method for Correcting Errors in Speech Recognition Using the Statistical Features of Character Co-occurrence" pptx

Tài liệu Báo cáo khoa học: "A Method for Correcting Errors in Speech Recognition Using the Statistical Features of Character Co-occurrence" pptx

Ngày tải lên : 20/02/2014, 18:20
... correct string for the string between A and B in the Error-String (see figure 2-3). 3. Evaluation 3.1 Data Condition for Experiments Results of Speech Recognition: We used 4806 recognition ... integrating recognition and translation into a speech translation system, the development of the following processes is therefore important: (1) detection of errors in speech recognition results; ... to correct the errors in the results of speech recognition to increase the performance of a speech translation system. This paper proposes a method for correcting errors using the statistical...
  • 5
  • 588
  • 0
Tài liệu Báo cáo khoa học: "A Finite-Slate Parser for Use in Speech Recognition" pdf

Tài liệu Báo cáo khoa học: "A Finite-Slate Parser for Use in Speech Recognition" pdf

Ngày tải lên : 21/02/2014, 20:20
... invariants; allophonic variation is traditionally seen as problematic for recognition. (I) "In most systems for sentence recognition, such modifications must be viewed as a kind of 'noise' ... before the labial stop /p/, the cor9nal nasal/n/ before the coronal stop/t/, and the velar nasal/7// before the velar stop/k/. This constraint, like subject-verb agreement. poses a problem for ... This view of allophonic variation is representative of much of the speech recognition literature, especially during the ARPA speech project. One can find similar statements by Cole and Jakim~k...
  • 7
  • 420
  • 0
Tài liệu Báo cáo khoa học: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

Tài liệu Báo cáo khoa học: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

Ngày tải lên : 22/02/2014, 02:20
... 2.2.1) for the same number of queries. Also results from language modeling and speech recog- nition experiments favored statistical querying. 2.3 Web collections obtained For the speech recognition ... rates were 17.0 for En- glish, 18.7 for Spanish, and 22.5 for French. For English, we also created web mixture mod- els with KN smoothing. The error rates were 16.5, 15.9 and 15.7 for the 20 MB, ... Continuous speech recognition experiments are con- ducted on three languages: English, Span- ish, and French. The web data is utilized for augmenting in-domain LMs in general and for adapting...
  • 9
  • 301
  • 0
Báo cáo khoa học: "Vocabulary Decomposition for Estonian Open Vocabulary Speech Recognition" ppt

Báo cáo khoa học: "Vocabulary Decomposition for Estonian Open Vocabulary Speech Recognition" ppt

Ngày tải lên : 08/03/2014, 02:21
... problem Open vocabulary speech recognition refers to au- tomatic speech recognition (ASR) of continuous speech, or speech- to-text” of spoken language, where the recognizer is expected to recognize ... Technology for help in performing experiments with Estonian speech and text databases. This work was supported by the Academy of Finland in the project: New adaptive and learning methods in speech recognition. References Bernard ... Computational Model for Word-Form Recognition and Production. University of Helsinki, Helsinki, Finland. Tanel Alumäe. 2006. Methods for Estonian Large Vo- cabulary Speech Recognition. PhD Thesis....
  • 7
  • 377
  • 0
Báo cáo khoa học: "A Preference-first Language Processor Integrating the Unification Grammar and Markov Language Model for Speech Recognition-ApplicationS" potx

Báo cáo khoa học: "A Preference-first Language Processor Integrating the Unification Grammar and Markov Language Model for Speech Recognition-ApplicationS" potx

Ngày tải lên : 08/03/2014, 07:20
... vocabulary environment. An experimental system for Mandarin speech recognition has been implemented (Lee, 1990) and tested, in which a very high correct rate of recognition (93.8%) was obtained ... Unification Grammar and Markov Language Model for Continuous Speech Recognition. Proceedings of the IEEE 990 International Conference on Acoustics, Speech and Signal Processing, Albuquerque, NM, ... (1986). An Efficient Word Lattice Parsing Algorithm for Continuous Speech Recognition. Proceedings of the 1986 International Conference on Acoustic, Speech and Signal Processing, pp. 1569-1572....
  • 6
  • 392
  • 0
Báo cáo khoa học: "Grounded Language Modeling for Automatic Speech Recognition of Sports Video" doc

Báo cáo khoa học: "Grounded Language Modeling for Automatic Speech Recognition of Sports Video" doc

Ngày tải lên : 17/03/2014, 02:20
... Wactlar, H., Witbrock, M., Hauptmann, A., (1996 ). Informedia: News-on-Demand Experiments in Speech Recognition. ARPA Speech Recognition Workshop, Arden House, Harriman, NY. Witten, I. and Frank, ... evaluate the performance of our grounded language model on a speech recognition task using video highlights from Major League Baseball games. Results indicate improved per- formance using three ... transcription. For example, if the ASR output contains the term sequence “… and farther home run for David forty says…” and the closed captioning contains the sequence “…another home run for David...
  • 9
  • 395
  • 0
Báo cáo khoa học: "WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings" doc

Báo cáo khoa học: "WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings" doc

Ngày tải lên : 17/03/2014, 04:20
... this paper the best performing measures from (Pucher, 2005), which outperform baseline models on word prediction for conversational tele- phone speech are used for Automatic Speech Recog- nition ... conversational speech. The JCN (Sec- tion 2.1) measure performs best for nouns using the noun-context. The LESK (Section 2.1) measure per- forms best for verbs and adjectives using a mixed word-context. Text-based ... 129–132, Prague, June 2007. c 2007 Association for Computational Linguistics WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings Michael Pucher Telecommunications...
  • 4
  • 204
  • 0
Báo cáo khoa học: "Using Chunk Based Partial Parsing of Spontaneous Speech in Unrestricted Domains for Reducing Word Error Rate in Speech Recognition" potx

Báo cáo khoa học: "Using Chunk Based Partial Parsing of Spontaneous Speech in Unrestricted Domains for Reducing Word Error Rate in Speech Recognition" potx

Ngày tải lên : 23/03/2014, 19:20
... Spontaneous Speech in Unrestricted Domains for Reducing Word Error Rate in Speech Recognition Klaus Zechner and Alex Waibel Language Technologies Institute Carnegie Mellon University 5000 Forbes ... type 1453 data set eval21 test best expected performance WER gain +2.0 +0.5 +0.3 -4.9 Table 3: WER gain: best results in neural net experiments for two test sets (in absolute %) The ... representations were the only source of information for our reranking system, in addition to the internal scores of the speech recognizer. It can be expected that including more sources of knowledge,...
  • 7
  • 388
  • 0
Báo cáo khoa học: "Practical Issues in Compiling Typed Unification Grammars for Speech Recognition" ppt

Báo cáo khoa học: "Practical Issues in Compiling Typed Unification Grammars for Speech Recognition" ppt

Ngày tải lên : 31/03/2014, 04:20
... coorelate with poor recognition performance, and that size of the resuling language model does not appear to directly coorelate with recognition performance. We have developed new techniques for further ... Jan- uary 2001. World Wide Web Consortium (W3C). Speech Recognition Grammar Specification for the W3C Speech Interface Framework. http://www.w3.org/TR /speech- grammar, 2001. As of 3 January 2001. ... the input to the speech recognizer. In our experience, each of the compilation stages, as well as speech recognition itself, has the po- tential to lead to a combinatorial explosion that exceeds...
  • 9
  • 285
  • 0
Báo cáo khoa học: "User Simulations for context-sensitive speech recognition in Spoken Dialogue Systems" potx

Báo cáo khoa học: "User Simulations for context-sensitive speech recognition in Spoken Dialogue Systems" potx

Ngày tải lên : 31/03/2014, 20:20
... in- formation extracted from speech waveforms, to- gether with information derived from their speech recognizer, to automatically predict misrecog- nized turns in a corpus of train-timetable informa- tion ... is 18.17% and for the ’pos’ 8.9%, all in all a lot larger than for the other categories. 9.2 Second Layer: Re-ranker Experiments In these experiments we measure WER, DMA and SA for the system ... was initially performed (see table 3). As a result, we chose to use this grouping for the final 4 feature sets on which the classifier experiments were performed, in the following order: Experiment 1:...
  • 9
  • 277
  • 0
techniques  for  noise  robustness  in  automatic  speech  recognition

techniques for noise robustness in automatic speech recognition

Ngày tải lên : 03/05/2014, 20:50
... we now return to the subject of HMM-based speech recognition. To recap, we restate the Bayesian formulation for the speech- recognition problem: given a speech recording X,the sequence of words ˆw 1 , ... to consider is the representation of the speech signal itself. Speech recognition is not performed directly with the speech signal. The information in speech is primarily in its spectral content ... 168mm 26 Techniques for Noise Robustness in Automatic Speech Recognition /IH//S/ /NG//AO//NG/ /S/ Figure 2.9 Illustrating the composition of HMMs for word sequences from the HMMs for smaller units. Here,...
  • 500
  • 267
  • 0