Hướng phát triển

Kết hợp với nhận dạng thanh điệu Tiếng Việt để tăng số lượng từ nhận dạng. Xây dựng mô hình ngôn ngữ để nâng kết quả nhận dạng trong nhận dạng

KHOA CNTT –

ĐH KHTN

Kết hợp thêm mô hình tương thích giọng người nói để có thể xây dựng hệ

nhận dạng độc lập người nói.

Xây dụng thuật giải huấn luyện tăng cường để mỗi khi tăng lượng dữ liệu học mẫu sẽ không tốn thời gian huấn luyện lại từđầu.

KHOA CNTT –

ĐH KHTN

TÀI LIỆU THAM KHẢO

[1] Mathew Magimai. Doss, Todd A. Stephenson, Herv Bourlard, and Samy Bengio, Phoneme-Grapheme Based Speech Recognition System, Dalle Molle Institute for Artificial Intelligence

[2] Ahmed M. Abdelatty Ali et al, An Acoustic-phonetic feature-based system

for automatic phoneme recognition in continuous speech, Dept. of Electrical

Engineering, University of Pennsylvania

[3] Guido Aversano et al, A New Text-Independent Method for Phoneme

Segmentation, a Dipartimento di Fisica “E.R. Caianiello”, Universit`a di Salerno.

[4] Youngjoo Suh and Youngjik Lee, Phoneme segmentation of continuous

speech using multi-layer perceptron, Electronics and Telecommunications Research

Institute, 1996.

[5] Jesper Salomon , Support Vector Machines for Phoneme Classification,

Master of Science, school of Artificial Intelligence, Division of Informatics, University of Edinburgh, 2001.

[6] Barbara Resch, Gaussian Statistics and Unsupervised Learning, A Tutorial

for the Course Computational Intelligence, Signal Processing and Speech Communication Laboratory, IEEE International Workshop on Circuits and System. [7] Giuseppe Riccardi and Dilek Hakkani , Active And Unsupervised Learning For Automatic Speech Recognition, AT&T Labs-Research

KHOA CNTT –

ĐH KHTN

[8] Steve Young, HTKBook (version 3.2), Cambridge University Engineering

Department.

[9] Lawrence R. Rabiner, A tutorial on Hidden Markov Models and Selected Applications in Speech Recognition, IEEE, 1989

[10] Đinh Lê Thư - Nguyễn Văn Huệ, Cơ cấu ngữ âm Tiếng Việt, NXB Giáo Dục, 1998

[11] Thái Hùng Văn, Luận án thạc sĩ, khoa Công nghệ thông tin, Đại học Khoa học Tự nhiên, 2000

[12] Đỗ Xuân Đạt – Võ Văn Tuấn, Luận văn tốt nghiêp khoa Công nghệ thông tin,

Đại học Khoa học Tự nhiên, 2003

[13] Tarun Agarwal, Pre-Processing of Noisy Speech for Voice Coders,

Department of Electrical & Computer Engineering, McGill University, Montreal, Canada, January 2002

[14] Ing Yann Soon and Soo Ngee Koh, Speech Enhancement Using 2-D Fourier

Transform, IEEE Transactions On Speech And Audio Processing, VOL. 11, NO. 6,

NOVEMBER 2003 (pages 717 through 724)

[15] John H.L. Hansen, Bryan Pellom, An Effective Quality Evaluation Protocol

For Speech Enhancement Algorithms, ICSLP-98: Inter. Conf. On Spoken Language

Processing, Sydney, Australia, Nov. 30 – Dec. 4, 1998.

[16] Hagai Attias John C. Platt Alex Acero Li Deng, Speech Denoising and

Dereverberation Using Probabilistic Models, Microsoft Research

[17] James G. Droppo III , Time-Frequency Features For Speech Recognition, Doctor of Philosophy thesis, University of Washington, 2000

KHOA CNTT –

ĐH KHTN

[18] C.J.Long and S.Datta, Wavelet Based Feature Extraction for Phoneme Recognition, Department of Electronic and Electrical Engineering, Loughborough

University of Technology

[19] Amit Juneja and Carol Espy-Wilson, Segmentation Of Continuous Speech

Using Acoustic-Phonetic Parameters And Statistical Learning, ECE Department,

University of Maryland

[20] Ryuta Terashima, Estimating Speech-Recognizer Performance Based on Log-Likelhood Difference Distribution of Word-Pairs, Special Issue Speech-Based

Sơ lược về âm tiết tiếng Việ t

Mỗi âm tiết có thanh điệu riêng