Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống
1
/ 13 trang
THÔNG TIN TÀI LIỆU
Thông tin cơ bản
Định dạng
Số trang
13
Dung lượng
73,99 KB
Nội dung
Trường Đại học Duy Tân Introduction to information system TRƯỜNG ĐẠI HỌC DUY TÂN - - INTRODUCTION TO IFNORMATION SYSTEM Personal Project SPEECH RECOGNITION TECHNOLOGY Class: CMU IS 100 EIS – Year (2023-2024) Phạm Thị Ngọc Huyền Phần 2,4 Hồ Văn Thanh Phần 3,4 Bùi Thị Như Phần 1,4 I Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system Preface Technology has become an indispensable part of our modern life A presentation on the intersection of technology and contemporary living will enhance understanding of the latest technologies, how they can be utilized to address life's challenges, and the overall impact of technology on our lives Particularly, it can contribute to raising awareness about safeguarding personal information, ensuring cybersecurity, and understanding the associated risks Furthermore, it encourages creativity and development in the field of technology In addition, it provides insights into how technology influences the business landscape and creates new business opportunities Therefore, selecting a presentation topic on technology and modern life is crucial in today's era The technology we want to introduce brings significant value to modern life, and it is voice recognition technology To delve deeper into this technology, please follow the upcoming sections Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system I Introduction to Speech Recognition Technology What is speech recognition technology? Speech recognition technology is an advanced technology based on artificial intelligence, machine learning, and biometrics It is a technology that enables computers or smart devices to understand and respond to human natural language, converting human speech into text or vice versa Currently, this technology is developing remarkably and is considered a part of artificial intelligence This has opened up many opportunities and conveniences for our daily lives The Development of Speech Recognition Technology Speech recognition technology has undergone significant development in recent years, particularly due to advancements in machine learning and natural language processing Below are key points regarding the evolution of speech recognition technology: Advancements in Machine Learning Models: The development of machine learning models, especially deep learning models, has significantly contributed to the effectiveness of speech recognition technology Models such as Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNNs), and, notably, Deep Neural Networks (DNNs) and Deep Convolutional Networks (DCNNs) have improved the understanding and classification of speech Large and Diverse Data: The progress of speech recognition technology heavily relies on the utilization of large and diverse datasets for model training Having a sufficiently large and diverse dataset helps the model learn various speech variations and enhances generalization capabilities Wide Range of Applications: Speech recognition technology has found numerous applications in everyday life The speech recognition feature is increasingly integrated into mobile devices, voice-controlled systems, automatic translation, and various other applications, enhancing user convenience Localization of Technology for Vietnamese: In many countries, including Vietnam, the development of speech recognition technology has been Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system accompanied by the improvement of algorithms and models tailored to specific languages This presents challenges and opportunities for the development of speech recognition applications for local languages Enhancing Security and Voice-Controlled Management: Speech recognition technology is increasingly employed in security and management fields, such as identity verification, access control, and surveillance This plays a crucial role in improving the efficiency and safety of these systems Integration with Artificial Intelligence (AI) and Internet of Things (IoT): Speech recognition technology is progressively integrated with artificial intelligence and the Internet of Things to create smarter systems The combination of speech recognition with other technologies opens up new possibilities for interaction with machines and smart devices In conclusion, the development of speech recognition technology has provided numerous opportunities and potentials for improving user experiences and creating innovative applications across various fields Operation Mechanism of Speech Recognition Technology Speech recognition technology is part of the natural language processing (NLP) field and often utilizes machine learning methods to analyze and understand human speech Below is the basic operational mechanism of speech recognition technology: Data Collection: Firstly, a speech recognition system needs to be trained with a large amount of speech data This data includes speech samples from various individuals with variations in voice, context, and language Preprocessing: Speech data is typically preprocessed to eliminate noise and enhance essential information This may involve removing background noise, cleaning signals, and converting audio into a suitable format for processing Feature Extraction: From the preprocessed data, crucial features of speech need to be extracted This often involves using methods such as melfrequency cepstral coefficients (MFCCs) to represent important information about the frequency and amplitude of speech Machine Learning Models: Machine learning models, often deep learning models, are used to train on speech data represented by the extracted Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system features This model can be Recurrent Neural Networks (RNN), Convolutional Neural Networks (CNNs), or other deep learning architectures Model Training: The model is trained to learn how to represent and classify speech features During this process, the model is adjusted through backpropagation to minimize the error between predicted output and actual values Classification and Recognition: Once the model is trained, it can be used to classify and recognize speech from new inputs Speech recognition systems often have the capability to identify speakers, languages, and even specific words and phrases in a language Optimization and Deployment: After the model is trained and tested, it can be optimized and deployed in real-world applications, such as voicecontrolled systems, speech recognition systems on mobile phones, or in security applications Speech recognition technologies are increasingly popular and integrated into various fields to enhance user experience and augment the interaction capabilities between machines and humans Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system II The features of Speech Recognition Technology Speech recognition technology provides flexibility in time and location for health care professionals, helping to provide high-quality care services at various facilities It also improves productivity and efficiency for health care professionals by allowing them to write notes and reports directly into the computer, saving time Moreover, speech recognition technology improves the accuracy and completeness of medical records, reduces the risk of errors and enhances patient safety In addition, this technology also helps people with disabilities to easily access and enter data through voice, creating favorable conditions for communication and control In summary, speech recognition technology brings many important benefits in the field of health care and daily life Speech recognition technology has many useful features First, it has the ability to convert voice to text on electronic devices with high accuracy In addition, this technology can also be used in applications such as virtual assistants, automatic question answering systems, integration in location lists, and providing communication and interaction services in Vietnamese on technology platforms Speech technology also has the ability to recognize speech and organize information, helping users perform common tasks such as payment and transaction, account status check, and product/service information update In addition, speech technology is also integrated with speech biometric solutions to enhance security for users For example: Speech recognition feature allows the computer to recognize the speaker, language, dialect, emotion and intention This helps the computer understand the content and context of the voice, as well as generate appropriate responses Identify and distinguish the voices of different speakers in an audio clip, distinguish the language that the speaker is using in the recording, recognize the intention or purpose of the speaker from their speech Speech synthesis feature allows the computer to create natural and lively voice from text This helps the computer communicate with humans more easily and friendly Speech translation feature allows the computer to convert voice data from one language to another and help users create text from speech This helps the computer support humans in communicating with people who speak different languages Speech control device feature allows users to control devices, applications, and systems by using voice Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system Mood and attitude classification feature recognizes the mood of the speaker from the way they express themselves through tone and speech variation Security and authentication feature uses speech to authenticate user identity, especially in security systems Converting Speech to Text (Speech-to-Text): Speech recognition technology has the capability to convert spoken words into text with high accuracy, aiding users in note-taking, writing, or interacting with electronic devices through voice Virtual Assistants: Virtual assistant applications utilize speech recognition technology to understand and execute commands, questions from users, providing information, scheduling, and performing various tasks based on voice input Automatic Question Answering: Speech recognition technology can be integrated into automatic question-answering systems, delivering quick and efficient information Location Integration in Applications: This technology can be employed to search for locations, provide directions, and interact with location-based services through voice commands Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system III Benefits of Speech Recognition Technology Speech recognition technology brings about significant advantages Firstly, it helps us save time and effort in data input Instead of typing each character manually, we can speak, and the technology will automatically convert speech into text This is particularly useful for writing, sending messages, or searching for information on the internet Secondly, speech recognition technology enhances the ability to access information We can easily look up information, listen to news, or hear audiobooks without the need to read Finally, speech recognition technology contributes to creating a convenient and user-friendly experience We can control devices through voice commands, such as adjusting lights, opening doors, or controlling applications on smartphones This technology also brings numerous benefits to users, businesses, and society Here are some key benefits User Communication: Speech recognition technology allows users to communicate with devices and services using their voice, eliminating the need for keyboard, mouse, or touch input This saves time, effort, and enhances convenience for users For example, users can use voice commands to control smart home devices, make calls, send messages, open applications, or search for information Authentication and Security: Speech recognition technology can be used for user authentication and identity verification by utilizing their unique voice as a biometric factor This helps prevent attacks and ensures the security of systems For instance, users can use their voice to unlock phones, computer, or access bank accounts Customer Experience Improvement: Speech recognition technology can enhance the customer experience by automating the identification of the purpose of customer calls to helpline These systems can automatically route calls to the appropriate department or provide preliminary information, reducing wait times and increasing customer satisfaction Support for people with disabilities: Speech recognition technology is a useful tool for individuals with disabilities, especially those who cannot use a keyboard or mouse They can interact with computers and electronic devices through their own voice Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system Fraud prevention: In anti-fraud applications, speech recognition can help determine whether the speaker is legitimate, reducing the risk of fraud and security breaches Integration in the Automotive Industry: In smart automotive systems, speech recognition technology assists drivers and passengers in interacting with entertainment systems, controlling vehicle functions without the need for manual input Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system IV Applications off Speech Recognition Technology in Various Field Speech recognition technology can be applied across various industries in the field of Information Technology For instance: Computer Programming: In computer programming, speech recognition technology can assist programmers in saving time and effort by converting spoken words into code, facilitating efficient input Computer Science Research: In computer science research, speech recognition technology can be used to analyze and predict complex issues, aiding researchers in dealing with intricate computational problems System Installation and Support: In the installation and support of computer systems, speech recognition technology can enhance efficiency and convenience, providing a hands-free approach to system control Moreover, the technology can be applied in creating internal networks for organizations and businesses Some examples include: Finance, Banking, and Insurance: In these sectors, speech recognition technology can be utilized for customer authentication and identity security through voice recognition This facilitates quick and secure transactions, payment verifications, and account inquiries Consumer and Retail: Speech recognition technology can be used to create virtual assistants, assisting customers in voice-activated search, order placement, payment, and providing information, promotion, and recommendations tailored to customer preferences Real Estate: In the real estate industry, speech recognition technology can be applied to develop applications that help customers search, view, schedule appointments, and obtain information, reviews, and advice about properties using voice commands Travel and Hospitality: In the travel and hotel industry, speech recognition technology can be employed to create applications for voice- activated search, room booking, payment, and providing information, directions, and suggestions for destinations, activities, and travel 10 Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system Automotive Industry: In the automotive industry, speech recognition technology can be used to develop virtual assistants for vehicles, allowing drivers to control functions such as air conditioning, audio, navigation, etc., advice on the vehicle’s status, road conditions, traffic, etc Furthermore, speech recognition technology finds applications in various other field such as healthcare, education, and entertainment : Education: In education, speech recognition technology can assist students and teacher in learning and teaching tasks Students can use this technology to read books, complete assignments take tests, or interact with teachers Teachers can use it to lessons, lecture, grade assignments, or provide feedback Healthcare: In healthcare, speech recognition technology can aid doctors and patients in healthcare management Doctors can use it to record patient records, schedule appointment, or provide advice to patients Patients can use it to monitor their health, receive medical advice, or communicate with doctors Entertainment: In the entertainment industry, speech recognition technology can enhance user experience in enjoying various content Users can use this technology to control smart devices, play games, listen to music, watch movies, or read books through voice commands Currently, there are widely used voice recognition applications: Gboard Speech Recognition Software: The Gboard software (formerly Google Keyboard) supports over 120 different languages and integrates numerous powerful features such as voice input, animated GIF search, emojis, information lookup, and real-time message translation directly on the keyboard Importantly, the application also allows users to input text by swiping their fingers from one letter to another ListNote Speech-to-Text Notes: 11 Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system With the ListNote Speech-to-Text Notes voice recognition application, users can quickly create notes using voice input Compared to many other voice recognition applications, ListNote Speech-to-Text Notes is relatively easy to use, and all data (notes) are indexed for fast searching Users can also set a password or encrypt content according to AES standards to protect files from prying eyes V-IONE Voice Recognition Technology: V-IONE Voice Recognition Technology is software that converts speech into text and possesses many features, including high accuracy It performs well at a distance of up to meters with accuracy up to 98%, and at a distance of 3-5 meters, the accuracy is 95% It displays real-time results, supports the import of audio in various formats, and standardizes the output text with names, dates, and months The software includes a vocabulary of 7000 words in Vietnamese and optimizes voice recognition for the three regions: North, Central, and South 12 Đồ án Class: CMU IS 100 EIS Trường Đại học Duy Tân Introduction to information system Mục lục Preface I Introduction to Speech Recognition Technology II The features of Speech Recognition Technology III Benefits of Speech Recognition Technology IV Applications off Speech Recognition Technology in Various Field 13 Đồ án Class: CMU IS 100 EIS