The machine also speaks - revealing the voice semantic recognition technology | |
Publish time:2023-04-11 Reads: | |
You may not know what Nuance does, but you must know that the iPhone 4S brought a revolutionary human-computer interaction product - "Siri". In fact, in fact, Nuance is the technology provider of Siri. As the world's leading provider of voice and language solutions, Nuance is known as the owner of the T9 input method, which is currently used by more than 90% of cell phones worldwide and was originally developed by a company called Tegic Communications, which was later bought by Nuance. Nuance has also recently acquired Swype, a sliding input method company. On May 12, Nuance held the "Nuance Mobile Forum 2012" in Shenzhen, where the company, which used to be "invisible" behind major international companies, appeared in China, attracting the interest of many manufacturers in Shenzhen. At the conference, Nuance showed three different videos, including the well-known Siri commercial, another one is Nuance's own "Sound Dragon" series of products, and the last one is the application of voice technology combined with gesture control to the smart TV scene. Through the short film, Nuance not only shows the latest developments and trends in the development of voice technology, but also shows the key point that many international companies attach importance to - user experience, which will be the killer app to win in the future competition. Nuance's goal is to help these companies improve the user experience. The emergence of voice technology has significantly changed the way people and machines interact, but if you have studied Siri, you will find that voice recognition (ASR) technology only accounts for 20% of the total, and the real importance is semantic recognition technology. Semantic recognition can help users to search for the desired results more accurately. This technology is based on Statistic Language model (language model statistics), which requires a large amount of data to improve the search results. Also for natural language understanding (NLU technology), data from databases are needed for grammar collection. "The more data we have, the easier it is to help us match what users need, understand their intentions, and translate intentions into actions, and the effect ultimately depends on the quality of the data itself." said Yuqing Zheng, general manager of Nuance Greater China. Nuance's Dragon Go, known to users for its huge downloads on two mobile application platforms - Apple App Store and Android Market in the U.S. Dragon Go combines Nuance's Voice Dragon speech recognition and natural language understanding technology with artificial intelligence technology to significantly simplify the experience of searching for mobile content. As a result, users can get the content they want with just their voice and spend more time browsing rather than finding online content. In other words, Dragon Go understands what the user is saying and what the user is trying to say. Users simply say a simple phrase and their favorite and most relevant content providers are displayed, making it easy to get restaurant reviews, buy movie tickets, watch streaming movies and TV shows, shop online, find directions, listen to their favorite music, and book hotels through Expedia. |
|
Previous:World Health/ADI Automotive and Industrial Instrumentation Technology Seminar | |
Next:Innovative MCU Applications in Next-Generation Vehicles |