Recognize VoiceXML - an old article.

xiaoxiao2021-03-06  52

Huang Weifeng 2001/11/22

With the development of CTI (Computer and Telephone Integrated) technology and voice technology, IBM, Lucent, AT & T and Motorola's four major communications companies have set up VoiceXML (Voice Extensible Markup Language), which enables users to make users by using this new language Access the Internet by phone and speech. This technology can help companies, telecommunications companies, and internetline companies increase network usage, improve user loyalty, develop new markets, and enhance the competitiveness of enterprises.

VoiceXML origin

Voice XML is a new XML Schema used to develop the content of the content through voice conversation and its interactive voice response. In early 1999, IBM, Motorola, Lucent, and AT & T set up a Voice XML Forum to coordinate existing voice technology to allow access to the Internet via sound and telephone. Voice technology not only allows those who cannot use the graphic browser due to environment or physiological restrictions, but also provide more convenient web access features for all users. New voice technology can create dialog-driven applications, such as speech recognition technology (ASR), speech synthesis technology (TTS), and recording and playback digit speech on PC and servers (distributed to client devices). Voice XML provides a technical language that can be used in voice applications. These applications proceed with the back-end service and process mechanism to the front-end Voice XML-based representation. For example, a well-designed Web site can easily support voice-driven browsers (such as that you are likely to use on your mobile phone), and it can support other browsers (such as a WAP browser or HTML Browser). When receiving the initial request from the browser, the server will monitor the type of browser. If the browser is confirmed as a voice browser, the server will return the corresponding Voice XML page. Due to the rapid development of voiveXML technology and voice technology, more than 150 companies and organizations have added to the Voice XML (http://www.voicexml.org) Forum, including some very well-known communications companies such as AT & T, Lucent, Motorola Alcatel, Cisco, Hitachi, and my country's Huawei Communications Company.

VoiceXML system structure and application examples

VoiceXML 1.0 Specifies W3C-based industrial standard XML, providing a smart API for developers, service providers, and equipment manufacturers for voice and telephone applications. VoiceXML standards simplifies the creation of personalized interfaces with voice-sounding services on the Web, enabling people to access information and services on the website via voice and calls while with CGI (Perl, PHP, C, Java servlet, etc.) Combined with the background database, access the enterprise internal network, and eventually combine the voice browser with the micro browser to achieve the perfect combination of computer network and telephone technology. The specific system structure is as follows:

From the figure we can see that compared with the traditional Internet site, you can add a VoiceXML server to add a VoiceXML server to the Internet. In the VoiceXML server, the VoiceXML interpretation (VoiceXML interpretation language), VoiceXML comes with browser, automatic speech recognition (ASR), and text-to-speech (TTS) conversion devices. The VoiceXML interpreter is a computer program that explains a voiceXML file, booting and controlling the interaction between users and execution platforms. VoiceXML Interpretation is also a computer program that explains a VoiceXML file with a VoiceXML interpreter and can interact with the VoiceXML interpretation with the execution platform. The specific process is shown below: For example, the user wants to know the current stock price of Intel, call to the company that provides the service, mapped by DN-URL, to the VoiceXML server of the site, VoiceXML server The corresponding VoiceXML file is called, and the program is processed by the VoiceXML, and the voice output is generated by TTS to reply to the user's request. Of course, during the process of processing the VoiceXML file, sometimes it is necessary to hand over the CGI program processing of the background, and the resulting result is handed over from the Web Server to the VoiceXML Server processing. After processing, the user may hear the answer is "Welcome to the stock market, Which stock price would you like to know?", Its corresponding VoiceXML file is Welcome.vxml (see Resources). Users only need to answer the stocks interested in him, such as "Intel", and get rid of a large pile of traditional IVR's fuzzy. At this time, the user's answer passes the process of VoiceXML Server, handed over the price of the INTEL stock in the background of the CGI program in the database, and the user can be "$ 55" in the processing of the VoiceXML Server.

VoiceXML features and scope of application

VoiceXML is a tag language, mainly with the following features: 1. VoiceXML as a multiple interaction specified by each file, minimizes interaction between client / servers 2. Implement application developers and low-level Software and hardware details on the software and system platforms are independent. Cross different execution platforms. For content service providers, tool providers, and platform providers, VoiceXML is a public language. 5. Make simple interactions very easy to use, requiring the voice interface provided to support complex dialogueXML language to implement human-computer interaction communication through voice response systems, including: Synthetic Voice Output (TTS), audio file Output, identification of voice input, DTMF input identification, voice input recording, telephone function like call transfer, etc. VoiceXML provides characters and voice input collections, and assigns the request variable assigned to the file definition, and makes a decision after the user answers. VoiceXML determines that the file may be connected to another file via a general resource marker (URI). VoiceXML has a wide range of applications in the following areas. 1. Acquisition of information. Such as stock information, weather conditions, sports news, traffic information, etc. 2. Electronic transactions (including e-commerce, electronic retail). Such as bank account query, access, stock trading, etc. 3, service in telecommunications. Such as Unified Message, Call Center, etc. Some product introductions about VoiceXML

1, IBM IBM mainly developed ViaVoice's VoiceXML server, as well as the VoiceXML development kit, which can be combined with WebSphere to implement the perfect combination of computer networks. But the server only supports English, French and German. 2, Motorola Motorola also has its own voiceXML gateway and development pack for developing VoiceXML. But do not support Chinese. 3, Nuance Nuance is a manufacturer specializing in developing voice. He has a set of tools for development and architecture VoiceXML. These include Voice Web Server, V-Builder (Makeup Tools for VoiceXML), Secure Verifier. In particular, he has won many market points for more than 20 languages ​​(including Chinese and Cantonese), and its excellent stability has won many markets, including American Airlines, Bell Atlantic, UPS and other companies have become his customers.

Tianji Net

转载请注明原文地址:https://www.9cbs.com/read-83096.html

New Post(0)