Voice portal

From HandWiki
Short description: Web portals where information is accessed via voice commands


Voice portals are the voice equivalent of web portals, giving access to information through spoken commands and voice responses. Ideally a voice portal could be an access point for any type of information, services, or transactions found on the Internet. Common uses include movie time listings and stock trading. In telecommunications circles, voice portals may be referred to as interactive voice response (IVR) systems, but this term also includes DTMF services. With the emergence of conversational assistants such as Apple's Siri, Amazon Alexa, Google Assistant, Microsoft Cortana, and Samsung's Bixby, Voice Portals can now be accessed through mobile devices and Far Field voice smart speakers such as the Amazon Echo and Google Home.[1]

Advantages

Voice portals have no dependency on the access device; even low end mobile handsets can access the service. Voice portals talk to users in their local language and there is reduced customer learning required for using voice services compared to Internet/SMS based services.[2]

A complex search query that otherwise would take multiple widgets (drop down, check box, text box filling), can easily and effortlessly be formulated by anyone who can speak without needing to be familiar with any visual interfaces. For instance, one can say, "Find me an eyeliner, not too thick, dark brown, from Estee Lauder MAC, that's below thirty dollars" or "What is the closest liquor store from here and what time do they close?"

Limitations

Voice is the most natural communication medium, but the information that can be provided is limited compared to visual media.[3]

For example, most Internet users try a search term, scan results, then adjust the search term to eliminate irrelevant results. They may take two or three quick iterations to get a list that they are confident will contain what they are looking for. The equivalent approach is not practical when results are spoken, as it would take far too long. In this case, a multimodal interaction would be preferable to a voice-only interface.

Trends

Live-agent and Internet-based voice portals are converging, and the range of information they can provide is expanding.

Live-agent portals are introducing greater automation through speech recognition and text-to-speech technology, in many cases providing fully automated service, while automated Internet-based portals are adding operator fallback in premium services. The live-agent portals, which used to rely entirely on pre-structured databases holding specific types of information are expanding into more free-form Internet access, while the Internet-based portals are adding pre-structured content to improve automation of the more common types of request.[example needed]

Speech technology is starting to introduce Artificial Intelligence concepts that make it practical to recognise a much broader range of utterances, learning from experience. This promises to make it practical to greatly improve speaker recognition rates and expand the range of information that can be provided by a voice portal.

Technology providers

A number of web-based companies are dedicated to providing voice-based access to Internet information to consumers. Quack.com launched its service in March 2000 and has since obtained the first overall voiceportal patent.[4][5] Quack.com was acquired by AOL in 2000 and relaunched as AOL By Phone later that year. Tellme Networks was acquired by Microsoft in 2007. Nuance, the dominant provider of speech recognition and text-to-speech technology, is starting to deliver voice portal solutions. Other companies in this space include TelSurf Networks, FonGenie, Apptera and Call Genie.

Apart from public voice portal services, a number of technology companies, including Alcatel-Lucent, Avaya, and Cisco, offer commercial enterprise-grade voice portal products to be used by companies to serve their clients. Avaya also has a carrier-grade portfolio.

See also

References

  1. "Launch your Voice First Portal – WITLINGO" (in en-US). http://www.witlingo.com/voicefirstportal/. 
  2. Nass, Clifford; Brave, Scott (2007-02-23) (in en). Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship. Cambridge Mass.: The MIT Press. ISBN 9780262640657. 
  3. Bouzid, Dr Ahmed; Ma, Dr Weiye (2013-08-27) (in en). Don't Make Me Tap!: A Common Sense Approach to Voice Usability. CreateSpace Independent Publishing Platform. ISBN 9781492196518. 
  4. "System and method for voice access to internet-based information". http://www.freepatentsonline.com/6510417.html. 
  5. "System and method for voice access to internet-based information". 2008. http://www.google.com/patents?id=Z1kOAAAAEBAJ&dq=Steven+Woods. 

External links