Direct voice input

From HandWiki
Short description: Style of human–machine interaction

Direct voice input (DVI), sometimes called voice input control (VIC), is a style of human–machine interaction "HMI" in which the user makes voice commands to issue instructions to the machine through speech recognition.

In the field of military aviation, DVI has been introduced into the cockpits of several modern military aircraft, such as the Eurofighter Typhoon, the Lockheed Martin F-35 Lightning II, the Dassault Rafale, the KF-21 Boramae and the Saab JAS 39 Gripen. Such systems have been also been used for various other purposes, including industry control systems and speech recognition assistance for impaired individuals.

Overview

DVI systems can be divided into two major categories of functionality: "user-dependent" or "user-independent". A user-dependent system requires that a personal voice template to be generated for a specific person; the template for this individual has to be loaded onto their assigned machine prior to use of the DVI system for it to function properly. In contrast, a user-independent system does not require any personal voice template, being intended to respond correctly to the voice of any user.[1] They can also be categorised between "discrete recognition" and "continuous recognition". Users of a discrete recognition system must pause between each word so that the DVI system can identify the separations between each word, while a continuous speech recognition system is capable of understanding a normal rate of speech.[2]

During the mid-2000s, researchers at the National Aerospace Laboratory in the Netherlands examined the use of DVI in the "GRACE" simulator; a total of twelve pilots participated in the ensuing experiment. The tests performed reportedly revealed that, while the hardware itself functioned well, several improvements were desirable prior to real-world deployment on aircraft since DVI operations actually consumed more time in comparison to traditional existing methods. Recommendations for improvements included the adoption of simpler syntax, the achievement of a greater recognition rate, and a decrease in response times; all of the issues encountered were determined to be of a technological nature, and were deemed feasible to resolve. The researchers concluded that in cockpits, especially during emergencies where pilots have to operate entirely on their own, a DVI system could be highly relevant, but that it was not of crucial importance during most other conceivable scenarios.[3]

Around the same time, evaluations of DVI systems for civil aviation purposes were conducted within the framework of Project SafeSound, coordinated by the European Union. It involved the observation of pilot workloads in real-world cockpits and contrasting them against pilot activity in flight simulators using both conventional systems and DVI assistance. The project aimed to enhance aviation safety and to decrease the workload in both ground and flight operations via the application of enhanced audio functions.[4]

Applications

Aviation

Prior to its widespread deployment, a handful of conventional military aircraft were converted to trial DVI systems; examples include the Harrier AV-8B and F-16 VISTA. In another case, an General Dynamics F-16 Fighting Falcon simulator was modified with DVI for a voice control study that was undertaken by the Royal Netherlands Air Force .[5] DVI trials have also been conducted on helicopters, including the Boeing AH-64 Apache, showing the potential to improve flight safety and mission effectiveness.[6][7]

Numerous modern fighter aircraft have been outfitted with DVI systems, often in combination with various other man-machine interface schemes, such as HOTAS-compliant controls and other advanced control technologies. The combination of Voice and HOTAS control schemes has sometimes been referred to as the "V-TAS" concept. A prominent fighter aircraft to be furnished with a V-TAS cockpit is the Eurofighter Typhoon.[8] The Lockheed Martin F-35 Lightning II also features a DVI system, which was developed by Adacel.[9] Other examples includes the Dassault Rafale and the Saab JAS 39 Gripen.[5]

Numerous aircraft have been planned to use DVI. At one stage, the United States Air Force had sought to integrate DVI upon the Lockheed Martin F-22 Raptor; however, the technology was eventually judged to pose too many technical risks at that point in time, and thus such efforts were abandoned.[10]

Personal

By 1990, working prototypes of speech recognition systems were being demonstrated; these were being promoted for the purpose of providing an effective man-machine interface for individuals with impaired speech.[11] Techniques employed included time-encoded digital speech and automatic token set selection. Investigations of these early DVI systems reportedly included the use of automatic diagnostic routines and limited-scale trials using volunteers.[11]

During the 2010s, various companies were offering voice recognition systems to the general public in the form of personal digital assistants.[2] One example is the Google Voice service, which allows users to pose questions via a DVI package installed on either a personal computer, tablet, or mobile phone. Numerous digital assistants have been developed, such as Amazon Echo, Siri, and Cortana, that use DVI to interact with users.[2]

Commercial

DVI technology has enabled automated telephone systems to be widely deployed. Many companies commonly use centralised phone systems that route callers to the correct department via such methods.[2] Various car manufacturers have also furnished their road vehicles with DVI systems; these typically allow drivers to control infotainment systems and interact with mobile phones with more convenience than legacy methods.[2]

During the late 1980s, investigations into the use of DVI systems for controlling CNC Machines and other manufacturing apparatus were underway.[12] During the 2010s, such systems were being used for logistics and warehouse management purposes.[13]

References

  1. Jarrett, D.N. (2005), Cockpit Engineering. Aldershot, Ashgate Publishing Limited ISBN:978-0-7546-1751-8.
  2. 2.0 2.1 2.2 2.3 2.4 "Voice recognition". computerhope.com. 16 May 2020. https://www.computerhope.com/jargon/v/voicreco.htm. 
  3. Zon, G.D.R and Roerdink, M.I., (2007), Using Voice to Control the Civil Flightdeck, Report NLR-TP-2006-720, (This report is based on a presentation held at the HCI Aero, Seattle, USA, 20–22 September 2006).[1]
  4. Steen, Weber; Jette, Lundtang Paulsen (8–10 May 2006). "Workshop on future control station designs and human performance issues in nuclear power plants". International Atomic Energy Agency (IAEA). pp. 296–320. https://inis.iaea.org/search/search.aspx?orig_q=RN:39091026. 
  5. 5.0 5.1 Gibbon, D,, Mertins, I. and Moore, R.K. (2000) "Handbook of Multimodal and Spoken Dialogue Systems Resources, Terminology and Product Evaluation" (The Springer International Series in Engineering and Computer Science, Vol. 565), Massachusetts, Kluwer Academic Publishers ISBN:978-0-7923-7904-1
  6. Hunter, Donald James (December 2003). "Identification of aircrew tasks for using direct voice input (DVI) to reduce pilot workload in the AH-64D Apache Longbow". Masters Theses (trace.tennessee.edu). https://trace.tennessee.edu/utk_gradthes/5239/. 
  7. Herdman, Chris; Johannsdottir, Kamilla (January 2001). "Attentional benefits and costs associated with integrating a direct voice input (DVI) system into a multi-crew helicopter environment". https://www.researchgate.net/publication/228401357. 
  8. Owen, Paul S. "Eurofighter cockpit." Eurofighter-typhoon.co.uk 7 December 1997. Retrieved: 28 November 2009.
  9. Schutte, John (10 October 2007). "Researchers fine-tune F-35 pilot-aircraft speech system". US Air Force. http://www.afmc.af.mil/news/story.asp?id=123071564. 
  10. Goebel, Greg. "The Lockheed Martin F-22 Raptor." airvectors.net, 1 July 2011. Retrieved: 10 November 2012.
  11. 11.0 11.1 "A direct voice input man-machine interface strategy to provide voice access for severely impaired speakers". UK IT 1990 Conference. 1990. pp. 279–285. https://ieeexplore.ieee.org/document/114301. 
  12. J.Y. Zhu; C.G. Xu (7 January 1988). "An Investigation on Voice Input System for CNC Machine Tools". Cirp Annals (Nanjing Aeronautical Institute) 37: 477–480. doi:10.1016/S0007-8506(07)61681-3. https://www.sciencedirect.com/science/article/abs/pii/S0007850607616813. 
  13. "Voice Directed Applications". lucasware.com. https://www.lucasware.com/jennifer-voice-picking/. 

External links