333
Views
6
CrossRef citations to date
0
Altmetric
Research on Products and Devices

Human-computer multimodal interface to internet navigation

ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Pages 807-820 | Received 30 Jan 2020, Accepted 18 Jul 2020, Published online: 30 Jul 2020
 

Abstract

Purpose

Propose and develop new multimodal interfaces that allow people with motor impairments to control mass use applications in a natural way through gestures and voice.

Methods

A multimodal interaction interface was developed for using Google Chrome, Gmail and Facebook applications through gestural and verbal commands. The interface activates mouse and keyboard commands from the processing of voice signals and videos of the user’s head movement. The interface does not disable traditional keyboard and mouse functions; moreover, it only requires a webcam and a microphone, which are usually built into portable computers.

Results

The tests were performed on three groups of people: young adults, older adults and people with motor impairments. The verbal interaction was tested on a total of 189 voice commands with an average performance of 75.7% and a total of 105 dictations. The dictations had an average of 13 words; the system had a performance of 81.1%. Moreover, the gestural interaction activated 126 commands without errors using a drop-down menu; a click was activated 84 times with a success rate of 70.2%.

Conclusion

The motor impairments group especially valued the option of using Google Chrome, Gmail and Facebook without physically manipulating a mouse and keyboard. This group showed a greater preference for verbal control than for gestural control. An adaptation period is required for the adults group to acquire greater skill in using the interface. The young adults group preferred the types of interactions they are accustomed to due to their familiarity with Information and Communication Technologies (ICT); they considered the interface fun.

    Implication for rehabilitation

  • Computers and their applications were conceived with unnatural interaction mechanisms, such as the keyboard and mouse, which prevent their use by people with psychophysiological limitations or digital literacy. For this reason, the need arises to design new natural interfaces commanded by gestures and voice.

  • It is necessary to develop low-cost interfaces that can control mass-use applications such as Google Chrome, Facebook and Gmail, which do not require additional hardware using webcams and microphones, which are usually integrated into laptops.

  • The development of these multimodal interfaces improves the quality of life of people with motor impairments, allowing them to have access to Information and Communication Technologies (ICT).

Acknowledgment

The authors thank the Graduate Program of the School of Electrical and Electronic Engineering and the Perception and Intelligent Systems (PSI, in Spanish) research group of the Universidad del Valle.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was financed with resources of the Internal Call to Support the 2017 Graduate Students of the Universidad del Valle [code CI-2940].

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 65.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 340.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.