Electroglottography based real-time voice-to-MIDI controller

Donati, Eugenio ORCID: https://orcid.org/0000-0002-0048-1858 and Chousidis, Christos ORCID: https://orcid.org/0000-0003-3762-8208 (2022) Electroglottography based real-time voice-to-MIDI controller. Neuroscience Informatics, 2 (2). p. 100041. ISSN 2772-5286

[thumbnail of Donati_and_Chousidis_2022_j.neuri._Electroglottography_based_real-time_voice-to-MIDI_controller.pdf]
Donati_and_Chousidis_2022_j.neuri._Electroglottography_based_real-time_voice-to-MIDI_controller.pdf - Published Version
Available under License Creative Commons Attribution.

Download (3MB) | Preview


Voice-to-MIDI real-time conversion is a challenging problem that comes with a series of obstacles and complications. The main issue is the tracking of the human voice pitch. Extracting the voice fundamental frequency can be inaccurate and highly computationally exacting due to the spectral complexity of voice signals. In addition, on account of microphone usage, the presence of environmental noise can further affect voice processing. An analysis of the current research and status of the market shows a plethora of voice-to-MIDI implementations revolving around the processing of audio signals deriving from microphones. This paper addresses the above-mentioned issues by implementing a novel experimental method where electroglottography is employed instead of microphones as a source for pitch-tracking. In the proposed system, the signal is processed and converted through an embedded hardware device. The use of electroglottography improves both the accuracy of pitch evaluation and the ease of voice information processing; firstly, it provides a direct measurement of the vocal folds' activity and, secondly, it bypasses the interferences caused by external sound sources. This allows the extraction of a simpler and cleaner signal that yields a more effective evaluation of the fundamental frequency during phonation. The proposed method delivers a faster and less computationally demanding conversion thus in turn, allowing for an efficacious real-time voice-to-MIDI conversion.

Item Type: Article
Identifier: 10.1016/j.neuri.2022.100041
Keywords: Electroglottography, Bioimpedance Measurements, EGG-to-MIDI, voice-to- MIDI, Voice Information Retrieval, Real-time audio conversion
Subjects: Construction and engineering > Digital signal processing
Computing > Intelligent systems
Construction and engineering > Sound engineering
Music > Music/audio technology
Related URLs:
Depositing User: Christos Chousidis
Date Deposited: 09 May 2022 12:58
Last Modified: 06 Feb 2024 16:10
URI: https://repository.uwl.ac.uk/id/eprint/9005


Downloads per month over past year

Actions (login required)

View Item View Item