Future speech interfaces with sensors and machine intelligence / / Bruce Denby, Michael Wand, Tamás Gábor Csapó [editors].

Speech is the most spontaneous and natural means of communication, as well as the preferred modality for interacting with mobile or fixed electronic devices, but speech in-terfaces have drawbacks, such as a lack of user privacy; non-inclusivity for certain users; poor robustness in noisy conditions;...

Full description

Saved in:
Bibliographic Details
TeilnehmendeR:
Place / Publishing House:Basel : : MDPI,, [2023]
©2023
Year of Publication:2023
Language:English
Physical Description:1 online resource
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 04624nam a2200337 i 4500
001 993599196704498
005 20230703072134.0
006 m o d
007 cr |||||||||||
008 230703s2023 sz o 000 0 eng d
020 |a 3-0365-6939-1 
035 |a (CKB)5700000000354499 
035 |a (NjHacI)995700000000354499 
035 |a (EXLCZ)995700000000354499 
040 |a NjHacI  |b eng  |e rda  |c NjHacl 
050 4 |a QA76.9.C65  |b .F88 2023 
082 0 4 |a 003.3  |2 23 
245 0 0 |a Future speech interfaces with sensors and machine intelligence /  |c Bruce Denby, Michael Wand, Tamás Gábor Csapó [editors]. 
264 1 |a Basel :  |b MDPI,  |c [2023] 
264 4 |c ©2023 
300 |a 1 online resource 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 |a Description based on publisher supplied metadata and other sources. 
520 |a Speech is the most spontaneous and natural means of communication, as well as the preferred modality for interacting with mobile or fixed electronic devices, but speech in-terfaces have drawbacks, such as a lack of user privacy; non-inclusivity for certain users; poor robustness in noisy conditions; and the difficulty of creating complex man-machine interfaces. The Special Issue "Future Speech Interfaces with Sensors and Machine Intelligence" assembles eleven contributions that cover multimodal and silent speech interfaces; lip reading applications; novel sensors for speech interfaces; and enhanced speech inclusivity tools for future speech interfaces. The articles make important improvements beyond the state of the art, advancing the state of the art to new frontiers in some cases. Short summaries of all articles, grouped by topic, are presented, followed by a global commentary and evaluation. 
505 0 |a Bruce Denby, Tam ´as G ´abor Csap ´o and Michael Wand -- Future Speech Interfaces with Sensors and Machine Intelligence -- Reprinted from: Sensors 2023, 23, 1971, doi:10.3390/s23041971 1 -- Wentao Yu, Steffen Zeiler and Dorothea Kolossa -- Reliability-Based Large-Vocabulary Audio-Visual Speech Recognition -- Reprinted from: Sensors 2022, 22, 5501, doi:10.3390/s22155501 7 -- Sanghun Jeon and Mun Sang Kim -- Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based -- Interaction Applications -- Reprinted from: Sensors 2022, 22, 7738, doi:10.3390/s22207738 27 -- Sanghun Jeon and Mun Sang Kim -- End-to-End Lip-Reading Open Cloud-Based Speech Architecture -- Reprinted from: Sensors 2022, 22, 2938, doi:10.3390/s22082938 55 -- Sanghun Jeon and Mun Sang Kim -- End-to-End Sentence-Level Multi-View Lipreading Architecture with Spatial Attention Module -- Integrated Multiple CNNs and Cascaded Local Self-Attention-CTC -- Reprinted from: Sensors 2022, 22, 3597, doi:10.3390/s22093597 77 -- Beiming Cao, Alan Wisler and Jun Wang -- Speaker Adaptation on Articulation and Acoustics for Articulation-to-Speech Synthesis -- Reprinted from: Sensors 2022, 22, 6056, doi:10.3390/s22166056 105 -- Tam´as G ´abor Csap ´o, G ´abor Gosztolya, L ´aszl ´o T´oth, Amin Honarmandi Shandiz and -- Alexandra Mark ´o -- Optimizing the Ultrasound Tongue Image Representation for Residual Network-Based -- Articulatory-to-Acoustic Mapping -- Reprinted from: Sensors 2022, 22, 8601, doi:10.3390/s22228601 121 -- David Ferreira, Samuel Silva, Francisco Curado and Ant ´onio Teixeira -- Exploring Silent Speech Interfaces Based on Frequency-Modulated Continuous-Wave Radar -- Reprinted from: Sensors 2022, 22, 649, doi:10.3390/s22020649 135 -- Sanghun Jeon, Ahmed Elsharkawy and Mun Sang Kim -- Lipreading Architecture Based on Multiple Convolutional Neural Networks for Sentence-Level -- Visual Speech Recognition -- Reprinted from: Sensors 2022, 22, 72, doi:10.3390/s22010072 153 -- Alan Wrench and Jonathan Balch-Tomes -- Beyond the Edge: Markerless Pose Estimation of Speech Articulators from Ultrasound and -- Camera Images Using DeepLabCut -- Reprinted from: Sensors 2022, 22, 1133, doi:10.3390/s22031133 173 -- Dan Oneat,a, Be ´ ˘ ata L ˝orincz, Adriana Stan and Horia Cucu -- FlexLip: A Controllable Text-to-Lip System -- Reprinted from: Sensors 2022, 22, 4104, doi:10.3390/s22114104 201 -- Laith H. Baniata, Isaac. K. E. Ampomah and Seyoung Park -- A Transformer-Based Neural Machine Translation Model for Arabic Dialects That Utilizes -- Subword Units -- Reprinted from: Sensors 2021, 21, 6509, doi:10.3390/s21196509 217. 
650 0 |a Computer simulation. 
776 |z 3-0365-6938-3 
700 1 |a Denby, Bruce ,  |e editor. 
700 1 |a Wand, Michael ,  |e editor. 
700 1 |a Csapó, Tamás Gábor,  |e editor. 
906 |a BOOK 
ADM |b 2023-07-08 12:21:07 Europe/Vienna  |f system  |c marc21  |a 2023-04-02 14:12:45 Europe/Vienna  |g false 
AVE |i DOAB Directory of Open Access Books  |P DOAB Directory of Open Access Books  |x https://eu02.alma.exlibrisgroup.com/view/uresolver/43ACC_OEAW/openurl?u.ignore_date_coverage=true&portfolio_pid=5345655520004498&Force_direct=true  |Z 5345655520004498  |b Available  |8 5345655520004498