TY - GEN
T1 - Genisletilmis DTK Yardimi ile Akustik Isaret Tabanli, Dilden Bagimsiz Bir Ses-Dudak Sekli Eslestirme Yontemi ve Gerceklemesi
AU - Cankurtaran, Halil Said
AU - Boyaci, Ali
AU - Yarkan, Serhan
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/10/5
Y1 - 2020/10/5
N2 - Processing human speech with the use of digital technologies leads to several important fields of research. Speech- to-text and lip-syncing are among the instances of relevant prominent research areas. In this regard, audio-visualization of acoustic signals, providing visual aid in real-time for disabled people, and realization of text-free animation applications are just to name a few. Therefore, in this study, a language-independent lip-sync method that is based on extended linear predictive coding is proposed. The proposed method operates on baseband electrical signal that is acquired by a standard single-channel off-the-shelf microphone and exploits the statistical characteristics of acoustic signals produced by human speech. In addition, the proposed method is implemented on an embedded system, tested, and its performance is evaluated. Results are given along with discussions and future directions.
AB - Processing human speech with the use of digital technologies leads to several important fields of research. Speech- to-text and lip-syncing are among the instances of relevant prominent research areas. In this regard, audio-visualization of acoustic signals, providing visual aid in real-time for disabled people, and realization of text-free animation applications are just to name a few. Therefore, in this study, a language-independent lip-sync method that is based on extended linear predictive coding is proposed. The proposed method operates on baseband electrical signal that is acquired by a standard single-channel off-the-shelf microphone and exploits the statistical characteristics of acoustic signals produced by human speech. In addition, the proposed method is implemented on an embedded system, tested, and its performance is evaluated. Results are given along with discussions and future directions.
KW - formant frequency
KW - linear predictive coding
KW - lip sync
UR - http://www.scopus.com/inward/record.url?scp=85100299991&partnerID=8YFLogxK
U2 - 10.1109/SIU49456.2020.9302377
DO - 10.1109/SIU49456.2020.9302377
M3 - Conference contribution
AN - SCOPUS:85100299991
T3 - 2020 28th Signal Processing and Communications Applications Conference, SIU 2020 - Proceedings
BT - 2020 28th Signal Processing and Communications Applications Conference, SIU 2020 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 28th Signal Processing and Communications Applications Conference, SIU 2020
Y2 - 5 October 2020 through 7 October 2020
ER -