We are working on microphone array processing and distant speech recognition,
aiming to create hands-free, voice-enabled interfaces for home automation control.
The user will be able to control appliances and perform actions without
having to move from his/her place, by using their voice. For this purpose,
microphone array processing is employed, with microphones placed on walls and ceiling.
Our research is focused on acoustic speaker localization, voice activity detection, acoustic event detection,
speech enhancement/beamforming, activation keyword spotting and distant speech recognition.
We have also collected a distant speech database in Greek that is publicly available:
ATHENA database
|
- Panagiotis Giannoulis, Alessio Brutti, Marco Matassoni, Alberto Abad, Athanasios Katsamanis, Miguel Matos, Gerasimos Potamianos
and Petros Maragos
Multi-room speech activity detection using a distributed microphone network in domestic environments,
Proc. European Signal Processing Conf. (EUSIPCO-2015), Nice, France, Sep. 2015.
- Z. I. Skordilis, A Tsiami, P. Maragos, G. Potamianos, L. Spelgatti and R. Sannino,
Multichannel Speech Enhancement Using MEMS Microphones
, Proc. IEEE Int'l Conf. on Acoustics, Speech, and Signal Processing (ICASSP-2015), Brisbane, Australia, Apr. 2015.
- A. Tsiami, I. Rodomagoulakis, P. Giannoulis, A. Katsamanis, G. Potamianos and P. Maragos,
ATHENA: A Greek Multi-Sensory Database for Home Automation Control
, Proc. 15th Annual Conf. of International Speech Communication Association (INTERSPEECH-2014), Singapore, Sep. 2014.
- P. Giannoulis, G. Potamianos, A. Katsamanis and Petros Maragos,
Multi-Microphone Fusion for Detection of Speech and Acoustic Events in Smart
Spaces,
Proc. 22th European Signal Processing Conference (EUSIPCO-2014), Lisbon, Portugal, Sep. 2014.
- A. Tsiami, A. Katsamanis, P. Maragos and G. Potamianos
Experiments In Acoustic Source Localization Using Sparse Arrays In
Adverse Indoors Environments
, Proc. 22th European Signal Processing Conference (EUSIPCO-2014), Lisbon, Portugal, Sep. 2014.
- P. Giannoulis, A. Tsiami, I. Rodomagoulakis, A. Katsamanis, G. Potamianos, P. Maragos, The ATHENA-RC system for speech activity detection and speaker localization in the DIRHA smart home, Proc. 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA-2014).
- A. Katsamanis, I. Rodomagoulakis, G. Potamianos, P. Maragos and A. Tsiami,
Robust Far-Field Spoken Command Recognition for Home Automation
Combining Adaptation and Multichannel Processing
, Proc. Int'l. Conf. on Acoustics, Speech and Signal Processing (ICASSP-2014), Florence, Italy, May 2014.
-
I. Rodomagoulakis, G. Potamianos and P. Maragos,
Advances In Large Vocabulary Continuous Speech Recognition In Greek: Modeling And Nonlinear Features,
Proc. 21st European Signal Processing Conference (EUSIPCO-2013), Marrakech, Morocco, Sep. 2013.
-
I. Rodomagoulakis, P. Giannoulis, Z.-I. Skordilis, P. Maragos and G. Potamianos,
Experiments on Far-field Multichannel Speech Processing in Smart Homes,
Proc. 18th Int’l Conf. Digital Signal Processing (DSP-2013), Santorini, Greece, July 2013.
|
Some of our tools are publicly available via GitHub:
Multi-channel speech enhancement
Please cite:
Z. I. Skordilis, A. Tsiami, P. Maragos, G. Potamianos, L. Spelgatti and R. Sannino,
Multichannel Speech Enhancement Using MEMS Microphones,
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing 2015, Brisbane, Australia, (ICASSP-2015).
and
S. Lefkimmiatis and P. Maragos,
A Generalized Estimation for Linear and and Nonlinear Microphone Array Post-Filters
Speech Communication, vol.49, pp.657-666, 2007.
Sweet Home Listen: A distant speech recognition system for home automation control
Please cite:
A. Katsamanis, I. Rodomagoulakis, G. Potamianos, P. Maragos and A. Tsiami, Robust Far-Field Spoken Command Recognition for Home Automation
Combining Adaptation and Multichannel Processing
, Proc. Int'l. Conf. on Acoustics, Speech and Signal Processing (ICASSP-2014), Florence, Italy, May 2014.
|
We have collected a real distant speech corpus in Greek that is publicly available.
The description and reference for the database is:
A. Tsiami, I. Rodomagoulakis, P. Giannoulis, A. Katsamanis, G. Potamianos and P. Maragos, ATHENA: A Greek Multi-Sensory Database for Home Automation Control
, Proc. 15th Annual Conf. of International Speech Communication Association (INTERSPEECH-2014), Singapore, Sep. 2014.
For more information visit:
ATHENA database
|