Computer Vision, Speech Communication & Signal Processing Group

NTUA | ECE

Microphone Array Speech Processing

Overview

We work on microphone array processing and distant speech recognition, with the aim of building hands-free, voice-enabled interfaces for home automation control. Users will be able to control appliances and trigger actions by voice, without moving from where they are. To this end, we process signals from microphone arrays mounted on the walls and ceiling. Our research focuses on acoustic speaker localization, voice activity detection, acoustic event detection, speech enhancement/beamforming, activation keyword spotting, and distant speech recognition. We have also collected a distant speech database in Greek that is publicly available: the ATHENA database.

People

Publications

Software

Some of our tools are publicly available via GitHub:

  • Multi-channel speech enhancement

    Please cite:

    Z. I. Skordilis, A. Tsiami, P. Maragos, G. Potamianos, L. Spelgatti and R. Sannino,
    Multichannel Speech Enhancement Using MEMS Microphones,
    Proc. IEEE Int'l. Conf. on Acoustics, Speech and Signal Processing (ICASSP-2015), Brisbane, Australia, 2015.

    and

    S. Lefkimmiatis and P. Maragos,
    A Generalized Estimation Approach for Linear and Nonlinear Microphone Array Post-Filters,
    Speech Communication, vol. 49, pp. 657-666, 2007.

  • Sweet Home Listen: A distant speech recognition system for home automation control

    Please cite:

    A. Katsamanis, I. Rodomagoulakis, G. Potamianos, P. Maragos and A. Tsiami,
    Robust Far-Field Spoken Command Recognition for Home Automation Combining Adaptation and Multichannel Processing,
    Proc. Int'l. Conf. on Acoustics, Speech and Signal Processing (ICASSP-2014), Florence, Italy, May 2014.

Data

We have collected a real distant speech corpus in Greek that is publicly available.
The database is described in:

    A. Tsiami, I. Rodomagoulakis, P. Giannoulis, A. Katsamanis, G. Potamianos and P. Maragos,
    ATHENA: A Greek Multi-Sensory Database for Home Automation Control,
    Proc. 15th Annual Conf. of International Speech Communication Association (INTERSPEECH-2014), Singapore, Sep. 2014.

For more information visit: ATHENA database

Last modified: Thursday, 09 June 2016 | Created by Nassos Katsamanis and George Papandreou