Personal Photo
Nassos Katsamanis
Postdoctoral Research Associate
University of Southern California
Office: 3740 McClintock Ave., Room 427
University of Southern California
Los Angeles, CA 90089-2564
Phone: (+001) 213-7404148
Fax:
E-mail: nkatsam@sipi
address is formatted username@sipi.usc.edu
URL: http://cvsp.cs.ntua.gr/~nassos
 

Biosketch

Currently I am a Postdoctoral Research Associate at the School of Electrical Engineering in the University of Southern California, member of the Signal Analysis and Interpretation Laboratory (SAIL).

My research interests lie in the area of speech and multimodal signal analysis and include speech/audiovisual production, synthesis, inversion, recognition and processing. I received the Diploma in electrical and computer engineering (with highest honors) and the Ph.D. degree from the National Technical University of Athens, Athens, Greece, in 2003 and 2009 respectively.

Curriculum Vitae: Katsamanis_resume.pdf

Publications


Journal Publications

  • A. Katsamanis, G. Papandreou and P. Maragos,
    Face Active Appearance Modeling and Speech Acoustic Information to Recover Articulation,
    IEEE Transactions on Audio, Speech and Language Processing, vol. 17, no. 3, pp. 411-422, Mar. 2009.
    [pdf] [bib]
  • G. Papandreou, A. Katsamanis, V. Pitsikalis, and P. Maragos,
    Adaptive Multimodal Fusion by Uncertainty Compensation with Application to Audio-Visual Speech Recognition,
    IEEE Transactions on Audio, Speech and Language Processing, vol. 17, no. 3, pp. 423-435, Mar. 2009.
    [pdf] [bib]

Conference Papers

  • A. Roussos, A. Katsamanis, P. Maragos,
    Tongue Tracking in Ultrasound Images with Active Appearance Models,
    Proc. IEEE Int'l Conf. on Image Processing (ICIP-09), Cairo, Egypt, Nov. 7-11, 2009.
  • S. Theodorakis, A. Katsamanis, P. Maragos,
    Product-HMMs for automatic sign language recognition,
    Proc. IEEE Int'l Conference on Acoustics, Speech, and Signal Processing (ICASSP-2009), Taipei, Taiwan, Apr. 2009.
    [bib]
  • A. Katsamanis, T. Roussos, P. Maragos, M. Aron and M.-O. Berger,
    Inversion from Audiovisual Speech to Articulatory Information by Exploiting Multimodal Data,
    International Seminar on Speech Production (ISSP 2008), Strasbourg, France, Dec. 2008.
    [pdf] [bib] [presentation]
  • A. Katsamanis, G. Ananthakrishnan, G. Papandreou, P. Maragos, O. Engwall,
    Audiovisual Speech Inversion by Switching Dynamical Modeling Governed by a Hidden Markov Process,
    European Signal Processing Conference (EUSIPCO 2008), Lausanne, Switzerland, Aug. 2008.
    [pdf] [bib]
  • A. Katsamanis, G. Papandreou, and P. Maragos,
    Audiovisual-to-Articulatory Speech Inversion Using Active Appearance Models for the Face and Hidden Markov Models for the Dynamics,
    Proc. IEEE Int'l Conference on Acoustics, Speech, and Signal Processing (ICASSP-2008), Las Vegas, NV, U.S.A., Mar.-Apr. 2008.
    [pdf] [poster] [bib]
  • S. Lefkimmiatis, P. Maragos, A. Katsamanis,
    Multisensor Multiband Cross-Energy Tracking for Feature Extraction and Recognition,
    Proc. IEEE Int'l Conference on Acoustics, Speech, and Signal Processing (ICASSP-2008), Las Vegas, NV, U.S.A., Mar.-Apr. 2008.
    [pdf] [bib]
  • G. Papandreou, A. Katsamanis, V. Pitsikalis, and P. Maragos,
    Multimodal Fusion and Learning with Uncertain Features Applied to Audiovisual Speech Recognition,
    Proc. IEEE Workshop on Multimedia Signal Processing (MMSP-2007), pp. 264-267, Chania, Greece, October 1-3, 2007.
    [pdf] [bib]
  • A. Katsamanis, G. Papandreou, and P. Maragos,
    Audiovisual-to-Articulatory Inversion Using Hidden Markov Models,
    Proc. IEEE Workshop on Multimedia Signal Processing (MMSP-2007), pp. 457-460, Chania, Greece, October 1-3, 2007.
    [pdf] [bib]
  • A. Katsamanis, P. Tsiakoulis, P. Maragos and A. Potamianos
    Investigations in Articulatory Synthesis,
    Proc. 16th International Congress of Phonetic Sciences (ICPhS-2007), pp. 877-880, Saarbruecken, Germany, August 6-10, 2007.
    [pdf] [bib]
  • V. Pitsikalis, A. Katsamanis, G. Papandreou, and P. Maragos,
    Adaptive Multimodal Fusion by Uncertainty Compensation,
    Proc. Int'l Conference on Spoken Language Processing (ICSLP-2006), pp. 2458-2461, Pittsburgh PA, USA, Sep. 17-21, 2006.
    [pdf] [bib]
  • A. Katsamanis, G. Papandreou, V. Pitsikalis, and P. Maragos,
    Multimodal Fusion by Adaptive Compensation for Feature Uncertainty with Application to Audiovisual Speech Recognition,
    Proc. 14th European Signal Processing Conference (EUSIPCO-2006), Florence, Italy, Sept. 4-8 2006.
    [pdf] [bib]
  • A. Katsamanis and P. Maragos,
    Advances in Statistical Estimation and Tracking of AM-FM Speech Components,
    Proc. Interspeech 2005 - Eurospeech -- 9th European Conference on Speech Communication and Technology, Lisbon, Portugal, September 2005.
    [pdf] [bib]
  • D. Dimitriadis, N. Katsamanis, P. Maragos, G. Papandreou and V. Pitsikalis,
    Towards Automatic Speech Recognition in Adverse Environments,
    Proc. HERCMA 2005 -- 7th Hellenic European Conference on Research on Computer Mathematics and its Applications, Athens, Greece, September 2005.

Book Chapters

  • G. Papandreou, A. Katsamanis, V. Pitsikalis and P. Maragos,
    Adaptive Multimodal Fusion by Uncertainty Compensation with Application to Audio-Visual Speech Recognition,
    in Multimodal Processing and Interaction: Audio, Video, Text, edited by P. Maragos, A. Potamianos, and P. Gros, Springer-Verlag, New York, 2008.
    [springer] [amazon.com]
  • P. Maragos, P. Gros, A. Katsamanis and G. Papandreou,
    Cross-Modal Integration for Performance Improving in Multimedia: A Review,
    in Multimodal Processing and Interaction: Audio, Video, Text, edited by P. Maragos, A. Potamianos, and P. Gros, Springer-Verlag, New York, 2008.
    [springer] [amazon.com]

Theses


Talks, Seminars, Presentations, Posters

  • Fricative synthesis investigations using the transmission line matrix method,
    at the 2nd ASA-ESA joint conference Acoustics'08, Paris.
    [abstract]
  • Audio-Visual Speech Analysis and Recognition,
    at MUSCLE joint with VITALAS Conference, Cannes, France, 2008.
    [presentation] [video]
  • Towards Automatic Speech Recognition in Adverse Environments,
    at the Nonlinear Speech Processing Workshop (WNSP05), Heraklion, Greece, 2005.
    [presentation]
  • Statistical Speech Analysis and Nonlinear Modeling,
    at COST277 Seminar in Limerick, 2004.
    [presentation]
  • Speech Technologies for E-Commerce and Transactional Services, (in Greek)
    at EFTEHNOS series of seminars on supportive information and telecommunication technologies for people with special needs, University of Athens, 2004.
    [presentation]