Publications
Ph.D. Thesis
Articulatory representations to address acoustic variability in speech, University of Maryland College Park, December 2017 [thesis]
Journal Articles
- G. Sivaraman, V. Mitra, H. Nam, M. Tiede, and C. Espy-Wilson, Unsupervised speaker adaptation for speaker independent
acoustic to articulatory speech inversion, The Journal of the Acoustical Society of America 146 (1), 316-329, 2019 [paper]. - E. Ylmaz, V. Mitra, G. Sivaraman, H. Franco, Articulatory and bottleneck features for speaker-independent ASR of dysarthric
speech, Computer Speech & Language 58, 319-334, 2019 [paper]. - V. Mitra, G. Sivaraman, H. Nam, C. Espy-Wilson,E. Saltzman, and M.K. Tiede, Hybrid Convoultional Neural Networks for Articulatory and Acoustic information based speech recognition, Speech Communication 89, 103-112 [paper]
- G. Sivaraman and K Samudravijaya, Hindi Speech Recognition and Online Speaker Adaptation. IJCA Proceedings on International Conference on Technology Systems and Management (ICTSM) (1):27-30, 2011 [paper]
Conference Articles
- Rahil Parikh, Nadee Seneviratne, Ganesh Sivaraman, Shihab Shamma, Carol Espy-Wilson, Acoustic To Articulatory Speech Inversion Using Multi-Resolution Spectro-Temporal Representations Of Speech Signals, to appear at Interspeech 2022.
- Yashish M Siriwardena, Ganesh Sivaraman, Carol Espy-Wilson, Acoustic-to-articulatory Speech Inversion with Multi-task Learning, to appear at Interspeech 2022.
- Ganesh Sivaraman, Ricardo Casal, Matt Garland, Elie Khoury, Unsupervised Model Adaptation for End-to-End ASR, ICASSP 2022.
- G. Sivaraman, A. Vidwans, E. Khoury, Speech Bandwidth Expansion For Speaker Recognition On Telephony Audio,
accepted for publication at Odyssey: The Speaker and Language Recognition Workshop, 2020 [paper] - T. Chen, A. Kumar, P. Nagarsheth, G. Sivaraman, and E. Khoury, Generalization of Audio Deepfake Detection, in Odyssey 2020 The Speaker and Language Recognition Workshop, 2020, pp. 132–137 [paper]
- N. Seneviratne, G. Sivaraman, C. Espy-Wilson, Multi-corpus Acoustic-to-articulatory Speech Inversion, Proc. INTER-
SPEECH 2019 [paper]. - E. Khoury, K. Lakhdhar, A. Vaughan, G. Sivaraman, P. Nagarsheth, Pindrop Labs Submission to the First Multi-target Speaker
Detection and Identification Challenge, Proc. INTERSPEECH 2019 [paper] - S. Sahu, R. Gupta, G. Sivaraman, Carol Espy-Wilson, Smoothing model predictions using adversarial training procedures for speech based emotion recognition, Proc. ICASSP 2018 [paper]
- G. Sivaraman , C. Espy-Wilson, M. Wieling, Analysis of acoustic-to-articulatory speech inversion across different accents and languages, Proc. Interspeech 2017, 974-978 [paper] [slides]
- S. Sahu, R. Gupta, G. Sivaraman, C. Espy-Wilson, Adversarial Auto-encoders for Speech Based Emotion Recognition, Proc. Interspeech 2017, 1243-1247 [paper]
- V. Mitra, G. Sivaraman, C. Bartels, H. Nam, W. Wang, C. Espy-Wilson, D. Vergyri, H. Franco, Joint modeling of
articulatory and acoustic spaces for continuous speech recognition tasks, in Proc. ICASSP 2017 [paper] - G. Sivaraman, V.Mitra, H. Nam, M.K. Tiede, C. Espy-Wilson (2016) Vocal tract length normalization for speaker independent acoustic-to-articulatory speech inversion, Proc. of INTERSPEECH 2016 [paper] [slides]
- G. Sivaraman, V. Mitra, M.K. Tiede, E. Saltzman, L. Goldstein, C. Espy-Wilson (2015). Analysis of Coarticulated Speech Using Estimated Articulatory Trajectories, Proc. of INTERSPEECH 2015 [paper] [slides]
- V. Mitra, G. Sivaraman, H. Nam, C. Espy-Wilson, E. Saltzman, Articulatory features from deep neural networks and their role in speech recognition, Proc. of ICASSP, pp.3041-3045, Florence, 2014 [paper]
- G. Sivaraman, V. Mitra, C.Y. Espy-Wilson, Fusion of acoustic, perceptual and production features for robust speech recognition in highly non-stationary noise, Proc. of CHiME-2013, pp. 6570, Vancouver, Canada, June 2013 [paper]
- G. Sivaraman, S. Mehta, N. Nabar and Samudravijaya K, Higher Accuracy of Hindi Speech Recognition Due to Online Speaker Adaptation, Communications in Computer and Information Science, 2011 [paper]
Conference Abstracts
- G. Sivaraman, V. Mitra, H. Nam, E. Saltzman; C. Espy-Wilson (2015). Augmenting acoustic phonetics with articulatory features for phone recognition, Spring 2015 Meeting of the Acoustical Society of America, Pittsburgh, 2015 [poster]
- G. Sivaraman, C. Espy-Wilson, V. Mitra, H. Nam, E. Saltzman (2014). Analysis of acoustic to articulatory speech inversion for natural speech, Fall 2014 Meeting of The Acoustical Society of America, Indianapolis, 2014 [poster]