Hynek Hermansky - Publications

Affiliations: Johns Hopkins University, Baltimore, MD
Area: automatic speech recognition

108/185 high-probability publications.

Year Citation  Score
2022 Kayser H, Hermansky H, Meyer BT. Spatial speech detection for binaural hearing aids using deep phoneme classifiers. Acta Acustica. European Acoustics Association. 6. PMID 36159631 DOI: 10.1051/aacus/2022013  0.351
2020 Li R, Wang X, Mallidi SH, Watanabe S, Hori T, Hermansky H. Multi-Stream End-to-End Speech Recognition Ieee/Acm Transactions On Audio, Speech, and Language Processing. 28: 646-655. DOI: 10.1109/TASLP.2019.2959721  0.381
2019 Mahajan NR, Mesgarani N, Hermansky H. General properties of auditory spectro-temporal receptive fields. The Journal of the Acoustical Society of America. 146: EL459. PMID 31893764 DOI: 10.1121/1.5135021  0.65
2019 Hermansky H. Coding and decoding of messages in human speech communication: Implications for machine recognition of speech Speech Communication. 106: 112-117. DOI: 10.1016/J.SPECOM.2018.12.004  0.503
2019 Castro Martinez AM, Gerlach L, Payá-Vayá G, Hermansky H, Ooster J, Meyer BT. DNN-based performance measures for predicting error rates in automatic speech recognition and optimizing hearing aid parameters Speech Communication. 106: 44-56. DOI: 10.1016/j.specom.2018.11.006  0.45
2016 Hsiao R, Ma J, Hartmann W, Karafiát M, Grézl F, Burget L, Szöke I, Černocky JH, Watanabe S, Chen Z, Mallidi SH, Hermansky H, Tsakalidis S, Schwartz R. Robust speech recognition in unknown reverberant and noisy conditions 2015 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2015 - Proceedings. 533-538. DOI: 10.1109/ASRU.2015.7404841  0.516
2014 Ganapathy S, Mallidi SH, Hermansky H. Robust feature extraction using modulation filtering of autoregressive models Ieee Transactions On Audio, Speech and Language Processing. 22: 1285-1295. DOI: 10.1109/Taslp.2014.2329190  0.675
2014 Mahajan N, Mesgarani N, Hermansky H. Principal components of auditory spectro-temporal receptive fields Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1983-1987.  0.571
2014 Li F, Nidadavolu PS, Hermansky H. A long, deep and wide artificial neural net for robust speech recognition in unknown noise Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 358-362.  0.322
2013 Garimella S, Hermansky H. Factor analysis of auto-associative neural networks with application in speaker verification. Ieee Transactions On Neural Networks and Learning Systems. 24: 522-8. PMID 24808374 DOI: 10.1109/Tnnls.2012.2236652  0.713
2013 Hermansky H. Multistream recognition of speech: Dealing with unknown unknowns Proceedings of the Ieee. 101: 1076-1088. DOI: 10.1109/JPROC.2012.2236871  0.4
2013 Jansen A, Dupoux E, Goldwater S, Johnson M, Khudanpur S, Church K, Feldman N, Hermansky H, Metze F, Rose R, Seltzer M, Clark P, McGraw I, Varadarajan B, Bennett E, et al. A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8111-8115. DOI: 10.1109/ICASSP.2013.6639245  0.502
2013 Jansen A, Thomas S, Hermansky H. Weak top-down constraints for unsupervised acoustic model training Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8091-8095. DOI: 10.1109/ICASSP.2013.6639241  0.456
2013 Plchot O, Matsoukas S, Matejka P, Dehak N, Ma J, Cumani S, Glembek O, Hermansky H, Mallidi SH, Mesgarani N, Schwartz R, Soufifar M, Tan ZH, Thomas S, Zhang B, et al. Developing a speaker identification system for the DARPA RATS project Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 6768-6772. DOI: 10.1109/ICASSP.2013.6638972  0.605
2013 Thomas S, Seltzer ML, Church K, Hermansky H. Deep neural network features and semi-supervised training for low resource speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 6704-6708. DOI: 10.1109/ICASSP.2013.6638959  0.539
2013 Kintzley K, Jansen A, Hermansky H. Text-to-speech inspired duration modeling for improved whole-word acoustic models Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1253-1257.  0.328
2013 Variani E, Li F, Hermansky H. Multi-stream recognition of noisy speech with performance monitoring Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2978-2981.  0.37
2013 Mallidi SH, Ganapathy S, Hermansky H. Robust speaker recognition using spectro-temporal autoregressive models Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 3689-3693.  0.327
2012 Ganapathy S, Hermansky H. Temporal resolution analysis in frequency domain linear prediction. The Journal of the Acoustical Society of America. 132: EL436-42. PMID 23145707 DOI: 10.1121/1.4758826  0.661
2012 Sivaram GSVS, Hermansky H. Sparse multilayer perceptron for phoneme recognition Ieee Transactions On Audio, Speech and Language Processing. 20: 23-29. DOI: 10.1109/TASL.2011.2129510  0.412
2012 Garimella S, Mallidi SH, Hermansky H. Regularized auto-associative neural networks for speaker verification Ieee Signal Processing Letters. 19: 841-844. DOI: 10.1109/Lsp.2012.2221706  0.725
2012 Thomas S, Ganapathy S, Hermansky H. Multilingual MLP features for low-resource LVCSR systems Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4269-4272. DOI: 10.1109/ICASSP.2012.6288862  0.624
2012 Garcia-Romero D, Zhou X, Zotkin D, Srinivasan B, Luo Y, Ganapathy S, Thomas S, Nemala S, Sivaram GSVS, Mirbagheri M, Mallidi SH, Janu T, Rajan P, Mesgarani N, Elhilali M, ... Hermansky H, et al. The UMD-JHU 2011 speaker recognition system Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4229-4232. DOI: 10.1109/ICASSP.2012.6288852  0.769
2012 Ikbal S, Misra H, Hermansky H, Magimai-Doss M. Phase AutoCorrelation (PAC) features for noise robust speech recognition Speech Communication. 54: 867-880. DOI: 10.1016/j.specom.2012.02.005  0.471
2012 Thomas S, Mallidi SH, Janu T, Hermansky H, Mesgarani N, Zhou X, Shamma S, Ng T, Zhang B, Nguyen L, Matsoukas S. Acoustic and data-driven features for robust speech activity detection 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1983-1986.  0.736
2012 Jansen A, Thomas S, Hermansky H. Intrinsic spectral analysis for zero and high resource speech recognition 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 878-881.  0.36
2012 Thomas S, Ganapathy S, Jansen A, Hermansky H. Data-driven posterior features for low resource speech recognition applications 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 790-793.  0.375
2011 Mesgarani N, Thomas S, Hermansky H. Toward optimizing stream fusion in multistream recognition of speech. The Journal of the Acoustical Society of America. 130: EL14-8. PMID 21786862 DOI: 10.1121/1.3595744  0.743
2011 Hermansky H. Dealing with unknown unknowns in speech The Journal of the Acoustical Society of America. 130: 2408-2408. DOI: 10.1121/1.3654655  0.428
2011 Pinto J, Garimella S, Magimai-Doss M, Hermansky H, Bourlard H. Analysis of MLP-based hierarchical phoneme posterior probability estimator Ieee Transactions On Audio, Speech and Language Processing. 19: 225-241. DOI: 10.1109/Tasl.2010.2045943  0.395
2011 Thomas S, Nguyen P, Zweig G, Hermansky H. MLP based phoneme detectors for automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5024-5027. DOI: 10.1109/ICASSP.2011.5947485  0.598
2011 Ganapathy S, Rajan P, Hermansky H. Multi-layer perceptron based speech activity detection for speaker verification Ieee Workshop On Applications of Signal Processing to Audio and Acoustics. 321-324. DOI: 10.1109/ASPAA.2011.6082323  0.569
2011 Hermansky H. Speech recognition from spectral dynamics Sadhana - Academy Proceedings in Engineering Sciences. 36: 729-744. DOI: 10.1007/s12046-011-0044-2  0.518
2011 Hermansky H. Dealing with unexpected words in automatic recognition of speech Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6836: 1-15. DOI: 10.1007/978-3-642-23538-2_1  0.372
2011 Mallidi SH, Ganapathy S, Hermansky H. Modulation spectrum analysis for recognition of reverberant speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 189-192.  0.412
2011 Mesgarani N, Thomas S, Hermansky H. Adaptive stream fusion in multistream recognition of speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2329-2332.  0.671
2010 Ganapathy S, Thomas S, Hermansky H. Temporal envelope compensation for robust phoneme recognition using modulation spectrum. The Journal of the Acoustical Society of America. 128: 3769-80. PMID 21218908 DOI: 10.1121/1.3504658  0.754
2010 Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Wide-band audio coding based on frequency-domain linear prediction Eurasip Journal On Audio, Speech, and Music Processing. 2010. DOI: 10.1155/2010/856280  0.656
2010 Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction Eurasip Journal On Audio, Speech, and Music Processing. 2010: 1-14. DOI: 10.1155/2010/856280  0.532
2010 Hermansky H. Posterior‐based attributes in machine recognition of speech. The Journal of the Acoustical Society of America. 127: 2041-2041. DOI: 10.1121/1.3385373  0.483
2010 Ganapathy S, Motlicek P, Hermansky H. Autoregressive models of amplitude modulations in audio compression Ieee Transactions On Audio, Speech and Language Processing. 18: 1624-1631. DOI: 10.1109/Tasl.2009.2038813  0.628
2010 Sivaram GSVS, Nemala SK, Mesgarani N, Hermansky H. Data-driven and feedback based spectro-temporal features for speech recognition Ieee Signal Processing Letters. 17: 957-960. DOI: 10.1109/Lsp.2010.2079930  0.691
2010 Liu SC, Mesgarani N, Harris J, Hermansky H. The use of spike-based representations for hardware audition systems Iscas 2010 - 2010 Ieee International Symposium On Circuits and Systems: Nano-Bio Circuit Fabrics and Systems. 505-508. DOI: 10.1109/ISCAS.2010.5537588  0.558
2010 Ganapathy S, Thomas S, Hermansky H. Robust spectro-temporal features based on autoregressive models of Hilbert envelopes Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4286-4289. DOI: 10.1109/ICASSP.2010.5495668  0.681
2010 Sivaram GSVS, Nemala SK, Elhilali M, Tran TD, Hermansky H. Sparse coding for speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4346-4349. DOI: 10.1109/ICASSP.2010.5495649  0.332
2010 Ganapathy S, Thomas S, Hermansky H. Comparison of modulation features for phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5038-5041. DOI: 10.1109/ICASSP.2010.5495057  0.699
2010 Thomas S, Patil K, Ganapathy S, Mesgarani N, Hermansky H. A phoneme recognition framework based on auditory spectro-temporal receptive fields Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2458-2461.  0.628
2010 Mesgarani N, Thomas S, Hermansky H. A multistream multiresolution framework for phoneme recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 318-321.  0.645
2009 Ganapathy S, Thomas S, Hermansky H. Modulation frequency features for phoneme recognition in noisy speech. The Journal of the Acoustical Society of America. 125: EL8-12. PMID 19173383 DOI: 10.1121/1.3040022  0.762
2009 Hermansky H. Nonlinear mapping for feature extraction in automatic speech recognition The Journal of the Acoustical Society of America. 125: 4109. DOI: 10.1121/1.3155499  0.449
2009 Thomas S, Ganapathy S, Hermansky H. Phoneme recognition using spectral envelope and modulation frequency features Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4453-4456. DOI: 10.1109/ICASSP.2009.4960618  0.695
2009 Ganapathy S, Thomas S, Hermansky H. Temporal envelope subtraction for robust speech recognition using modulation spectrum Proceedings of the 2009 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2009. 164-169. DOI: 10.1109/ASRU.2009.5372922  0.731
2009 Ganapathy S, Thomas S, Motlicek P, Hermansky H. Applications of signal analysis using autoregressive models for amplitude modulation Ieee Workshop On Applications of Signal Processing to Audio and Acoustics. 341-344. DOI: 10.1109/ASPAA.2009.5346495  0.621
2009 Ganapathy S, Motlicek P, Hermansky H. Error resilient speech coding using sub-band hilbert envelopes Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5729: 355-362. DOI: 10.1007/978-3-642-04208-9_49  0.541
2009 Thomas S, Ganapathy S, Hermansky H. Tandem representations of spectral envelope and modulation frequency features for ASR Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2955-2958.  0.305
2009 Mesgarani N, Sivaram GSVS, Nemala SK, Elhilali M, Hermansky H. Discriminant spectrotemporal features for phoneme recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2983-2986.  0.662
2009 Ganapathy S, Thomas S, Hermansky H. Static and dynamic modulation spectrum for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2823-2826.  0.329
2009 Kombrink S, Burget L, Matějka P, Karafiát M, Hermansky H. Posterior-based out of vocabulary word detection in telephone speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 80-83.  0.317
2008 Thomas S, Ganapathy S, Hermansky H. Recognition of reverberant speech using frequency domain linear prediction Ieee Signal Processing Letters. 15: 681-684. DOI: 10.1109/Lsp.2008.2002708  0.76
2008 Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Temporal masking for bit-rate reduction in audio codec based on Frequency Domain Linear Prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4781-4784. DOI: 10.1109/ICASSP.2008.4518726  0.538
2008 Krishnan Parthasarathi SH, Motlíček P, Hermansky H. Exploiting contextual information for speech/non-speech detection Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 451-459. DOI: 10.1007/978-3-540-87391-4_58  0.316
2008 Motlíček P, Ganapathy S, Hermansky H, Garudadri H, Athineos M. Perceptually motivated sub-band decomposition for FDLP audio coding Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 435-442. DOI: 10.1007/978-3-540-87391-4_56  0.499
2008 Thomas S, Ganapathy S, Hermansky H. Hilbert envelope based features for far-field speech recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5237: 119-124. DOI: 10.1007/978-3-540-85853-9_11  0.394
2008 Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Frequency domain linear prediction for QMF sub-bands and applications to audio coding Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4892: 248-258. DOI: 10.1007/978-3-540-78155-4_22  0.525
2008 Thomas S, Ganapathy S, Hermansky H. Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain European Signal Processing Conference.  0.394
2008 Ganapathy S, Thomas S, Hermansky H. Front-end for far-field speech recognition based on frequency domain linear prediction Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 984-987.  0.31
2008 Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Spectral noise shaping: Improvements in speech/audio codec based on linear prediction in spectral domain Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 675-678.  0.304
2008 Sivaram GSVS, Hermansky H. Introducing temporal asymmetries in feature extraction for automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 890-893.  0.411
2008 Thomas S, Ganapathy S, Hermansky H. Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1521-1524.  0.458
2007 Prasanna SRM, Hermansky H. MRASTA and PLP in automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 137-140.  0.416
2005 Morgan N, Zhu Q, Stolcke A, Sönmez K, Sivadas S, Shinozaki T, Ostendorf M, Jain P, Hermansky H, Ellis D, Doddington G, Chen B, Çetin O, Bourlard H, Athineos M. Pushing the envelope - Aside Ieee Signal Processing Magazine. 22: 81-88. DOI: 10.1109/Msp.2005.1511826  0.505
2004 Ikbal S, Misra H, Bourlard H, Hermansky H. Phase AutoCorrelation (PAC) features in entropy based multi-stream for robust speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I205-I208.  0.377
2003 Hermansky H. Recognition of information‐bearing elements in speech The Journal of the Acoustical Society of America. 114: 2424-2424. DOI: 10.1121/1.4778809  0.496
2003 Hermansky H. TRAP-TANDEM: Data-driven extraction of temporal features from speech 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 255-260. DOI: 10.1109/ASRU.2003.1318450  0.343
2003 Malayath N, Hermansky H. Data-driven spectral basis functions for automatic speech recognition Speech Communication. 40: 449-466. DOI: 10.1016/S0167-6393(02)00127-9  0.515
2000 Hermansky H. Method and system for generating an estimated clean speech signal from a noisy speech signal The Journal of the Acoustical Society of America. 107: 1816. DOI: 10.1121/1.428550  0.415
2000 Yang HH, Van Vuuren S, Sharma S, Hermansky H. Relevance of time-frequency features for phonetic and speaker-channel classification Speech Communication. 31: 35-50. DOI: 10.1016/S0167-6393(00)00007-8  0.406
2000 Malayath N, Hermansky H, Kajarekar S, Yegnanarayana B. Data-driven temporal filters and alternatives to GMM in speaker verification Digital Signal Processing: a Review Journal. 10: 55-74. DOI: 10.1006/dspr.1999.0363  0.363
2000 Kajarekar SS, Hermansky H. Analysis of information in speech and its application in speech recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1902: 283-288.  0.357
1999 Arai T, Pavel M, Hermansky H, Avendano C. Syllable intelligibility for temporally filtered LPC cepstral trajectories. The Journal of the Acoustical Society of America. 105: 2783-91. PMID 10335630 DOI: 10.1121/1.426895  0.359
1999 Hermansky H. Data‐driven speech analysis for ASR The Journal of the Acoustical Society of America. 105: 1352-1352. DOI: 10.1121/1.426410  0.505
1999 Sharma S, Hermansky H. Recognition of speech from temporal patterns The Journal of the Acoustical Society of America. 105: 1158-1158. DOI: 10.1121/1.425505  0.499
1999 Kanedera N, Arai T, Hermansky H, Pavel M. On the relative importance of various components of the modulation spectrum for automatic speech recognition Speech Communication. 28: 43-55. DOI: 10.1016/S0167-6393(99)00002-3  0.499
1999 Yegnanarayana B, Avendano C, Hermansky H, Satyanarayana Murthy P. Speech enhancement using linear prediction residual Speech Communication. 28: 25-42. DOI: 10.1016/S0167-6393(98)00070-3  0.453
1998 Kanedera N, Hermansky H, Arai T. On properties of modulation spectrum for robust automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2: 613-616. DOI: 10.1109/ICASSP.1998.675339  0.408
1998 Hermansky H. Should recognizers have ears? Speech Communication. 25: 3-27. DOI: 10.1016/S0167-6393(98)00027-2  0.466
1997 Hermansky H, Morgan NH. Noise resistant auditory model for parameterization of speech The Journal of the Acoustical Society of America. 101: 2426. DOI: 10.1121/1.418514  0.41
1997 Avendano C, Hermansky H. On the effects of short-term spectrum smoothing in channel normalization Ieee Transactions On Speech and Audio Processing. 5: 372-374. DOI: 10.1109/89.593318  0.301
1996 Hermansky H. Beyond a ‘‘short‐term’’ analysis of speech The Journal of the Acoustical Society of America. 100: 2792-2792. DOI: 10.1121/1.416495  0.475
1996 Arai T, Pavel M, Hermansky H, Avendano C. Intelligibility of speech with filtered time trajectories of LPC cepstrum The Journal of the Acoustical Society of America. 100: 2756-2756. DOI: 10.1121/1.416322  0.458
1996 Bourlard H, Hermansky H, Morgan N. Towards increasing speech recognition error rates Speech Communication. 18: 205-231. DOI: 10.1016/0167-6393(96)00003-9  0.438
1995 Cole R, Hermansky H, Novick DG, Oviatt S, Hirschman L, Atlas L, Beckman M, Biermann A, Bush M, Clements M, Cohen J, Garcia O, Hanson B, Levinson S, McKeown K, et al. The Challenge of Spoken Language Systems: Research Directions for the Nineties Ieee Transactions On Speech and Audio Processing. 3: 1-21. DOI: 10.1109/89.365385  0.384
1995 Morgan N, Bourlard H, Greenberg S, Hermansky H, Wu SL. Stochastic perceptual models of speech Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 397-400.  0.312
1994 Pavel M, Hermansky H. Temporal masking in automatic speech recognition The Journal of the Acoustical Society of America. 95: 2876-2876. DOI: 10.1121/1.409409  0.527
1994 Hermansky H, Morgan N. RASTA Processing of Speech Ieee Transactions On Speech and Audio Processing. 2: 578-589. DOI: 10.1109/89.326616  0.49
1993 Junqua JC, Wakita H, Hermansky H. Evaluation and Optimization of Perceptually-Based ASR Front-End Ieee Transactions On Speech and Audio Processing. 1: 39-48. DOI: 10.1109/89.221366  0.459
1993 Hermansky H, Morgan N, Hirsch HG. Recognition of speech in additive and convolutional noise based on RASTA spectral processing Proceedings - Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing. 2: II-83-II-86.  0.334
1991 Morgan N, Hermansky H, Bourlard H, Kohn P, Wooters C. Continuous speech recognition using PLP analysis with multilayer perceptrons Proceedings - Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing. 1: 49-52.  0.392
1990 Hermansky H. Perceptual linear predictive (PLP) analysis of speech. The Journal of the Acoustical Society of America. 87: 1738-52. PMID 2341679 DOI: 10.1121/1.399423  0.458
1990 Hermansky H, Cox TL. Synthesis of speech from the low‐dimensional PLP representation The Journal of the Acoustical Society of America. 88: S179-S180. DOI: 10.1121/1.2028800  0.425
1988 Terry M, Hermansky H. Comparison of standard ASR front ends and auditory models in neural net‐based automatic speech recognition The Journal of the Acoustical Society of America. 83: S53-S53. DOI: 10.1121/1.2025401  0.441
1987 Hermansky H. Should ASR front‐end be insensitive to fundamental frequency? (perceptual shift of formant position due to fine harmonic structure of voiced speech) The Journal of the Acoustical Society of America. 82: S36-S36. DOI: 10.1121/1.2024778  0.384
1987 Hermansky H. Why is the formant frequency difference limen asymmetric? The Journal of the Acoustical Society of America. 81: S18-S18. DOI: 10.1121/1.2024129  0.355
1986 Hermansky H, Javkin HR. Evaluation of ASR front ends using synthetic vowel‐like sounds The Journal of the Acoustical Society of America. 80: S18-S18. DOI: 10.1121/1.2023687  0.316
1986 Tsuga K, Hermansky H. Effect of the spectral model order in automatic speech recognition The Journal of the Acoustical Society of America. 80: S18-S18. DOI: 10.1121/1.2023684  0.352
1985 Hanson BA, Hermansky H, Wakita H. Root‐power sums and spectral slope distortion measures for all‐pole models of speech The Journal of the Acoustical Society of America. 78: S49-S49. DOI: 10.1121/1.2022847  0.407
1985 Hermansky H, Hanson BA, Wakita H. Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain Speech Communication. 4: 181-187. DOI: 10.1016/0167-6393(85)90045-7  0.451
1984 Hermansky H, Hanson BA, Wakita H. Critical‐band‐weighted linear prediction of speech The Journal of the Acoustical Society of America. 76: S1-S1. DOI: 10.1121/1.2021743  0.406
Low-probability matches (unlikely to be authored by this person)
2004 Misra H, Ikbal S, Bourlard H, Hermansky H. Spectral entropy based feature for robust ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I193-I196.  0.298
2003 Ikbal S, Hermansky H, Bourlard H. Nonlinear spectral transformations for robust speech recognition 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 393-398. DOI: 10.1109/ASRU.2003.1318473  0.297
2008 Pinto J, Hermansky H. Combining evidence from a generative and a discriminative model in phoneme recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2414-2417.  0.293
2012 Weinshall D, Zweig A, Hermansky H, Kombrink S, Ohl FW, Anemüller J, Bach JH, Van Gool L, Nater F, Pajdla T, Havlena M, Pavel M. Beyond novelty detection: incongruent events, when general and specific classifiers disagree. Ieee Transactions On Pattern Analysis and Machine Intelligence. 34: 1886-901. PMID 22213766 DOI: 10.1109/Tpami.2011.279  0.288
2011 Sivaram GSVS, Hermansky H. Multilayer perceptron with sparse hidden outputs for phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5336-5339. DOI: 10.1109/ICASSP.2011.5947563  0.288
1991 Morgan N, Wooters C, Hermansky H. Experiments with temporal resolution for continuous speech recognition with multi-layer perceptrons Neural Networks For Signal Processing. 405-410.  0.288
2013 Clark P, Mallidi SH, Jansen A, Hermansky H. Frequency offset correction in speech without detecting pitch Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7020-7024. DOI: 10.1109/ICASSP.2013.6639023  0.287
2013 Hermansky H. Long, deep and wide artificial neural nets for dealing with unexpected noise in machine recognition of speech Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8082: 14-21. DOI: 10.1007/978-3-642-40585-3_2  0.284
2007 Motlicek P, Hermansky H, Ganapathy S, Garudadri H. Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4629: 350-357.  0.283
2014 Kintzley K, Jansen A, Hermansky H. Featherweight phonetic keyword search for conversational speech Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7859-7863. DOI: 10.1109/ICASSP.2014.6855130  0.283
2011 Carlin MA, Thomas S, Jansen A, Hermansky H. Rapid evaluation of speech representations for spoken term discovery Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 821-824.  0.283
2013 Schatz T, Peddinti V, Bach F, Jansen A, Hermansky H, Dupoux E. Evaluating speech features with the minimal-pair ABX task: Analysis of the classical MFC/PLP pipeline Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1781-1785.  0.273
2008 Sivaram GSVS, Hermansky H. Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition European Signal Processing Conference.  0.27
2009 Pavel M, Slaney M, Hermansky H. Reconciliation of human and machine speech recognition performance Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1669-1672. DOI: 10.1109/ICASSP.2009.4959922  0.268
2013 Hermansky H, Cohen JR, Stern RM. Perceptual properties of current speech recognition technology Proceedings of the Ieee. 101: 1968-1985. DOI: 10.1109/JPROC.2013.2252316  0.267
2003 Kajarekar SS, Hermansky H. Analysis of information in speech based on MANOVA Advances in Neural Information Processing Systems.  0.263
2006 Motlíček P, Hermansky H, Garudadri H, Srinivasamurthy N. Speech coding based on spectral dynamics Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4188: 471-478.  0.262
2013 Ma J, Zhang B, Matsoukas S, Mallidi SH, Li F, Hermansky H. Improvements in language identification on the RATS noisy speech corpus Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 69-73.  0.255
2006 Fousek P, Hermansky H. Towards asr based on hierarchical posterior-based keyword recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I433-I436.  0.252
2007 Valente F, Vepa J, Hermansky H. Multi-stream features combination based on Dempster-Shafer rule for LVCSR system Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 273-276.  0.251
2012 Ganapathy S, Hermansky H. Robust phoneme recognition using high resolution temporal envelopes 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1826-1829.  0.249
2014 Schatz T, Peddinti V, Cao XN, Bach F, Hermansky H, Dupoux E. Evaluating speech features with the Minimal-Pair ABX task (II): Resistance to noise Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 915-919.  0.246
2012 Hirsch HG, Ganapathy S, Hermansky H. Comparison of different approaches for speech recognition in hands-free mode Proceedings of 10th Itg Symposium On Speech Communication.  0.243
2013 Hermansky H, Variani E, Peddinti V. Mean temporal distance: Predicting ASR error from temporal properties of speech signal Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7423-7426. DOI: 10.1109/ICASSP.2013.6639105  0.239
2009 Motlicek P, Ganapathy S, Hermansky H. Arithmetic coding of sub-band residuals in FDLP speech/audio Codec Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2591-2594.  0.237
2010 Sivaram GSVS, Ganapathy S, Hermansky H. Sparse auto-associative neural networks: Theory and application to speech recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2270-2273.  0.231
2003 Matějka P, Schwarz P, Hermansky H, Černocky J. Phoneme recognition using temporal patterns Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science). 2807: 198-205.  0.231
2002 Sivadas S, Hermansky H. Hierarchical tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I/809-I/812.  0.224
1998 Yegnanarayana B, Avendano C, Murthy PS, Hermansky H. Enhancement of reverberant speech using LP residual Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 405-408.  0.222
1989 Broad DJ, Hermansky H. The front‐cavity/F2′ hypothesis tested by data on tongue movements The Journal of the Acoustical Society of America. 86: S113-S114. DOI: 10.1121/1.2027307  0.218
2003 Sivadas S, Hermansky H. Generalized Tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 56-59.  0.216
2010 Delbruck T, Koch T, Berner R, Hermansky H. Fully integrated 500uW speech detection wake-up circuit Iscas 2010 - 2010 Ieee International Symposium On Circuits and Systems: Nano-Bio Circuit Fabrics and Systems. 2015-2018. DOI: 10.1109/ISCAS.2010.5537160  0.214
2008 White C, Zweig G, Burget L, Schwarz P, Hermansky H. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4085-4088. DOI: 10.1109/ICASSP.2008.4518552  0.213
2000 Yang HH, Hermansky H. Search for information bearing components in speech Advances in Neural Information Processing Systems. 803-809.  0.212
2007 Ketabdar H, Hannemann M, Hermansky H. Detection of out-of-vocabulary words in posterior based ASR International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007. 4: 2772-2775.  0.208
2005 Hermansky H, Fousek P, Lehtonen M. The role of speech in multimodal human-computer interaction (towards reliable rejection of non-keyword input) Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3658: 2-8.  0.208
2023 Luo S, Angrick M, Coogan C, Candrea DN, Wyse-Sookoo K, Shah S, Rabbani Q, Milsap GW, Weiss AR, Anderson WS, Tippett DC, Maragakis NJ, Clawson LL, Vansteensel MJ, Wester BA, ... ... Hermansky H, et al. Stable Decoding from a Speech BCI Enables Control for an Individual with ALS without Recalibration for 3 Months. Advanced Science (Weinheim, Baden-Wurttemberg, Germany). e2304853. PMID 37875404 DOI: 10.1002/advs.202304853  0.206
2016 Mallidi SH, Ogawa T, Hermansky H. Uncertainty estimation of DNN classifiers 2015 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2015 - Proceedings. 283-288. DOI: 10.1109/ASRU.2015.7404806  0.203
2008 Pinto J, Yegnanarayana B, Hermansky H, Magimai-Doss M. Exploiting contextual information for improved phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4449-4452. DOI: 10.1109/ICASSP.2008.4518643  0.203
2008 Pinto J, Sivaram GSVS, Hermansky H. Reverse correlation for analyzing MLP posterior features in ASR Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 469-476. DOI: 10.1007/978-3-540-87391-4_60  0.201
2011 Zweig G, Nguyen P, Van Compernolle D, Demuynck K, Atlas L, Clark P, Sell G, Wang M, Sha F, Hermansky H, Karakos D, Jansen A, Thomas S, S GSVS, Bowman S, et al. Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5044-5047. DOI: 10.1109/ICASSP.2011.5947490  0.198
2007 Motlicek P, Ullal V, Hermansky H. Wide-band perceptual audio coding based on frequency-domain linear prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I265-I268. DOI: 10.1109/ICASSP.2007.366667  0.198
2007 Valente F, Vepa J, Plahl C, Gollan C, Hermansky H, Schlüter R. Hierarchical Neural Networks feature extraction for LVCSR system Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 265-268.  0.195
2004 Sivadas S, Hermansky H. On use of task independent training data in tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I541-I544.  0.195
2012 Kintzley K, Jansen A, Hermansky H. MAP estimation of whole-word acoustic models with dictionary priors 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 786-789.  0.191
2008 Valente F, Hermansky H. On the combination of auditory and modulation frequency channels for ASR applications Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2242-2245.  0.186
2008 Tošić T, Magimai-Doss M, Hermansky H. Using comparison of parallel phoneme probability streams for OOV word detection European Signal Processing Conference.  0.181
2013 Li F, Hermansky H. Effect of filter bandwidth and spectral sampling rate of analysis filterbank on automatic phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7121-7124. DOI: 10.1109/ICASSP.2013.6639044  0.177
2012 Variani E, Hermansky H. Estimating classifier performance in unknown noise 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 2: 1798-1801.  0.173
2012 Li F, Mallidi SH, Hermansky H. Phone recognition in critical bands using sub-band temporal modulations 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1814-1817.  0.169
2009 Ganapathy S, Motlicek P, Hermansky H. MDCT for encoding residual signals in frequency domain linear prediction 127th Audio Engineering Society Convention 2009. 2: 1103-1110.  0.168
2010 Thomas S, Ganapathy S, Hermansky H. Cross-lingual and multi-stream posterior features for low resource LVCSR systems Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 877-880.  0.168
2007 Pinto J, Lovitt A, Hermansky H. Exploiting phoneme similarities in hybrid HMM-ANN keyword spotting International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007. 4: 2388-2391.  0.161
2007 Valente F, Hermansky H. Combination of acoustic classifiers based on Dempster-Shafer theory of evidence Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4: IV1129-IV1132. DOI: 10.1109/ICASSP.2007.367273  0.158
2006 Valente F, Hermansky H. Discriminant linear processing of time-frequency plane Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 349-352.  0.151
2023 Angrick M, Luo S, Rabbani Q, Candrea DN, Shah S, Milsap GW, Anderson WS, Gordon CR, Rosenblatt KR, Clawson L, Maragakis N, Tenore FV, Fifer MS, Hermansky H, Ramsey NF, et al. Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS. Medrxiv : the Preprint Server For Health Sciences. PMID 37425721 DOI: 10.1101/2023.06.30.23291352  0.144
2008 Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Autoregressive modelling of Hilbert envelopes for wide-band audio coding Audio Engineering Society - 124th Audio Engineering Society Convention 2008. 3: 1620-1627.  0.143
2010 Hermansky H. History of modulation spectrum in ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5458-5461. DOI: 10.1109/ICASSP.2010.5494907  0.138
2002 Adami AG, Kajarekar SS, Hermansky H. A new speaker change detection method for two-speaker segmentation Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4: IV/3908-IV/3911.  0.135
2008 Sivaram GSVS, Hermansky H. Emulating temporal receptive fields of higher level auditory neurons for ASR Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 509-516. DOI: 10.1007/978-3-540-87391-4_65  0.13
2013 Peddinti V, Hermansky H. Filter-bank optimization for Frequency Domain Linear Prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7102-7106. DOI: 10.1109/ICASSP.2013.6639040  0.128
2013 Ogawa T, Li F, Hermansky H. Stream selection and integration in multistream ASR using GMM-based performance monitoring Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 3332-3336.  0.114
2011 Sivaram GSVS, Thomas S, Hermansky H. Mixture of auto-associative neural networks for speaker verification Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2381-2384.  0.11
2008 Valente F, Hermansky H. Hierarchical and parallel processing of modulation spectrum for ASR applications Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4165-4168. DOI: 10.1109/ICASSP.2008.4518572  0.109
2001 Kajarekar SS, Yegnanarayana B, Hermansky H. A study of two dimensional linear discriminants for ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 137-140.  0.105
2009 Pinto J, Sivaram GSVS, Hermansky H, Magimai-Doss M. Volterra series for analyzing MLP based phoneme posterior estimator Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1813-1816. DOI: 10.1109/ICASSP.2009.4959958  0.099
2005 Hermansky H, Fousek P. Multi-resolution RASTA filtering for TANDEM-based ASR 9th European Conference On Speech Communication and Technology. 361-364.  0.087
2012 Kintzley K, Jansen A, Church K, Hermansky H. Inverting the point process model for fast phonetic keyword search 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 2437-2440.  0.084
2005 Verhelst W, Herre J, Kubin G, Hermansky H, Jensen SH. Eurasip Journal on Applied Signal Processing: Editorial Eurasip Journal On Applied Signal Processing. 2005: 1289-1291. DOI: 10.1155/ASP.2005.1289  0.084
2009 Stricker C, Wagen JF, Aradilla G, Bourlard H, Hermansky H, Pinto J, Rey PH, Théraulaz J. Intelligent multi-modal interfaces for mobile applications in hostile environment (IM-HOST) Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5440: 71-102. DOI: 10.1007/978-3-642-00437-7_4  0.083
2008 Burget L, Schwarz P, Matějka P, Hannemann M, Rastrow A, White C, Khudanpur S, Hermansky H, Černocký J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4081-4084. DOI: 10.1109/ICASSP.2008.4518551  0.078
2011 Kintzley K, Jansen A, Hermansky H. Event selection from phone posteriorgrams using matched filters Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1905-1908.  0.068
2012 Anemüller J, Caputo B, Hermansky H, Ohl FW, Pajdla T, Pavel M, Van Gool L, Vogels R, Wabnik S, Weinshall D. DIRAC: Detection and identification of rare audio-visual events Studies in Computational Intelligence. 384: 3-35. DOI: 10.1007/978-3-642-24034-8_1  0.043
2008 Anemüller J, Bach JH, Caputo B, Havlena M, Jie L, Kayser H, Leibe B, Motlicek P, Pajdla T, Pavel M, Torii A, Gool LV, Zweig A, Hermansky H. The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events Icmi'08: Proceedings of the 10th International Conference On Multimodal Interfaces. 289-292. DOI: 10.1145/1452392.1452451  0.038
2010 Jansen A, Church K, Hermansky H. Towards spoken term discovery at scale with zero resources Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 1676-1679.  0.035
2015 Hermansky H, Burget L, Cohen J, Dupoux E, Feldman N, Godfrey J, Khudanpur S, Maciejewski M, Mallidi SH, Menon A, Ogawa T, Peddinti V, Rose R, Stern R, Wiesner M, et al. Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2015: 5009-5013. DOI: 10.1109/ICASSP.2015.7178924  0.019
2000 Hermansky H. Preface Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1902: V.  0.01