Hynek Hermansky - Publications

Area: automatic speech recognition

Year	Citation	Score
2022	Kayser H, Hermansky H, Meyer BT. Spatial speech detection for binaural hearing aids using deep phoneme classifiers. Acta Acustica. European Acoustics Association. 6. PMID 36159631 DOI: 10.1051/aacus/2022013	0.351
2020	Li R, Wang X, Mallidi SH, Watanabe S, Hori T, Hermansky H. Multi-Stream End-to-End Speech Recognition Ieee/Acm Transactions On Audio, Speech, and Language Processing. 28: 646-655. DOI: 10.1109/TASLP.2019.2959721	0.381
2019	Mahajan NR, Mesgarani N, Hermansky H. General properties of auditory spectro-temporal receptive fields. The Journal of the Acoustical Society of America. 146: EL459. PMID 31893764 DOI: 10.1121/1.5135021	0.65
2019	Hermansky H. Coding and decoding of messages in human speech communication: Implications for machine recognition of speech Speech Communication. 106: 112-117. DOI: 10.1016/J.SPECOM.2018.12.004	0.503
2019	Castro Martinez AM, Gerlach L, Payá-Vayá G, Hermansky H, Ooster J, Meyer BT. DNN-based performance measures for predicting error rates in automatic speech recognition and optimizing hearing aid parameters Speech Communication. 106: 44-56. DOI: 10.1016/j.specom.2018.11.006	0.45
2016	Hsiao R, Ma J, Hartmann W, Karafiát M, Grézl F, Burget L, Szöke I, Černocky JH, Watanabe S, Chen Z, Mallidi SH, Hermansky H, Tsakalidis S, Schwartz R. Robust speech recognition in unknown reverberant and noisy conditions 2015 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2015 - Proceedings. 533-538. DOI: 10.1109/ASRU.2015.7404841	0.516
2014	Ganapathy S, Mallidi SH, Hermansky H. Robust feature extraction using modulation filtering of autoregressive models Ieee Transactions On Audio, Speech and Language Processing. 22: 1285-1295. DOI: 10.1109/Taslp.2014.2329190	0.675
2014	Mahajan N, Mesgarani N, Hermansky H. Principal components of auditory spectro-temporal receptive fields Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1983-1987.	0.571
2014	Li F, Nidadavolu PS, Hermansky H. A long, deep and wide artificial neural net for robust speech recognition in unknown noise Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 358-362.	0.322
2013	Garimella S, Hermansky H. Factor analysis of auto-associative neural networks with application in speaker verification. Ieee Transactions On Neural Networks and Learning Systems. 24: 522-8. PMID 24808374 DOI: 10.1109/Tnnls.2012.2236652	0.713
2013	Hermansky H. Multistream recognition of speech: Dealing with unknown unknowns Proceedings of the Ieee. 101: 1076-1088. DOI: 10.1109/JPROC.2012.2236871	0.4
2013	Jansen A, Dupoux E, Goldwater S, Johnson M, Khudanpur S, Church K, Feldman N, Hermansky H, Metze F, Rose R, Seltzer M, Clark P, McGraw I, Varadarajan B, Bennett E, et al. A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8111-8115. DOI: 10.1109/ICASSP.2013.6639245	0.502
2013	Jansen A, Thomas S, Hermansky H. Weak top-down constraints for unsupervised acoustic model training Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8091-8095. DOI: 10.1109/ICASSP.2013.6639241	0.456
2013	Plchot O, Matsoukas S, Matejka P, Dehak N, Ma J, Cumani S, Glembek O, Hermansky H, Mallidi SH, Mesgarani N, Schwartz R, Soufifar M, Tan ZH, Thomas S, Zhang B, et al. Developing a speaker identification system for the DARPA RATS project Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 6768-6772. DOI: 10.1109/ICASSP.2013.6638972	0.605
2013	Thomas S, Seltzer ML, Church K, Hermansky H. Deep neural network features and semi-supervised training for low resource speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 6704-6708. DOI: 10.1109/ICASSP.2013.6638959	0.539
2013	Kintzley K, Jansen A, Hermansky H. Text-to-speech inspired duration modeling for improved whole-word acoustic models Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1253-1257.	0.328
2013	Variani E, Li F, Hermansky H. Multi-stream recognition of noisy speech with performance monitoring Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2978-2981.	0.37
2013	Mallidi SH, Ganapathy S, Hermansky H. Robust speaker recognition using spectro-temporal autoregressive models Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 3689-3693.	0.327
2012	Ganapathy S, Hermansky H. Temporal resolution analysis in frequency domain linear prediction. The Journal of the Acoustical Society of America. 132: EL436-42. PMID 23145707 DOI: 10.1121/1.4758826	0.661
2012	Sivaram GSVS, Hermansky H. Sparse multilayer perceptron for phoneme recognition Ieee Transactions On Audio, Speech and Language Processing. 20: 23-29. DOI: 10.1109/TASL.2011.2129510	0.412
2012	Garimella S, Mallidi SH, Hermansky H. Regularized auto-associative neural networks for speaker verification Ieee Signal Processing Letters. 19: 841-844. DOI: 10.1109/Lsp.2012.2221706	0.725
2012	Thomas S, Ganapathy S, Hermansky H. Multilingual MLP features for low-resource LVCSR systems Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4269-4272. DOI: 10.1109/ICASSP.2012.6288862	0.624
2012	Garcia-Romero D, Zhou X, Zotkin D, Srinivasan B, Luo Y, Ganapathy S, Thomas S, Nemala S, Sivaram GSVS, Mirbagheri M, Mallidi SH, Janu T, Rajan P, Mesgarani N, Elhilali M, ... Hermansky H, et al. The UMD-JHU 2011 speaker recognition system Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4229-4232. DOI: 10.1109/ICASSP.2012.6288852	0.769
2012	Ikbal S, Misra H, Hermansky H, Magimai-Doss M. Phase AutoCorrelation (PAC) features for noise robust speech recognition Speech Communication. 54: 867-880. DOI: 10.1016/j.specom.2012.02.005	0.471
2012	Thomas S, Mallidi SH, Janu T, Hermansky H, Mesgarani N, Zhou X, Shamma S, Ng T, Zhang B, Nguyen L, Matsoukas S. Acoustic and data-driven features for robust speech activity detection 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1983-1986.	0.736
2012	Jansen A, Thomas S, Hermansky H. Intrinsic spectral analysis for zero and high resource speech recognition 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 878-881.	0.36
2012	Thomas S, Ganapathy S, Jansen A, Hermansky H. Data-driven posterior features for low resource speech recognition applications 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 790-793.	0.375
2011	Mesgarani N, Thomas S, Hermansky H. Toward optimizing stream fusion in multistream recognition of speech. The Journal of the Acoustical Society of America. 130: EL14-8. PMID 21786862 DOI: 10.1121/1.3595744	0.743
2011	Hermansky H. Dealing with unknown unknowns in speech The Journal of the Acoustical Society of America. 130: 2408-2408. DOI: 10.1121/1.3654655	0.428
2011	Pinto J, Garimella S, Magimai-Doss M, Hermansky H, Bourlard H. Analysis of MLP-based hierarchical phoneme posterior probability estimator Ieee Transactions On Audio, Speech and Language Processing. 19: 225-241. DOI: 10.1109/Tasl.2010.2045943	0.395
2011	Thomas S, Nguyen P, Zweig G, Hermansky H. MLP based phoneme detectors for automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5024-5027. DOI: 10.1109/ICASSP.2011.5947485	0.598
2011	Ganapathy S, Rajan P, Hermansky H. Multi-layer perceptron based speech activity detection for speaker verification Ieee Workshop On Applications of Signal Processing to Audio and Acoustics. 321-324. DOI: 10.1109/ASPAA.2011.6082323	0.569
2011	Hermansky H. Speech recognition from spectral dynamics Sadhana - Academy Proceedings in Engineering Sciences. 36: 729-744. DOI: 10.1007/s12046-011-0044-2	0.518
2011	Hermansky H. Dealing with unexpected words in automatic recognition of speech Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6836: 1-15. DOI: 10.1007/978-3-642-23538-2_1	0.372
2011	Mallidi SH, Ganapathy S, Hermansky H. Modulation spectrum analysis for recognition of reverberant speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 189-192.	0.412
2011	Mesgarani N, Thomas S, Hermansky H. Adaptive stream fusion in multistream recognition of speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2329-2332.	0.671
2010	Ganapathy S, Thomas S, Hermansky H. Temporal envelope compensation for robust phoneme recognition using modulation spectrum. The Journal of the Acoustical Society of America. 128: 3769-80. PMID 21218908 DOI: 10.1121/1.3504658	0.754
2010	Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction Eurasip Journal On Audio, Speech, and Music Processing. 2010: 1-14. DOI: 10.1155/2010/856280	0.532
2010	Hermansky H. Posterior‐based attributes in machine recognition of speech. The Journal of the Acoustical Society of America. 127: 2041-2041. DOI: 10.1121/1.3385373	0.483
2010	Ganapathy S, Motlicek P, Hermansky H. Autoregressive models of amplitude modulations in audio compression Ieee Transactions On Audio, Speech and Language Processing. 18: 1624-1631. DOI: 10.1109/Tasl.2009.2038813	0.628
2010	Sivaram GSVS, Nemala SK, Mesgarani N, Hermansky H. Data-driven and feedback based spectro-temporal features for speech recognition Ieee Signal Processing Letters. 17: 957-960. DOI: 10.1109/Lsp.2010.2079930	0.691
2010	Liu SC, Mesgarani N, Harris J, Hermansky H. The use of spike-based representations for hardware audition systems Iscas 2010 - 2010 Ieee International Symposium On Circuits and Systems: Nano-Bio Circuit Fabrics and Systems. 505-508. DOI: 10.1109/ISCAS.2010.5537588	0.558
2010	Ganapathy S, Thomas S, Hermansky H. Robust spectro-temporal features based on autoregressive models of Hilbert envelopes Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4286-4289. DOI: 10.1109/ICASSP.2010.5495668	0.681
2010	Sivaram GSVS, Nemala SK, Elhilali M, Tran TD, Hermansky H. Sparse coding for speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4346-4349. DOI: 10.1109/ICASSP.2010.5495649	0.332
2010	Ganapathy S, Thomas S, Hermansky H. Comparison of modulation features for phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5038-5041. DOI: 10.1109/ICASSP.2010.5495057	0.699
2010	Thomas S, Patil K, Ganapathy S, Mesgarani N, Hermansky H. A phoneme recognition framework based on auditory spectro-temporal receptive fields Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2458-2461.	0.628
2010	Mesgarani N, Thomas S, Hermansky H. A multistream multiresolution framework for phoneme recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 318-321.	0.645
2009	Ganapathy S, Thomas S, Hermansky H. Modulation frequency features for phoneme recognition in noisy speech. The Journal of the Acoustical Society of America. 125: EL8-12. PMID 19173383 DOI: 10.1121/1.3040022	0.762
2009	Hermansky H. Nonlinear mapping for feature extraction in automatic speech recognition The Journal of the Acoustical Society of America. 125: 4109. DOI: 10.1121/1.3155499	0.449
2009	Thomas S, Ganapathy S, Hermansky H. Phoneme recognition using spectral envelope and modulation frequency features Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4453-4456. DOI: 10.1109/ICASSP.2009.4960618	0.695
2009	Ganapathy S, Thomas S, Hermansky H. Temporal envelope subtraction for robust speech recognition using modulation spectrum Proceedings of the 2009 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2009. 164-169. DOI: 10.1109/ASRU.2009.5372922	0.731
2009	Ganapathy S, Thomas S, Motlicek P, Hermansky H. Applications of signal analysis using autoregressive models for amplitude modulation Ieee Workshop On Applications of Signal Processing to Audio and Acoustics. 341-344. DOI: 10.1109/ASPAA.2009.5346495	0.621
2009	Ganapathy S, Motlicek P, Hermansky H. Error resilient speech coding using sub-band hilbert envelopes Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5729: 355-362. DOI: 10.1007/978-3-642-04208-9_49	0.541
2009	Thomas S, Ganapathy S, Hermansky H. Tandem representations of spectral envelope and modulation frequency features for ASR Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2955-2958.	0.305
2009	Mesgarani N, Sivaram GSVS, Nemala SK, Elhilali M, Hermansky H. Discriminant spectrotemporal features for phoneme recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2983-2986.	0.662
2009	Ganapathy S, Thomas S, Hermansky H. Static and dynamic modulation spectrum for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2823-2826.	0.329
2009	Kombrink S, Burget L, Matějka P, Karafiát M, Hermansky H. Posterior-based out of vocabulary word detection in telephone speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 80-83.	0.317
2008	Thomas S, Ganapathy S, Hermansky H. Recognition of reverberant speech using frequency domain linear prediction Ieee Signal Processing Letters. 15: 681-684. DOI: 10.1109/Lsp.2008.2002708	0.76
2008	Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Temporal masking for bit-rate reduction in audio codec based on Frequency Domain Linear Prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4781-4784. DOI: 10.1109/ICASSP.2008.4518726	0.538
2008	Krishnan Parthasarathi SH, Motlíček P, Hermansky H. Exploiting contextual information for speech/non-speech detection Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 451-459. DOI: 10.1007/978-3-540-87391-4_58	0.316
2008	Motlíček P, Ganapathy S, Hermansky H, Garudadri H, Athineos M. Perceptually motivated sub-band decomposition for FDLP audio coding Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 435-442. DOI: 10.1007/978-3-540-87391-4_56	0.499
2008	Thomas S, Ganapathy S, Hermansky H. Hilbert envelope based features for far-field speech recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5237: 119-124. DOI: 10.1007/978-3-540-85853-9-11	0.394
2008	Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Frequency domain linear prediction for QMF sub-bands and applications to audio coding Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4892: 248-258. DOI: 10.1007/978-3-540-78155-4_22	0.525
2008	Thomas S, Ganapathy S, Hermansky H. Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain European Signal Processing Conference.	0.394
2008	Ganapathy S, Thomas S, Hermansky H. Front-end for far-field speech recognition based on frequency domain linear prediction Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 984-987.	0.31
2008	Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Spectral noise shaping: Improvements in speech/audio codec based on linear prediction in spectral domain Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 675-678.	0.304
2008	Sivaram GSVS, Hermansky H. Introducing temporal asymmetries in feature extraction for automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 890-893.	0.411
2008	Thomas S, Ganapathy S, Hermansky H. Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1521-1524.	0.458
2007	Prasanna SRM, Hermansky H. MRASTA and PLP in automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 137-140.	0.416
2005	Morgan N, Zhu Q, Stolcke A, Sönmez K, Sivadas S, Shinozaki T, Ostendorf M, Jain P, Hermansky H, Ellis D, Doddington G, Chen B, Çetin O, Bourlard H, Athineos M. Pushing the envelope - Aside Ieee Signal Processing Magazine. 22: 81-88. DOI: 10.1109/Msp.2005.1511826	0.505
2004	Ikbal S, Misra H, Bourlard H, Hermansky H. Phase AutoCorrelation (PAC) features in entropy based multi-stream for robust speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I205-I208.	0.377
2003	Hermansky H. Recognition of information‐bearing elements in speech The Journal of the Acoustical Society of America. 114: 2424-2424. DOI: 10.1121/1.4778809	0.496
2003	Hermansky H. TRAP-TANDEM: Data-driven extraction of temporal features from speech 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 255-260. DOI: 10.1109/ASRU.2003.1318450	0.343
2003	Malayath N, Hermansky H. Data-driven spectral basis functions for automatic speech recognition Speech Communication. 40: 449-466. DOI: 10.1016/S0167-6393(02)00127-9	0.515
2000	Hermansky H. Method and system for generating an estimated clean speech signal from a noisy speech signal The Journal of the Acoustical Society of America. 107: 1816. DOI: 10.1121/1.428550	0.415
2000	Yang HH, Van Vuuren S, Sharma S, Hermansky H. Relevance of time-frequency features for phonetic and speaker-channel classification Speech Communication. 31: 35-50. DOI: 10.1016/S0167-6393(00)00007-8	0.406
2000	Malayath N, Hermansky H, Kajarekar S, Yegnanarayana B. Data-driven temporal filters and alternatives to GMM in speaker verification Digital Signal Processing: a Review Journal. 10: 55-74. DOI: 10.1006/dspr.1999.0363	0.363
2000	Kajarekar SS, Hermansky H. Analysis of information in speech and its application in speech recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1902: 283-288.	0.357
1999	Arai T, Pavel M, Hermansky H, Avendano C. Syllable intelligibility for temporally filtered LPC cepstral trajectories. The Journal of the Acoustical Society of America. 105: 2783-91. PMID 10335630 DOI: 10.1121/1.426895	0.359
1999	Hermansky H. Data‐driven speech analysis for ASR The Journal of the Acoustical Society of America. 105: 1352-1352. DOI: 10.1121/1.426410	0.505
1999	Sharma S, Hermansky H. Recognition of speech from temporal patterns The Journal of the Acoustical Society of America. 105: 1158-1158. DOI: 10.1121/1.425505	0.499
1999	Kanedera N, Arai T, Hermansky H, Pavel M. On the relative importance of various components of the modulation spectrum for automatic speech recognition Speech Communication. 28: 43-55. DOI: 10.1016/S0167-6393(99)00002-3	0.499
1999	Yegnanarayana B, Avendano C, Hermansky H, Satyanarayana Murthy P. Speech enhancement using linear prediction residual Speech Communication. 28: 25-42. DOI: 10.1016/S0167-6393(98)00070-3	0.453
1998	Kanedera N, Hermansky H, Arai T. On properties of modulation spectrum for robust automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2: 613-616. DOI: 10.1109/ICASSP.1998.675339	0.408
1998	Hermansky H. Should recognizers have ears? Speech Communication. 25: 3-27. DOI: 10.1016/S0167-6393(98)00027-2	0.466
1997	Hermansky H, Morgan NH. Noise resistant auditory model for parameterization of speech The Journal of the Acoustical Society of America. 101: 2426. DOI: 10.1121/1.418514	0.41
1997	Avendano C, Hermansky H. On the effects of short-term spectrum smoothing in channel normalization Ieee Transactions On Speech and Audio Processing. 5: 372-374. DOI: 10.1109/89.593318	0.301
1996	Hermansky H. Beyond a ‘‘short‐term’’ analysis of speech The Journal of the Acoustical Society of America. 100: 2792-2792. DOI: 10.1121/1.416495	0.475
1996	Arai T, Pavel M, Hermansky H, Avendano C. Intelligibility of speech with filtered time trajectories of LPC cepstrum The Journal of the Acoustical Society of America. 100: 2756-2756. DOI: 10.1121/1.416322	0.458
1996	Bourlard H, Hermansky H, Morgan N. Towards increasing speech recognition error rates Speech Communication. 18: 205-231. DOI: 10.1016/0167-6393(96)00003-9	0.438
1995	Cole R, Hermansky H, Novick DG, Oviatt S, Hirschman L, Atlas L, Beckman M, Biermann A, Bush M, Clements M, Cohen J, Garcia O, Hanson B, Levinson S, McKeown K, et al. The Challenge of Spoken Language Systems: Research Directions for the Nineties Ieee Transactions On Speech and Audio Processing. 3: 1-21. DOI: 10.1109/89.365385	0.384
1995	Morgan N, Bourlard H, Greenberg S, Hermansky H, Wu SL. Stochastic perceptual models of speech Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 397-400.	0.312
1994	Pavel M, Hermansky H. Temporal masking in automatic speech recognition The Journal of the Acoustical Society of America. 95: 2876-2876. DOI: 10.1121/1.409409	0.527
1994	Hermansky H, Morgan N. RASTA Processing of Speech Ieee Transactions On Speech and Audio Processing. 2: 578-589. DOI: 10.1109/89.326616	0.49
1993	Junqua JC, Wakita H, Hermansky H. Evaluation and Optimization of Perceptually-Based ASR Front-End Ieee Transactions On Speech and Audio Processing. 1: 39-48. DOI: 10.1109/89.221366	0.459
1993	Hermansky H, Morgan N, Hirsch HG. Recognition of speech in additive and convolutional noise based on RASTA spectral processing Proceedings - Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing. 2: II-83-II-86.	0.334
1991	Morgan N, Hermansky H, Bourlard H, Kohn P, Wooters C. Continuous speech recognition using PLP analysis with multilayer perceptrons Proceedings - Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing. 1: 49-52.	0.392
1990	Hermansky H. Perceptual linear predictive (PLP) analysis of speech. The Journal of the Acoustical Society of America. 87: 1738-52. PMID 2341679 DOI: 10.1121/1.399423	0.458
1990	Hermansky H, Cox TL. Synthesis of speech from the low‐dimensional PLP representation The Journal of the Acoustical Society of America. 88: S179-S180. DOI: 10.1121/1.2028800	0.425
1988	Terry M, Hermansky H. Comparison of standard ASR front ends and auditory models in neural net‐based automatic speech recognition The Journal of the Acoustical Society of America. 83: S53-S53. DOI: 10.1121/1.2025401	0.441
1987	Hermansky H. Should ASR front‐end be insensitive to fundamental frequency? (perceptual shift of formant position due to fine harmonic structure of voiced speech) The Journal of the Acoustical Society of America. 82: S36-S36. DOI: 10.1121/1.2024778	0.384
1987	Hermansky H. Why is the formant frequency difference limen asymmetric? The Journal of the Acoustical Society of America. 81: S18-S18. DOI: 10.1121/1.2024129	0.355
1986	Hermansky H, Javkin HR. Evaluation of ASR front ends using synthetic vowel‐like sounds The Journal of the Acoustical Society of America. 80: S18-S18. DOI: 10.1121/1.2023687	0.316
1986	Tsuga K, Hermansky H. Effect of the spectral model order in automatic speech recognition The Journal of the Acoustical Society of America. 80: S18-S18. DOI: 10.1121/1.2023684	0.352
1985	Hanson BA, Hermansky H, Wakita H. Root‐power sums and spectral slope distortion measures for all‐pole models of speech The Journal of the Acoustical Society of America. 78: S49-S49. DOI: 10.1121/1.2022847	0.407
1985	Hermansky H, Hanson BA, Wakita H. Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain Speech Communication. 4: 181-187. DOI: 10.1016/0167-6393(85)90045-7	0.451
1984	Hermansky H, Hanson BA, Wakita H. Critical‐band‐weighted linear prediction of speech The Journal of the Acoustical Society of America. 76: S1-S1. DOI: 10.1121/1.2021743	0.406
Low-probability matches (unlikely to be authored by this person)
2004	Misra H, Ikbal S, Bourlard H, Hermansky H. Spectral entropy based feature for robust ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I193-I196.	0.298
2003	Ikbal S, Hermansky H, Bourlard H. Nonlinear spectral transformations for robust speech recognition 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 393-398. DOI: 10.1109/ASRU.2003.1318473	0.297
2008	Pinto J, Hermansky H. Combining evidence from a generative and a discriminative model in phoneme recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2414-2417.	0.293
2012	Weinshall D, Zweig A, Hermansky H, Kombrink S, Ohl FW, Anemüller J, Bach JH, Van Gool L, Nater F, Pajdla T, Havlena M, Pavel M. Beyond novelty detection: incongruent events, when general and specific classifiers disagree. Ieee Transactions On Pattern Analysis and Machine Intelligence. 34: 1886-901. PMID 22213766 DOI: 10.1109/Tpami.2011.279	0.288
2011	Sivaram GSVS, Hermansky H. Multilayer perceptron with sparse hidden outputs for phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5336-5339. DOI: 10.1109/ICASSP.2011.5947563	0.288
1991	Morgan N, Wooters C, Hermansky H. Experiments with temporal resolution for continuous speech recognition with multi-layer perceptrons Neural Networks For Signal Processing. 405-410.	0.288
2013	Clark P, Mallidi SH, Jansen A, Hermansky H. Frequency offset correction in speech without detecting pitch Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7020-7024. DOI: 10.1109/ICASSP.2013.6639023	0.287
2013	Hermansky H. Long, deep and wide artificial neural nets for dealing with unexpected noise in machine recognition of speech Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8082: 14-21. DOI: 10.1007/978-3-642-40585-3_2	0.284
2007	Motlicek P, Hermansky H, Ganapathy S, Garudadri H. Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4629: 350-357.	0.283
2014	Kintzley K, Jansen A, Hermansky H. Featherweight phonetic keyword search for conversational speech Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7859-7863. DOI: 10.1109/ICASSP.2014.6855130	0.283
2011	Carlin MA, Thomas S, Jansen A, Hermansky H. Rapid evaluation of speech representations for spoken term discovery Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 821-824.	0.283
2013	Schatz T, Peddinti V, Bach F, Jansen A, Hermansky H, Dupoux E. Evaluating speech features with the minimal-pair ABX task: Analysis of the classical MFC/PLP pipeline Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1781-1785.	0.273
2008	Sivaram GSVS, Hermansky H. Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition European Signal Processing Conference.	0.27
2009	Pavel M, Slaney M, Hermansky H. Reconciliation of human and machine speech recognition performance Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1669-1672. DOI: 10.1109/ICASSP.2009.4959922	0.268
2013	Hermansky H, Cohen JR, Stern RM. Perceptual properties of current speech recognition technology Proceedings of the Ieee. 101: 1968-1985. DOI: 10.1109/JPROC.2013.2252316	0.267
2003	Kajarekar SS, Hermansky H. Analysis of information in speech based on MANOVA Advances in Neural Information Processing Systems.	0.263
2006	Motlíček P, Hermansky H, Garudadri H, Srinivasamurthy N. Speech coding based on spectral dynamics Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4188: 471-478.	0.262
2013	Ma J, Zhang B, Matsoukas S, Mallidi SH, Li F, Hermansky H. Improvements in language identification on the RATS noisy speech corpus Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 69-73.	0.255
2006	Fousek P, Hermansky H. Towards asr based on hierarchical posterior-based keyword recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I433-I436.	0.252
2007	Valente F, Vepa J, Hermansky H. Multi-stream features combination based on Dempster-Shafer rule for LVCSR system Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 273-276.	0.251
2012	Ganapathy S, Hermansky H. Robust phoneme recognition using high resolution temporal envelopes 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1826-1829.	0.249
2014	Schatz T, Peddinti V, Cao XN, Bach F, Hermansky H, Dupoux E. Evaluating speech features with the Minimal-Pair ABX task (II): Resistance to noise Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 915-919.	0.246
2012	Hirsch HG, Ganapathy S, Hermansky H. Comparison of different approaches for speech recognition in hands-free mode Proceedings of 10th Itg Symposium On Speech Communication.	0.243
2013	Hermansky H, Variani E, Peddinti V. Mean temporal distance: Predicting ASR error from temporal properties of speech signal Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7423-7426. DOI: 10.1109/ICASSP.2013.6639105	0.239
2009	Motlicek P, Ganapathy S, Hermansky H. Arithmetic coding of sub-band residuals in FDLP speech/audio Codec Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2591-2594.	0.237
2010	Sivaram GSVS, Ganapathy S, Hermansky H. Sparse auto-associative neural networks: Theory and application to speech recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2270-2273.	0.231
2003	Matějka P, Schwarz P, Hermansky H, Černocky J. Phoneme recognition using temporal patterns Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science). 2807: 198-205.	0.231
2002	Sivadas S, Hermansky H. Hierarchical tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I/809-I/812.	0.224
1998	Yegnanarayana B, Avendano C, Murthy PS, Hermansky H. Enhancement of reverberant speech using LP residual Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 405-408.	0.222
1989	Broad DJ, Hermansky H. The front‐cavity/F2′ hypothesis tested by data on tongue movements The Journal of the Acoustical Society of America. 86: S113-S114. DOI: 10.1121/1.2027307	0.218
2003	Sivadas S, Hermansky H. Generalized Tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 56-59.	0.216
2010	Delbruck T, Koch T, Berner R, Hermansky H. Fully integrated 500uW speech detection wake-up circuit Iscas 2010 - 2010 Ieee International Symposium On Circuits and Systems: Nano-Bio Circuit Fabrics and Systems. 2015-2018. DOI: 10.1109/ISCAS.2010.5537160	0.214
2008	White C, Zweig G, Burget L, Schwarz P, Hermansky H. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4085-4088. DOI: 10.1109/ICASSP.2008.4518552	0.213
2000	Yang HH, Hermansky H. Search for information bearing components in speech Advances in Neural Information Processing Systems. 803-809.	0.212
2007	Ketabdar H, Hannemann M, Hermansky H. Detection of out-of-vocabulary words in posterior based ASR International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007. 4: 2772-2775.	0.208
2005	Hermansky H, Fousek P, Lehtonen M. The role of speech in multimodal human-computer interaction (towards reliable rejection of non-keyword input) Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3658: 2-8.	0.208
2023	Luo S, Angrick M, Coogan C, Candrea DN, Wyse-Sookoo K, Shah S, Rabbani Q, Milsap GW, Weiss AR, Anderson WS, Tippett DC, Maragakis NJ, Clawson LL, Vansteensel MJ, Wester BA, ... ... Hermansky H, et al. Stable Decoding from a Speech BCI Enables Control for an Individual with ALS without Recalibration for 3 Months. Advanced Science (Weinheim, Baden-Wurttemberg, Germany). e2304853. PMID 37875404 DOI: 10.1002/advs.202304853	0.206
2016	Mallidi SH, Ogawa T, Hermansky H. Uncertainty estimation of DNN classifiers 2015 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2015 - Proceedings. 283-288. DOI: 10.1109/ASRU.2015.7404806	0.203
2008	Pinto J, Yegnanarayana B, Hermansky H, Magimai-Doss M. Exploiting contextual information for improved phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4449-4452. DOI: 10.1109/ICASSP.2008.4518643	0.203
2008	Pinto J, Sivaram GSVS, Hermansky H. Reverse correlation for analyzing MLP posterior features in ASR Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 469-476. DOI: 10.1007/978-3-540-87391-4_60	0.201
2011	Zweig G, Nguyen P, Van Compernolle D, Demuynck K, Atlas L, Clark P, Sell G, Wang M, Sha F, Hermansky H, Karakos D, Jansen A, Thomas S, S GSVS, Bowman S, et al. Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5044-5047. DOI: 10.1109/ICASSP.2011.5947490	0.198
2007	Motlicek P, Ullal V, Hermansky H. Wide-band perceptual audio coding based on frequency-domain linear prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I265-I268. DOI: 10.1109/ICASSP.2007.366667	0.198
2007	Valente F, Vepa J, Plahl C, Gollan C, Hermansky H, Schlüter R. Hierarchical Neural Networks feature extraction for LVCSR system Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 265-268.	0.195
2004	Sivadas S, Hermansky H. On use of task independent training data in tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I541-I544.	0.195
2012	Kintzley K, Jansen A, Hermansky H. MAP estimation of whole-word acoustic models with dictionary priors 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 786-789.	0.191
2008	Valente F, Hermansky H. On the combination of auditory and modulation frequency channels for ASR applications Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2242-2245.	0.186
2008	Tošić T, Magimai-Doss M, Hermansky H. Using comparison of parallel phoneme probability streams for OOV word detection European Signal Processing Conference.	0.181
2013	Li F, Hermansky H. Effect of filter bandwidth and spectral sampling rate of analysis filterbank on automatic phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7121-7124. DOI: 10.1109/ICASSP.2013.6639044	0.177
2012	Variani E, Hermansky H. Estimating classifier performance in unknown noise 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 2: 1798-1801.	0.173
2012	Li F, Mallidi SH, Hermansky H. Phone recognition in critical bands using sub-band temporal modulations 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1814-1817.	0.169
2009	Ganapathy S, Motlicek P, Hermansky H. MDCT for encoding residual signals in frequency domain linear prediction 127th Audio Engineering Society Convention 2009. 2: 1103-1110.	0.168
2010	Thomas S, Ganapathy S, Hermansky H. Cross-lingual and multi-stream posterior features for low resource LVCSR systems Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 877-880.	0.168
2007	Pinto J, Lovitt A, Hermansky H. Exploiting phoneme similarities in hybrid HMM-ANN keyword spotting International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007. 4: 2388-2391.	0.161
2007	Valente F, Hermansky H. Combination of acoustic classifiers based on Dempster-Shafer theory of evidence Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4: IV1129-IV1132. DOI: 10.1109/ICASSP.2007.367273	0.158
2006	Valente F, Hermansky H. Discriminant linear processing of time-frequency plane Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 349-352.	0.151
2023	Angrick M, Luo S, Rabbani Q, Candrea DN, Shah S, Milsap GW, Anderson WS, Gordon CR, Rosenblatt KR, Clawson L, Maragakis N, Tenore FV, Fifer MS, Hermansky H, Ramsey NF, et al. Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS. Medrxiv : the Preprint Server For Health Sciences. PMID 37425721 DOI: 10.1101/2023.06.30.23291352	0.144
2008	Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Autoregressive modelling of Hilbert envelopes for wide-band audio coding Audio Engineering Society - 124th Audio Engineering Society Convention 2008. 3: 1620-1627.	0.143
2010	Hermansky H. History of modulation spectrum in ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5458-5461. DOI: 10.1109/ICASSP.2010.5494907	0.138
2002	Adami AG, Kajarekar SS, Hermansky H. A new speaker change detection method for two-speaker segmentation Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4: IV/3908-IV/3911.	0.135
2008	Sivaram GSVS, Hermansky H. Emulating temporal receptive fields of higher level auditory neurons for ASR Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 509-516. DOI: 10.1007/978-3-540-87391-4_65	0.13
2013	Peddinti V, Hermansky H. Filter-bank optimization for Frequency Domain Linear Prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7102-7106. DOI: 10.1109/ICASSP.2013.6639040	0.128
2013	Ogawa T, Li F, Hermansky H. Stream selection and integration in multistream ASR using GMM-based performance monitoring Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 3332-3336.	0.114
2011	Sivaram GSVS, Thomas S, Hermansky H. Mixture of auto-associative neural networks for speaker verification Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2381-2384.	0.11
2008	Valente F, Hermansky H. Hierarchical and parallel processing of modulation spectrum for ASR applications Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4165-4168. DOI: 10.1109/ICASSP.2008.4518572	0.109
2001	Kajarekar SS, Yegnanarayana B, Hermansky H. A study of two dimensional linear discriminants for ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 137-140.	0.105
2009	Pinto J, Sivaram GSVS, Hermansky H, Magimai-Doss M. Volterra series for analyzing MLP based phoneme posterior estimator Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1813-1816. DOI: 10.1109/ICASSP.2009.4959958	0.099
2005	Hermansky H, Fousek P. Multi-resolution RASTA filtering for TANDEM-based ASR 9th European Conference On Speech Communication and Technology. 361-364.	0.087
2012	Kintzley K, Jansen A, Church K, Hermansky H. Inverting the point process model for fast phonetic keyword search 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 2437-2440.	0.084
2005	Verhelst W, Herre J, Kubin G, Hermansky H, Jensen SH. Eurasip Journal on Applied Signal Processing: Editorial Eurasip Journal On Applied Signal Processing. 2005: 1289-1291. DOI: 10.1155/ASP.2005.1289	0.084
2009	Stricker C, Wagen JF, Aradilla G, Bourlard H, Hermansky H, Pinto J, Rey PH, Théraulaz J. Intelligent multi-modal interfaces for mobile applications in hostile environment (IM-HOST) Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5440: 71-102. DOI: 10.1007/978-3-642-00437-7_4	0.083
2008	Burget L, Schwarz P, Matějka P, Hannemann M, Rastrow A, White C, Khudanpur S, Hermansky H, Černocký J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4081-4084. DOI: 10.1109/ICASSP.2008.4518551	0.078
2011	Kintzley K, Jansen A, Hermansky H. Event selection from phone posteriorgrams using matched filters Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1905-1908.	0.068
2012	Anemüller J, Caputo B, Hermansky H, Ohl FW, Pajdla T, Pavel M, Van Gool L, Vogels R, Wabnik S, Weinshall D. DIRAC: Detection and identification of rare audio-visual events Studies in Computational Intelligence. 384: 3-35. DOI: 10.1007/978-3-642-24034-8_1	0.043
2008	Anemüller J, Bach JH, Caputo B, Havlena M, Jie L, Kayser H, Leibe B, Motlicek P, Pajdla T, Pavel M, Torii A, Gool LV, Zweig A, Hermansky H. The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events Icmi'08: Proceedings of the 10th International Conference On Multimodal Interfaces. 289-292. DOI: 10.1145/1452392.1452451	0.038
2010	Jansen A, Church K, Hermansky H. Towards spoken term discovery at scale with zero resources Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 1676-1679.	0.035
2015	Hermansky H, Burget L, Cohen J, Dupoux E, Feldman N, Godfrey J, Khudanpur S, Maciejewski M, Mallidi SH, Menon A, Ogawa T, Peddinti V, Rose R, Stern R, Wiesner M, et al. Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2015: 5009-5013. DOI: 10.1109/ICASSP.2015.7178924	0.019
2000	Hermansky H. Preface Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1902: V.	0.01