Hynek Hermansky - Publications

Affiliations: Johns Hopkins University, Baltimore, MD
Area: automatic speech recognition

150 high-probability publications.

Year Citation  Score
2015 Hermansky H, Burget L, Cohen J, Dupoux E, Feldman N, Godfrey J, Khudanpur S, Maciejewski M, Mallidi SH, Menon A, Ogawa T, Peddinti V, Rose R, Stern R, Wiesner M, et al. Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2015: 5009-5013. DOI: 10.1109/ICASSP.2015.7178924  1
2014 Ganapathy S, Mallidi SH, Hermansky H. Robust feature extraction using modulation filtering of autoregressive models Ieee Transactions On Audio, Speech and Language Processing. 22: 1285-1295. DOI: 10.1109/taslp.2014.2329190  1
2014 Kintzley K, Jansen A, Hermansky H. Featherweight phonetic keyword search for conversational speech Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7859-7863. DOI: 10.1109/ICASSP.2014.6855130  1
2014 Mahajan N, Mesgarani N, Hermansky H. Principal components of auditory spectro-temporal receptive fields Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1983-1987.  1
2014 Li F, Nidadavolu PS, Hermansky H. A long, deep and wide artificial neural net for robust speech recognition in unknown noise Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 358-362.  1
2014 Schatz T, Peddinti V, Cao XN, Bach F, Hermansky H, Dupoux E. Evaluating speech features with the Minimal-Pair ABX task (II): Resistance to noise Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 915-919.  1
2013 Garimella S, Hermansky H. Factor analysis of auto-associative neural networks with application in speaker verification. Ieee Transactions On Neural Networks and Learning Systems. 24: 522-8. PMID 24808374 DOI: 10.1109/TNNLS.2012.2236652  1
2013 Hermansky H, Cohen JR, Stern RM. Perceptual properties of current speech recognition technology Proceedings of the Ieee. 101: 1968-1985. DOI: 10.1109/JPROC.2013.2252316  1
2013 Hermansky H. Multistream recognition of speech: Dealing with unknown unknowns Proceedings of the Ieee. 101: 1076-1088. DOI: 10.1109/JPROC.2012.2236871  1
2013 Jansen A, Dupoux E, Goldwater S, Johnson M, Khudanpur S, Church K, Feldman N, Hermansky H, Metze F, Rose R, Seltzer M, Clark P, McGraw I, Varadarajan B, Bennett E, et al. A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8111-8115. DOI: 10.1109/ICASSP.2013.6639245  1
2013 Jansen A, Thomas S, Hermansky H. Weak top-down constraints for unsupervised acoustic model training Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8091-8095. DOI: 10.1109/ICASSP.2013.6639241  1
2013 Hermansky H, Variani E, Peddinti V. Mean temporal distance: Predicting ASR error from temporal properties of speech signal Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7423-7426. DOI: 10.1109/ICASSP.2013.6639105  1
2013 Li F, Hermansky H. Effect of filter bandwidth and spectral sampling rate of analysis filterbank on automatic phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7121-7124. DOI: 10.1109/ICASSP.2013.6639044  1
2013 Peddinti V, Hermansky H. Filter-bank optimization for Frequency Domain Linear Prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7102-7106. DOI: 10.1109/ICASSP.2013.6639040  1
2013 Clark P, Mallidi SH, Jansen A, Hermansky H. Frequency offset correction in speech without detecting pitch Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7020-7024. DOI: 10.1109/ICASSP.2013.6639023  1
2013 Plchot O, Matsoukas S, Matejka P, Dehak N, Ma J, Cumani S, Glembek O, Hermansky H, Mallidi SH, Mesgarani N, Schwartz R, Soufifar M, Tan ZH, Thomas S, Zhang B, et al. Developing a speaker identification system for the DARPA RATS project Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 6768-6772. DOI: 10.1109/ICASSP.2013.6638972  1
2013 Thomas S, Seltzer ML, Church K, Hermansky H. Deep neural network features and semi-supervised training for low resource speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 6704-6708. DOI: 10.1109/ICASSP.2013.6638959  1
2013 Hermansky H. Long, deep and wide artificial neural nets for dealing with unexpected noise in machine recognition of speech Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8082: 14-21. DOI: 10.1007/978-3-642-40585-3_2  1
2013 Kintzley K, Jansen A, Hermansky H. Text-to-speech inspired duration modeling for improved whole-word acoustic models Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1253-1257.  1
2013 Variani E, Li F, Hermansky H. Multi-stream recognition of noisy speech with performance monitoring Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2978-2981.  1
2013 Ogawa T, Li F, Hermansky H. Stream selection and integration in multistream ASR using GMM-based performance monitoring Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 3332-3336.  1
2013 Schatz T, Peddinti V, Bach F, Jansen A, Hermansky H, Dupoux E. Evaluating speech features with the minimal-pair ABX task: Analysis of the classical MFC/PLP pipeline Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1781-1785.  1
2013 Ma J, Zhang B, Matsoukas S, Mallidi SH, Li F, Hermansky H. Improvements in language identification on the RATS noisy speech corpus Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 69-73.  1
2013 Mallidi SH, Ganapathy S, Hermansky H. Robust speaker recognition using spectro-temporal autoregressive models Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 3689-3693.  1
2012 Ganapathy S, Hermansky H. Temporal resolution analysis in frequency domain linear prediction. The Journal of the Acoustical Society of America. 132: EL436-42. PMID 23145707 DOI: 10.1121/1.4758826  1
2012 Weinshall D, Zweig A, Hermansky H, Kombrink S, Ohl FW, Anemüller J, Bach JH, Van Gool L, Nater F, Pajdla T, Havlena M, Pavel M. Beyond novelty detection: incongruent events, when general and specific classifiers disagree. Ieee Transactions On Pattern Analysis and Machine Intelligence. 34: 1886-901. PMID 22213766 DOI: 10.1109/TPAMI.2011.279  1
2012 Sivaram GSVS, Hermansky H. Sparse multilayer perceptron for phoneme recognition Ieee Transactions On Audio, Speech and Language Processing. 20: 23-29. DOI: 10.1109/TASL.2011.2129510  1
2012 Garimella S, Mallidi SH, Hermansky H. Regularized auto-associative neural networks for speaker verification Ieee Signal Processing Letters. 19: 841-844. DOI: 10.1109/LSP.2012.2221706  1
2012 Thomas S, Ganapathy S, Hermansky H. Multilingual MLP features for low-resource LVCSR systems Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4269-4272. DOI: 10.1109/ICASSP.2012.6288862  1
2012 Garcia-Romero D, Zhou X, Zotkin D, Srinivasan B, Luo Y, Ganapathy S, Thomas S, Nemala S, Sivaram GSVS, Mirbagheri M, Mallidi SH, Janu T, Rajan P, Mesgarani N, Elhilali M, ... Hermansky H, et al. The UMD-JHU 2011 speaker recognition system Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4229-4232. DOI: 10.1109/ICASSP.2012.6288852  1
2012 Ikbal S, Misra H, Hermansky H, Magimai-Doss M. Phase AutoCorrelation (PAC) features for noise robust speech recognition Speech Communication. 54: 867-880. DOI: 10.1016/j.specom.2012.02.005  1
2012 Anemüller J, Caputo B, Hermansky H, Ohl FW, Pajdla T, Pavel M, Van Gool L, Vogels R, Wabnik S, Weinshall D. DIRAC: Detection and identification of rare audio-visual events Studies in Computational Intelligence. 384: 3-35. DOI: 10.1007/978-3-642-24034-8_1  1
2012 Ganapathy S, Hermansky H. Robust phoneme recognition using high resolution temporal envelopes 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1826-1829.  1
2012 Kintzley K, Jansen A, Hermansky H. MAP estimation of whole-word acoustic models with dictionary priors 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 786-789.  1
2012 Thomas S, Mallidi SH, Janu T, Hermansky H, Mesgarani N, Zhou X, Shamma S, Ng T, Zhang B, Nguyen L, Matsoukas S. Acoustic and data-driven features for robust speech activity detection 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1983-1986.  1
2012 Jansen A, Thomas S, Hermansky H. Intrinsic spectral analysis for zero and high resource speech recognition 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 878-881.  1
2012 Kintzley K, Jansen A, Church K, Hermansky H. Inverting the point process model for fast phonetic keyword search 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 2437-2440.  1
2012 Variani E, Hermansky H. Estimating classifier performance in unknown noise 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 2: 1798-1801.  1
2012 Li F, Mallidi SH, Hermansky H. Phone recognition in critical bands using sub-band temporal modulations 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1814-1817.  1
2012 Thomas S, Ganapathy S, Jansen A, Hermansky H. Data-driven posterior features for low resource speech recognition applications 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 790-793.  1
2012 Hirsch HG, Ganapathy S, Hermansky H. Comparison of different approaches for speech recognition in hands-free mode Proceedings of 10th Itg Symposium On Speech Communication.  1
2011 Mesgarani N, Thomas S, Hermansky H. Toward optimizing stream fusion in multistream recognition of speech. The Journal of the Acoustical Society of America. 130: EL14-8. PMID 21786862 DOI: 10.1121/1.3595744  1
2011 Pinto J, Garimella S, Magimai-Doss M, Hermansky H, Bourlard H. Analysis of MLP-based hierarchical phoneme posterior probability estimator Ieee Transactions On Audio, Speech and Language Processing. 19: 225-241. DOI: 10.1109/TASL.2010.2045943  1
2011 Sivaram GSVS, Hermansky H. Multilayer perceptron with sparse hidden outputs for phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5336-5339. DOI: 10.1109/ICASSP.2011.5947563  1
2011 Zweig G, Nguyen P, Van Compernolle D, Demuynck K, Atlas L, Clark P, Sell G, Wang M, Sha F, Hermansky H, Karakos D, Jansen A, Thomas S, S GSVS, Bowman S, et al. Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5044-5047. DOI: 10.1109/ICASSP.2011.5947490  1
2011 Thomas S, Nguyen P, Zweig G, Hermansky H. MLP based phoneme detectors for automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5024-5027. DOI: 10.1109/ICASSP.2011.5947485  1
2011 Ganapathy S, Rajan P, Hermansky H. Multi-layer perceptron based speech activity detection for speaker verification Ieee Workshop On Applications of Signal Processing to Audio and Acoustics. 321-324. DOI: 10.1109/ASPAA.2011.6082323  1
2011 Hermansky H. Speech recognition from spectral dynamics Sadhana - Academy Proceedings in Engineering Sciences. 36: 729-744. DOI: 10.1007/s12046-011-0044-2  1
2011 Hermansky H. Dealing with unexpected words in automatic recognition of speech Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6836: 1-15. DOI: 10.1007/978-3-642-23538-2_1  1
2011 Mallidi SH, Ganapathy S, Hermansky H. Modulation spectrum analysis for recognition of reverberant speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 189-192.  1
2011 Sivaram GSVS, Thomas S, Hermansky H. Mixture of auto-associative neural networks for speaker verification Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2381-2384.  1
2011 Carlin MA, Thomas S, Jansen A, Hermansky H. Rapid evaluation of speech representations for spoken term discovery Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 821-824.  1
2011 Mesgarani N, Thomas S, Hermansky H. Adaptive stream fusion in multistream recognition of speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2329-2332.  1
2011 Kintzley K, Jansen A, Hermansky H. Event selection from phone posteriorgrams using matched filters Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1905-1908.  1
2010 Ganapathy S, Thomas S, Hermansky H. Temporal envelope compensation for robust phoneme recognition using modulation spectrum. The Journal of the Acoustical Society of America. 128: 3769-80. PMID 21218908 DOI: 10.1121/1.3504658  1
2010 Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Wide-band audio coding based on frequency-domain linear prediction Eurasip Journal On Audio, Speech, and Music Processing. 2010. DOI: 10.1186/1687-4722-2010-856280  1
2010 Ganapathy S, Motlicek P, Hermansky H. Autoregressive models of amplitude modulations in audio compression Ieee Transactions On Audio, Speech and Language Processing. 18: 1624-1631. DOI: 10.1109/TASL.2009.2038813  1
2010 Sivaram GSVS, Nemala SK, Mesgarani N, Hermansky H. Data-driven and feedback based spectro-temporal features for speech recognition Ieee Signal Processing Letters. 17: 957-960. DOI: 10.1109/LSP.2010.2079930  1
2010 Liu SC, Mesgarani N, Harris J, Hermansky H. The use of spike-based representations for hardware audition systems Iscas 2010 - 2010 Ieee International Symposium On Circuits and Systems: Nano-Bio Circuit Fabrics and Systems. 505-508. DOI: 10.1109/ISCAS.2010.5537588  1
2010 Delbruck T, Koch T, Berner R, Hermansky H. Fully integrated 500uW speech detection wake-up circuit Iscas 2010 - 2010 Ieee International Symposium On Circuits and Systems: Nano-Bio Circuit Fabrics and Systems. 2015-2018. DOI: 10.1109/ISCAS.2010.5537160  1
2010 Ganapathy S, Thomas S, Hermansky H. Robust spectro-temporal features based on autoregressive models of Hilbert envelopes Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4286-4289. DOI: 10.1109/ICASSP.2010.5495668  1
2010 Sivaram GSVS, Nemala SK, Elhilali M, Tran TD, Hermansky H. Sparse coding for speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4346-4349. DOI: 10.1109/ICASSP.2010.5495649  1
2010 Ganapathy S, Thomas S, Hermansky H. Comparison of modulation features for phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5038-5041. DOI: 10.1109/ICASSP.2010.5495057  1
2010 Hermansky H. History of modulation spectrum in ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5458-5461. DOI: 10.1109/ICASSP.2010.5494907  1
2010 Thomas S, Patil K, Ganapathy S, Mesgarani N, Hermansky H. A phoneme recognition framework based on auditory spectro-temporal receptive fields Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2458-2461.  1
2010 Sivaram GSVS, Ganapathy S, Hermansky H. Sparse auto-associative neural networks: Theory and application to speech recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2270-2273.  1
2010 Thomas S, Ganapathy S, Hermansky H. Cross-lingual and multi-stream posterior features for low resource LVCSR systems Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 877-880.  1
2010 Mesgarani N, Thomas S, Hermansky H. A multistream multiresolution framework for phoneme recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 318-321.  1
2010 Jansen A, Church K, Hermansky H. Towards spoken term discovery at scale with zero resources Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 1676-1679.  1
2009 Ganapathy S, Thomas S, Hermansky H. Modulation frequency features for phoneme recognition in noisy speech. The Journal of the Acoustical Society of America. 125: EL8-12. PMID 19173383 DOI: 10.1121/1.3040022  1
2009 Thomas S, Ganapathy S, Hermansky H. Phoneme recognition using spectral envelope and modulation frequency features Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4453-4456. DOI: 10.1109/ICASSP.2009.4960618  1
2009 Pinto J, Sivaram GSVS, Hermansky H, Magimai-Doss M. Volterra series for analyzing MLP based phoneme posterior estimator Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1813-1816. DOI: 10.1109/ICASSP.2009.4959958  1
2009 Pavel M, Slaney M, Hermansky H. Reconciliation of human and machine speech recognition performance Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1669-1672. DOI: 10.1109/ICASSP.2009.4959922  1
2009 Ganapathy S, Thomas S, Hermansky H. Temporal envelope subtraction for robust speech recognition using modulation spectrum Proceedings of the 2009 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2009. 164-169. DOI: 10.1109/ASRU.2009.5372922  1
2009 Ganapathy S, Thomas S, Motlicek P, Hermansky H. Applications of signal analysis using autoregressive models for amplitude modulation Ieee Workshop On Applications of Signal Processing to Audio and Acoustics. 341-344. DOI: 10.1109/ASPAA.2009.5346495  1
2009 Ganapathy S, Motlicek P, Hermansky H. Error resilient speech coding using sub-band Hilbert envelopes Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5729: 355-362. DOI: 10.1007/978-3-642-04208-9_49  1
2009 Stricker C, Wagen JF, Aradilla G, Bourlard H, Hermansky H, Pinto J, Rey PH, Théraulaz J. Intelligent multi-modal interfaces for mobile applications in hostile environment (IM-HOST) Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5440: 71-102. DOI: 10.1007/978-3-642-00437-7_4  1
2009 Ganapathy S, Motlicek P, Hermansky H. MDCT for encoding residual signals in frequency domain linear prediction 127th Audio Engineering Society Convention 2009. 2: 1103-1110.  1
2009 Thomas S, Ganapathy S, Hermansky H. Tandem representations of spectral envelope and modulation frequency features for ASR Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2955-2958.  1
2009 Mesgarani N, Sivaram GSVS, Nemala SK, Elhilali M, Hermansky H. Discriminant spectrotemporal features for phoneme recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2983-2986.  1
2009 Ganapathy S, Thomas S, Hermansky H. Static and dynamic modulation spectrum for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2823-2826.  1
2009 Motlicek P, Ganapathy S, Hermansky H. Arithmetic coding of sub-band residuals in FDLP speech/audio Codec Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2591-2594.  1
2009 Kombrink S, Burget L, Matějka P, Karafiát M, Hermansky H. Posterior-based out of vocabulary word detection in telephone speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 80-83.  1
2008 Anemüller J, Bach JH, Caputo B, Havlena M, Jie L, Kayser H, Leibe B, Motlicek P, Pajdla T, Pavel M, Torii A, Gool LV, Zweig A, Hermansky H. The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events Icmi'08: Proceedings of the 10th International Conference On Multimodal Interfaces. 289-292. DOI: 10.1145/1452392.1452451  1
2008 Thomas S, Ganapathy S, Hermansky H. Recognition of reverberant speech using frequency domain linear prediction Ieee Signal Processing Letters. 15: 681-684. DOI: 10.1109/LSP.2008.2002708  1
2008 Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Temporal masking for bit-rate reduction in audio codec based on Frequency Domain Linear Prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4781-4784. DOI: 10.1109/ICASSP.2008.4518726  1
2008 Pinto J, Yegnanarayana B, Hermansky H, Magimai-Doss M. Exploiting contextual information for improved phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4449-4452. DOI: 10.1109/ICASSP.2008.4518643  1
2008 Valente F, Hermansky H. Hierarchical and parallel processing of modulation spectrum for ASR applications Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4165-4168. DOI: 10.1109/ICASSP.2008.4518572  1
2008 White C, Zweig G, Burget L, Schwarz P, Hermansky H. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4085-4088. DOI: 10.1109/ICASSP.2008.4518552  1
2008 Burget L, Schwarz P, Matějka P, Hannemann M, Rastrow A, White C, Khudanpur S, Hermansky H, Černocký J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4081-4084. DOI: 10.1109/ICASSP.2008.4518551  1
2008 Sivaram GSVS, Hermansky H. Emulating temporal receptive fields of higher level auditory neurons for ASR Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 509-516. DOI: 10.1007/978-3-540-87391-4_65  1
2008 Pinto J, Sivaram GSVS, Hermansky H. Reverse correlation for analyzing MLP posterior features in ASR Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 469-476. DOI: 10.1007/978-3-540-87391-4_60  1
2008 Krishnan Parthasarathi SH, Motlíček P, Hermansky H. Exploiting contextual information for speech/non-speech detection Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 451-459. DOI: 10.1007/978-3-540-87391-4_58  1
2008 Motlíček P, Ganapathy S, Hermansky H, Garudadri H, Athineos M. Perceptually motivated sub-band decomposition for FDLP audio coding Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 435-442. DOI: 10.1007/978-3-540-87391-4_56  1
2008 Thomas S, Ganapathy S, Hermansky H. Hilbert envelope based features for far-field speech recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5237: 119-124. DOI: 10.1007/978-3-540-85853-9-11  1
2008 Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Frequency domain linear prediction for QMF sub-bands and applications to audio coding Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4892: 248-258. DOI: 10.1007/978-3-540-78155-4_22  1
2008 Thomas S, Ganapathy S, Hermansky H. Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain European Signal Processing Conference.  1
2008 Ganapathy S, Thomas S, Hermansky H. Front-end for far-field speech recognition based on frequency domain linear prediction Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 984-987.  1
2008 Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Spectral noise shaping: Improvements in speech/audio codec based on linear prediction in spectral domain Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 675-678.  1
2008 Valente F, Hermansky H. On the combination of auditory and modulation frequency channels for ASR applications Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2242-2245.  1
2008 Sivaram GSVS, Hermansky H. Introducing temporal asymmetries in feature extraction for automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 890-893.  1
2008 Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Autoregressive modelling of Hilbert envelopes for wide-band audio coding Audio Engineering Society - 124th Audio Engineering Society Convention 2008. 3: 1620-1627.  1
2008 Sivaram GSVS, Hermansky H. Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition European Signal Processing Conference.  1
2008 Thomas S, Ganapathy S, Hermansky H. Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1521-1524.  1
2008 Tošić T, Magimai-Doss M, Hermansky H. Using comparison of parallel phoneme probability streams for OOV word detection European Signal Processing Conference.  1
2008 Pinto J, Hermansky H. Combining evidence from a generative and a discriminative model in phoneme recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2414-2417.  1
2007 Valente F, Hermansky H. Combination of acoustic classifiers based on Dempster-Shafer theory of evidence Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4: IV1129-IV1132. DOI: 10.1109/ICASSP.2007.367273  1
2007 Motlicek P, Ullal V, Hermansky H. Wide-band perceptual audio coding based on frequency-domain linear prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I265-I268. DOI: 10.1109/ICASSP.2007.366667  1
2007 Valente F, Vepa J, Plahl C, Gollan C, Hermansky H, Schlüter R. Hierarchical Neural Networks feature extraction for LVCSR system Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 265-268.  1
2007 Prasanna SRM, Hermansky H. MRASTA and PLP in automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 137-140.  1
2007 Ketabdar H, Hannemann M, Hermansky H. Detection of out-of-vocabulary words in posterior based ASR International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007. 4: 2772-2775.  1
2007 Pinto J, Lovitt A, Hermansky H. Exploiting phoneme similarities in hybrid HMM-ANN keyword spotting International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007. 4: 2388-2391.  1
2007 Motlicek P, Hermansky H, Ganapathy S, Garudadri H. Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4629: 350-357.  1
2007 Valente F, Vepa J, Hermansky H. Multi-stream features combination based on Dempster-Shafer rule for LVCSR system Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 273-276.  1
2006 Fousek P, Hermansky H. Towards ASR based on hierarchical posterior-based keyword recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I433-I436.  1
2006 Valente F, Hermansky H. Discriminant linear processing of time-frequency plane Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 349-352.  1
2006 Motlíček P, Hermansky H, Garudadri H, Srinivasamurthy N. Speech coding based on spectral dynamics Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4188: 471-478.  1
2005 Verhelst W, Herre J, Kubin G, Hermansky H, Jensen SH. Eurasip Journal on Applied Signal Processing: Editorial Eurasip Journal On Applied Signal Processing. 2005: 1289-1291. DOI: 10.1155/ASP.2005.1289  1
2005 Morgan N, Zhu Q, Stolcke A, Sönmez K, Sivadas S, Shinozaki T, Ostendorf M, Jain P, Hermansky H, Ellis D, Doddington G, Chen B, Çetin O, Bourlard H, Athineos M. Pushing the envelope - Aside Ieee Signal Processing Magazine. 22: 81-88. DOI: 10.1109/MSP.2005.1511826  1
2005 Hermansky H, Fousek P. Multi-resolution RASTA filtering for TANDEM-based ASR 9th European Conference On Speech Communication and Technology. 361-364.  1
2005 Hermansky H, Fousek P, Lehtonen M. The role of speech in multimodal human-computer interaction (towards reliable rejection of non-keyword input) Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3658: 2-8.  1
2004 Misra H, Ikbal S, Bourlard H, Hermansky H. Spectral entropy based feature for robust ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I193-I196.  1
2004 Sivadas S, Hermansky H. On use of task independent training data in tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I541-I544.  1
2004 Ikbal S, Misra H, Bourlard H, Hermansky H. Phase AutoCorrelation (PAC) features in entropy based multi-stream for robust speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I205-I208.  1
2003 Ikbal S, Hermansky H, Bourlard H. Nonlinear spectral transformations for robust speech recognition 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 393-398. DOI: 10.1109/ASRU.2003.1318473  1
2003 Hermansky H. TRAP-TANDEM: Data-driven extraction of temporal features from speech 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 255-260. DOI: 10.1109/ASRU.2003.1318450  1
2003 Malayath N, Hermansky H. Data-driven spectral basis functions for automatic speech recognition Speech Communication. 40: 449-466. DOI: 10.1016/S0167-6393(02)00127-9  1
2003 Matějka P, Schwarz P, Hermansky H, Černocky J. Phoneme recognition using temporal patterns Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science). 2807: 198-205.  1
2003 Sivadas S, Hermansky H. Generalized Tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 56-59.  1
2003 Kajarekar SS, Hermansky H. Analysis of information in speech based on MANOVA Advances in Neural Information Processing Systems.  1
2002 Sivadas S, Hermansky H. Hierarchical tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I/809-I/812.  1
2002 Adami AG, Kajarekar SS, Hermansky H. A new speaker change detection method for two-speaker segmentation Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4: IV/3908-IV/3911.  1
2001 Kajarekar SS, Yegnanarayana B, Hermansky H. A study of two dimensional linear discriminants for ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 137-140.  1
2000 Malayath N, Hermansky H, Kajarekar S, Yegnanarayana B. Data-driven temporal filters and alternatives to GMM in speaker verification Digital Signal Processing: a Review Journal. 10: 55-74. DOI: 10.1006/dspr.1999.0363  1
2000 Yang HH, Van Vuuren S, Sharma S, Hermansky H. Relevance of time-frequency features for phonetic and speaker-channel classification Speech Communication. 31: 35-50.  1
2000 Kajarekar SS, Hermansky H. Analysis of information in speech and its application in speech recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1902: 283-288.  1
2000 Yang HH, Hermansky H. Search for information bearing components in speech Advances in Neural Information Processing Systems. 803-809.  1
1999 Arai T, Pavel M, Hermansky H, Avendano C. Syllable intelligibility for temporally filtered LPC cepstral trajectories. The Journal of the Acoustical Society of America. 105: 2783-91. PMID 10335630 DOI: 10.1121/1.426895  1
1999 Kanedera N, Arai T, Hermansky H, Pavel M. On the relative importance of various components of the modulation spectrum for automatic speech recognition Speech Communication. 28: 43-55. DOI: 10.1016/S0167-6393(99)00002-3  1
1999 Yegnanarayana B, Avendano C, Hermansky H, Satyanarayana Murthy P. Speech enhancement using linear prediction residual Speech Communication. 28: 25-42. DOI: 10.1016/S0167-6393(98)00070-3  1
1998 Kanedera N, Hermansky H, Arai T. On properties of modulation spectrum for robust automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2: 613-616. DOI: 10.1109/ICASSP.1998.675339  1
1998 Hermansky H. Should recognizers have ears? Speech Communication. 25: 3-27.  1
1998 Yegnanarayana B, Avendano C, Murthy PS, Hermansky H. Enhancement of reverberant speech using LP residual Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 405-408.  1
1997 Avendano C, Hermansky H. On the effects of short-term spectrum smoothing in channel normalization Ieee Transactions On Speech and Audio Processing. 5: 372-374. DOI: 10.1109/89.593318  1
1996 Bourlard H, Hermansky H, Morgan N. Towards increasing speech recognition error rates Speech Communication. 18: 205-231. DOI: 10.1016/0167-6393(96)00003-9  1
1995 Cole R, Hermansky H, Novick DG, Oviatt S, Hirschman L, Atlas L, Beckman M, Biermann A, Bush M, Clements M, Cohen J, Garcia O, Hanson B, Levinson S, McKeown K, et al. The Challenge of Spoken Language Systems: Research Directions for the Nineties Ieee Transactions On Speech and Audio Processing. 3: 1-21. DOI: 10.1109/89.365385  1
1994 Hermansky H, Morgan N. RASTA Processing of Speech Ieee Transactions On Speech and Audio Processing. 2: 578-589. DOI: 10.1109/89.326616  1
1991 Morgan N, Hermansky H, Bourlard H, Kohn P, Wooters C. Continuous speech recognition using PLP analysis with multilayer perceptrons Proceedings - Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing. 1: 49-52.  1
1990 Hermansky H. Perceptual linear predictive (PLP) analysis of speech. The Journal of the Acoustical Society of America. 87: 1738-52. PMID 2341679 DOI: 10.1121/1.399423  1
1985 Hermansky H, Hanson BA, Wakita H. Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain Speech Communication. 4: 181-187. DOI: 10.1016/0167-6393(85)90045-7  1