Year |
Citation |
Score |
2022 |
Kayser H, Hermansky H, Meyer BT. Spatial speech detection for binaural hearing aids using deep phoneme classifiers. Acta Acustica. European Acoustics Association. 6. PMID 36159631 DOI: 10.1051/aacus/2022013 |
0.351 |
|
2020 |
Li R, Wang X, Mallidi SH, Watanabe S, Hori T, Hermansky H. Multi-Stream End-to-End Speech Recognition Ieee/Acm Transactions On Audio, Speech, and Language Processing. 28: 646-655. DOI: 10.1109/TASLP.2019.2959721 |
0.381 |
|
2019 |
Mahajan NR, Mesgarani N, Hermansky H. General properties of auditory spectro-temporal receptive fields. The Journal of the Acoustical Society of America. 146: EL459. PMID 31893764 DOI: 10.1121/1.5135021 |
0.65 |
|
2019 |
Hermansky H. Coding and decoding of messages in human speech communication: Implications for machine recognition of speech Speech Communication. 106: 112-117. DOI: 10.1016/J.SPECOM.2018.12.004 |
0.503 |
|
2019 |
Castro Martinez AM, Gerlach L, Payá-Vayá G, Hermansky H, Ooster J, Meyer BT. DNN-based performance measures for predicting error rates in automatic speech recognition and optimizing hearing aid parameters Speech Communication. 106: 44-56. DOI: 10.1016/j.specom.2018.11.006 |
0.45 |
|
2016 |
Hsiao R, Ma J, Hartmann W, Karafiát M, Grézl F, Burget L, Szöke I, Černocky JH, Watanabe S, Chen Z, Mallidi SH, Hermansky H, Tsakalidis S, Schwartz R. Robust speech recognition in unknown reverberant and noisy conditions 2015 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2015 - Proceedings. 533-538. DOI: 10.1109/ASRU.2015.7404841 |
0.516 |
|
2014 |
Ganapathy S, Mallidi SH, Hermansky H. Robust feature extraction using modulation filtering of autoregressive models Ieee Transactions On Audio, Speech and Language Processing. 22: 1285-1295. DOI: 10.1109/Taslp.2014.2329190 |
0.675 |
|
2014 |
Mahajan N, Mesgarani N, Hermansky H. Principal components of auditory spectro-temporal receptive fields Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1983-1987. |
0.571 |
|
2014 |
Li F, Nidadavolu PS, Hermansky H. A long, deep and wide artificial neural net for robust speech recognition in unknown noise Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 358-362. |
0.322 |
|
2013 |
Garimella S, Hermansky H. Factor analysis of auto-associative neural networks with application in speaker verification. Ieee Transactions On Neural Networks and Learning Systems. 24: 522-8. PMID 24808374 DOI: 10.1109/Tnnls.2012.2236652 |
0.713 |
|
2013 |
Hermansky H. Multistream recognition of speech: Dealing with unknown unknowns Proceedings of the Ieee. 101: 1076-1088. DOI: 10.1109/JPROC.2012.2236871 |
0.4 |
|
2013 |
Jansen A, Dupoux E, Goldwater S, Johnson M, Khudanpur S, Church K, Feldman N, Hermansky H, Metze F, Rose R, Seltzer M, Clark P, McGraw I, Varadarajan B, Bennett E, et al. A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8111-8115. DOI: 10.1109/ICASSP.2013.6639245 |
0.502 |
|
2013 |
Jansen A, Thomas S, Hermansky H. Weak top-down constraints for unsupervised acoustic model training Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8091-8095. DOI: 10.1109/ICASSP.2013.6639241 |
0.456 |
|
2013 |
Plchot O, Matsoukas S, Matejka P, Dehak N, Ma J, Cumani S, Glembek O, Hermansky H, Mallidi SH, Mesgarani N, Schwartz R, Soufifar M, Tan ZH, Thomas S, Zhang B, et al. Developing a speaker identification system for the DARPA RATS project Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 6768-6772. DOI: 10.1109/ICASSP.2013.6638972 |
0.605 |
|
2013 |
Thomas S, Seltzer ML, Church K, Hermansky H. Deep neural network features and semi-supervised training for low resource speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 6704-6708. DOI: 10.1109/ICASSP.2013.6638959 |
0.539 |
|
2013 |
Kintzley K, Jansen A, Hermansky H. Text-to-speech inspired duration modeling for improved whole-word acoustic models Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1253-1257. |
0.328 |
|
2013 |
Variani E, Li F, Hermansky H. Multi-stream recognition of noisy speech with performance monitoring Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2978-2981. |
0.37 |
|
2013 |
Mallidi SH, Ganapathy S, Hermansky H. Robust speaker recognition using spectro-temporal autoregressive models Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 3689-3693. |
0.327 |
|
2012 |
Ganapathy S, Hermansky H. Temporal resolution analysis in frequency domain linear prediction. The Journal of the Acoustical Society of America. 132: EL436-42. PMID 23145707 DOI: 10.1121/1.4758826 |
0.661 |
|
2012 |
Sivaram GSVS, Hermansky H. Sparse multilayer perceptron for phoneme recognition Ieee Transactions On Audio, Speech and Language Processing. 20: 23-29. DOI: 10.1109/TASL.2011.2129510 |
0.412 |
|
2012 |
Garimella S, Mallidi SH, Hermansky H. Regularized auto-associative neural networks for speaker verification Ieee Signal Processing Letters. 19: 841-844. DOI: 10.1109/Lsp.2012.2221706 |
0.725 |
|
2012 |
Thomas S, Ganapathy S, Hermansky H. Multilingual MLP features for low-resource LVCSR systems Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4269-4272. DOI: 10.1109/ICASSP.2012.6288862 |
0.624 |
|
2012 |
Garcia-Romero D, Zhou X, Zotkin D, Srinivasan B, Luo Y, Ganapathy S, Thomas S, Nemala S, Sivaram GSVS, Mirbagheri M, Mallidi SH, Janu T, Rajan P, Mesgarani N, Elhilali M, ... Hermansky H, et al. The UMD-JHU 2011 speaker recognition system Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4229-4232. DOI: 10.1109/ICASSP.2012.6288852 |
0.769 |
|
2012 |
Ikbal S, Misra H, Hermansky H, Magimai-Doss M. Phase AutoCorrelation (PAC) features for noise robust speech recognition Speech Communication. 54: 867-880. DOI: 10.1016/j.specom.2012.02.005 |
0.471 |
|
2012 |
Thomas S, Mallidi SH, Janu T, Hermansky H, Mesgarani N, Zhou X, Shamma S, Ng T, Zhang B, Nguyen L, Matsoukas S. Acoustic and data-driven features for robust speech activity detection 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1983-1986. |
0.736 |
|
2012 |
Jansen A, Thomas S, Hermansky H. Intrinsic spectral analysis for zero and high resource speech recognition 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 878-881. |
0.36 |
|
2012 |
Thomas S, Ganapathy S, Jansen A, Hermansky H. Data-driven posterior features for low resource speech recognition applications 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 790-793. |
0.375 |
|
2011 |
Mesgarani N, Thomas S, Hermansky H. Toward optimizing stream fusion in multistream recognition of speech. The Journal of the Acoustical Society of America. 130: EL14-8. PMID 21786862 DOI: 10.1121/1.3595744 |
0.743 |
|
2011 |
Hermansky H. Dealing with unknown unknowns in speech The Journal of the Acoustical Society of America. 130: 2408-2408. DOI: 10.1121/1.3654655 |
0.428 |
|
2011 |
Pinto J, Garimella S, Magimai-Doss M, Hermansky H, Bourlard H. Analysis of MLP-based hierarchical phoneme posterior probability estimator Ieee Transactions On Audio, Speech and Language Processing. 19: 225-241. DOI: 10.1109/Tasl.2010.2045943 |
0.395 |
|
2011 |
Thomas S, Nguyen P, Zweig G, Hermansky H. MLP based phoneme detectors for automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5024-5027. DOI: 10.1109/ICASSP.2011.5947485 |
0.598 |
|
2011 |
Ganapathy S, Rajan P, Hermansky H. Multi-layer perceptron based speech activity detection for speaker verification Ieee Workshop On Applications of Signal Processing to Audio and Acoustics. 321-324. DOI: 10.1109/ASPAA.2011.6082323 |
0.569 |
|
2011 |
Hermansky H. Speech recognition from spectral dynamics Sadhana - Academy Proceedings in Engineering Sciences. 36: 729-744. DOI: 10.1007/s12046-011-0044-2 |
0.518 |
|
2011 |
Hermansky H. Dealing with unexpected words in automatic recognition of speech Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6836: 1-15. DOI: 10.1007/978-3-642-23538-2_1 |
0.372 |
|
2011 |
Mallidi SH, Ganapathy S, Hermansky H. Modulation spectrum analysis for recognition of reverberant speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 189-192. |
0.412 |
|
2011 |
Mesgarani N, Thomas S, Hermansky H. Adaptive stream fusion in multistream recognition of speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2329-2332. |
0.671 |
|
2010 |
Ganapathy S, Thomas S, Hermansky H. Temporal envelope compensation for robust phoneme recognition using modulation spectrum. The Journal of the Acoustical Society of America. 128: 3769-80. PMID 21218908 DOI: 10.1121/1.3504658 |
0.754 |
|
2010 |
Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Wide-band audio coding based on frequency-domain linear prediction Eurasip Journal On Audio, Speech, and Music Processing. 2010. DOI: 10.1155/2010/856280 |
0.656 |
|
2010 |
Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction Eurasip Journal On Audio, Speech, and Music Processing. 2010: 1-14. DOI: 10.1155/2010/856280 |
0.532 |
|
2010 |
Hermansky H. Posterior‐based attributes in machine recognition of speech. The Journal of the Acoustical Society of America. 127: 2041-2041. DOI: 10.1121/1.3385373 |
0.483 |
|
2010 |
Ganapathy S, Motlicek P, Hermansky H. Autoregressive models of amplitude modulations in audio compression Ieee Transactions On Audio, Speech and Language Processing. 18: 1624-1631. DOI: 10.1109/Tasl.2009.2038813 |
0.628 |
|
2010 |
Sivaram GSVS, Nemala SK, Mesgarani N, Hermansky H. Data-driven and feedback based spectro-temporal features for speech recognition Ieee Signal Processing Letters. 17: 957-960. DOI: 10.1109/Lsp.2010.2079930 |
0.691 |
|
2010 |
Liu SC, Mesgarani N, Harris J, Hermansky H. The use of spike-based representations for hardware audition systems Iscas 2010 - 2010 Ieee International Symposium On Circuits and Systems: Nano-Bio Circuit Fabrics and Systems. 505-508. DOI: 10.1109/ISCAS.2010.5537588 |
0.558 |
|
2010 |
Ganapathy S, Thomas S, Hermansky H. Robust spectro-temporal features based on autoregressive models of Hilbert envelopes Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4286-4289. DOI: 10.1109/ICASSP.2010.5495668 |
0.681 |
|
2010 |
Sivaram GSVS, Nemala SK, Elhilali M, Tran TD, Hermansky H. Sparse coding for speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4346-4349. DOI: 10.1109/ICASSP.2010.5495649 |
0.332 |
|
2010 |
Ganapathy S, Thomas S, Hermansky H. Comparison of modulation features for phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5038-5041. DOI: 10.1109/ICASSP.2010.5495057 |
0.699 |
|
2010 |
Thomas S, Patil K, Ganapathy S, Mesgarani N, Hermansky H. A phoneme recognition framework based on auditory spectro-temporal receptive fields Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2458-2461. |
0.628 |
|
2010 |
Mesgarani N, Thomas S, Hermansky H. A multistream multiresolution framework for phoneme recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 318-321. |
0.645 |
|
2009 |
Ganapathy S, Thomas S, Hermansky H. Modulation frequency features for phoneme recognition in noisy speech. The Journal of the Acoustical Society of America. 125: EL8-12. PMID 19173383 DOI: 10.1121/1.3040022 |
0.762 |
|
2009 |
Hermansky H. Nonlinear mapping for feature extraction in automatic speech recognition The Journal of the Acoustical Society of America. 125: 4109. DOI: 10.1121/1.3155499 |
0.449 |
|
2009 |
Thomas S, Ganapathy S, Hermansky H. Phoneme recognition using spectral envelope and modulation frequency features Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4453-4456. DOI: 10.1109/ICASSP.2009.4960618 |
0.695 |
|
2009 |
Ganapathy S, Thomas S, Hermansky H. Temporal envelope subtraction for robust speech recognition using modulation spectrum Proceedings of the 2009 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2009. 164-169. DOI: 10.1109/ASRU.2009.5372922 |
0.731 |
|
2009 |
Ganapathy S, Thomas S, Motlicek P, Hermansky H. Applications of signal analysis using autoregressive models for amplitude modulation Ieee Workshop On Applications of Signal Processing to Audio and Acoustics. 341-344. DOI: 10.1109/ASPAA.2009.5346495 |
0.621 |
|
2009 |
Ganapathy S, Motlicek P, Hermansky H. Error resilient speech coding using sub-band hilbert envelopes Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5729: 355-362. DOI: 10.1007/978-3-642-04208-9_49 |
0.541 |
|
2009 |
Thomas S, Ganapathy S, Hermansky H. Tandem representations of spectral envelope and modulation frequency features for ASR Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2955-2958. |
0.305 |
|
2009 |
Mesgarani N, Sivaram GSVS, Nemala SK, Elhilali M, Hermansky H. Discriminant spectrotemporal features for phoneme recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2983-2986. |
0.662 |
|
2009 |
Ganapathy S, Thomas S, Hermansky H. Static and dynamic modulation spectrum for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2823-2826. |
0.329 |
|
2009 |
Kombrink S, Burget L, Matějka P, Karafiát M, Hermansky H. Posterior-based out of vocabulary word detection in telephone speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 80-83. |
0.317 |
|
2008 |
Thomas S, Ganapathy S, Hermansky H. Recognition of reverberant speech using frequency domain linear prediction Ieee Signal Processing Letters. 15: 681-684. DOI: 10.1109/Lsp.2008.2002708 |
0.76 |
|
2008 |
Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Temporal masking for bit-rate reduction in audio codec based on Frequency Domain Linear Prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4781-4784. DOI: 10.1109/ICASSP.2008.4518726 |
0.538 |
|
2008 |
Krishnan Parthasarathi SH, Motlíček P, Hermansky H. Exploiting contextual information for speech/non-speech detection Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 451-459. DOI: 10.1007/978-3-540-87391-4_58 |
0.316 |
|
2008 |
Motlíček P, Ganapathy S, Hermansky H, Garudadri H, Athineos M. Perceptually motivated sub-band decomposition for FDLP audio coding Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 435-442. DOI: 10.1007/978-3-540-87391-4_56 |
0.499 |
|
2008 |
Thomas S, Ganapathy S, Hermansky H. Hilbert envelope based features for far-field speech recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5237: 119-124. DOI: 10.1007/978-3-540-85853-9_11 |
0.394 |
|
2008 |
Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Frequency domain linear prediction for QMF sub-bands and applications to audio coding Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4892: 248-258. DOI: 10.1007/978-3-540-78155-4_22 |
0.525 |
|
2008 |
Thomas S, Ganapathy S, Hermansky H. Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain European Signal Processing Conference. |
0.394 |
|
2008 |
Ganapathy S, Thomas S, Hermansky H. Front-end for far-field speech recognition based on frequency domain linear prediction Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 984-987. |
0.31 |
|
2008 |
Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Spectral noise shaping: Improvements in speech/audio codec based on linear prediction in spectral domain Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 675-678. |
0.304 |
|
2008 |
Sivaram GSVS, Hermansky H. Introducing temporal asymmetries in feature extraction for automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 890-893. |
0.411 |
|
2008 |
Thomas S, Ganapathy S, Hermansky H. Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1521-1524. |
0.458 |
|
2007 |
Prasanna SRM, Hermansky H. MRASTA and PLP in automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 137-140. |
0.416 |
|
2005 |
Morgan N, Zhu Q, Stolcke A, Sönmez K, Sivadas S, Shinozaki T, Ostendorf M, Jain P, Hermansky H, Ellis D, Doddington G, Chen B, Çetin O, Bourlard H, Athineos M. Pushing the envelope - Aside Ieee Signal Processing Magazine. 22: 81-88. DOI: 10.1109/Msp.2005.1511826 |
0.505 |
|
2004 |
Ikbal S, Misra H, Bourlard H, Hermansky H. Phase AutoCorrelation (PAC) features in entropy based multi-stream for robust speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I205-I208. |
0.377 |
|
2003 |
Hermansky H. Recognition of information‐bearing elements in speech The Journal of the Acoustical Society of America. 114: 2424-2424. DOI: 10.1121/1.4778809 |
0.496 |
|
2003 |
Hermansky H. TRAP-TANDEM: Data-driven extraction of temporal features from speech 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 255-260. DOI: 10.1109/ASRU.2003.1318450 |
0.343 |
|
2003 |
Malayath N, Hermansky H. Data-driven spectral basis functions for automatic speech recognition Speech Communication. 40: 449-466. DOI: 10.1016/S0167-6393(02)00127-9 |
0.515 |
|
2000 |
Hermansky H. Method and system for generating an estimated clean speech signal from a noisy speech signal The Journal of the Acoustical Society of America. 107: 1816. DOI: 10.1121/1.428550 |
0.415 |
|
2000 |
Yang HH, Van Vuuren S, Sharma S, Hermansky H. Relevance of time-frequency features for phonetic and speaker-channel classification Speech Communication. 31: 35-50. DOI: 10.1016/S0167-6393(00)00007-8 |
0.406 |
|
2000 |
Malayath N, Hermansky H, Kajarekar S, Yegnanarayana B. Data-driven temporal filters and alternatives to GMM in speaker verification Digital Signal Processing: a Review Journal. 10: 55-74. DOI: 10.1006/dspr.1999.0363 |
0.363 |
|
2000 |
Kajarekar SS, Hermansky H. Analysis of information in speech and its application in speech recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1902: 283-288. |
0.357 |
|
1999 |
Arai T, Pavel M, Hermansky H, Avendano C. Syllable intelligibility for temporally filtered LPC cepstral trajectories. The Journal of the Acoustical Society of America. 105: 2783-91. PMID 10335630 DOI: 10.1121/1.426895 |
0.359 |
|
1999 |
Hermansky H. Data‐driven speech analysis for ASR The Journal of the Acoustical Society of America. 105: 1352-1352. DOI: 10.1121/1.426410 |
0.505 |
|
1999 |
Sharma S, Hermansky H. Recognition of speech from temporal patterns The Journal of the Acoustical Society of America. 105: 1158-1158. DOI: 10.1121/1.425505 |
0.499 |
|
1999 |
Kanedera N, Arai T, Hermansky H, Pavel M. On the relative importance of various components of the modulation spectrum for automatic speech recognition Speech Communication. 28: 43-55. DOI: 10.1016/S0167-6393(99)00002-3 |
0.499 |
|
1999 |
Yegnanarayana B, Avendano C, Hermansky H, Satyanarayana Murthy P. Speech enhancement using linear prediction residual Speech Communication. 28: 25-42. DOI: 10.1016/S0167-6393(98)00070-3 |
0.453 |
|
1998 |
Kanedera N, Hermansky H, Arai T. On properties of modulation spectrum for robust automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2: 613-616. DOI: 10.1109/ICASSP.1998.675339 |
0.408 |
|
1998 |
Hermansky H. Should recognizers have ears? Speech Communication. 25: 3-27. DOI: 10.1016/S0167-6393(98)00027-2 |
0.466 |
|
1997 |
Hermansky H, Morgan NH. Noise resistant auditory model for parameterization of speech The Journal of the Acoustical Society of America. 101: 2426. DOI: 10.1121/1.418514 |
0.41 |
|
1997 |
Avendano C, Hermansky H. On the effects of short-term spectrum smoothing in channel normalization Ieee Transactions On Speech and Audio Processing. 5: 372-374. DOI: 10.1109/89.593318 |
0.301 |
|
1996 |
Hermansky H. Beyond a “short‐term” analysis of speech The Journal of the Acoustical Society of America. 100: 2792-2792. DOI: 10.1121/1.416495 |
0.475 |
|
1996 |
Arai T, Pavel M, Hermansky H, Avendano C. Intelligibility of speech with filtered time trajectories of LPC cepstrum The Journal of the Acoustical Society of America. 100: 2756-2756. DOI: 10.1121/1.416322 |
0.458 |
|
1996 |
Bourlard H, Hermansky H, Morgan N. Towards increasing speech recognition error rates Speech Communication. 18: 205-231. DOI: 10.1016/0167-6393(96)00003-9 |
0.438 |
|
1995 |
Cole R, Hermansky H, Novick DG, Oviatt S, Hirschman L, Atlas L, Beckman M, Biermann A, Bush M, Clements M, Cohen J, Garcia O, Hanson B, Levinson S, McKeown K, et al. The Challenge of Spoken Language Systems: Research Directions for the Nineties Ieee Transactions On Speech and Audio Processing. 3: 1-21. DOI: 10.1109/89.365385 |
0.384 |
|
1995 |
Morgan N, Bourlard H, Greenberg S, Hermansky H, Wu SL. Stochastic perceptual models of speech Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 397-400. |
0.312 |
|
1994 |
Pavel M, Hermansky H. Temporal masking in automatic speech recognition The Journal of the Acoustical Society of America. 95: 2876-2876. DOI: 10.1121/1.409409 |
0.527 |
|
1994 |
Hermansky H, Morgan N. RASTA Processing of Speech Ieee Transactions On Speech and Audio Processing. 2: 578-589. DOI: 10.1109/89.326616 |
0.49 |
|
1993 |
Junqua JC, Wakita H, Hermansky H. Evaluation and Optimization of Perceptually-Based ASR Front-End Ieee Transactions On Speech and Audio Processing. 1: 39-48. DOI: 10.1109/89.221366 |
0.459 |
|
1993 |
Hermansky H, Morgan N, Hirsch HG. Recognition of speech in additive and convolutional noise based on RASTA spectral processing Proceedings - Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing. 2: II-83-II-86. |
0.334 |
|
1991 |
Morgan N, Hermansky H, Bourlard H, Kohn P, Wooters C. Continuous speech recognition using PLP analysis with multilayer perceptrons Proceedings - Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing. 1: 49-52. |
0.392 |
|
1990 |
Hermansky H. Perceptual linear predictive (PLP) analysis of speech. The Journal of the Acoustical Society of America. 87: 1738-52. PMID 2341679 DOI: 10.1121/1.399423 |
0.458 |
|
1990 |
Hermansky H, Cox TL. Synthesis of speech from the low‐dimensional PLP representation The Journal of the Acoustical Society of America. 88: S179-S180. DOI: 10.1121/1.2028800 |
0.425 |
|
1988 |
Terry M, Hermansky H. Comparison of standard ASR front ends and auditory models in neural net‐based automatic speech recognition The Journal of the Acoustical Society of America. 83: S53-S53. DOI: 10.1121/1.2025401 |
0.441 |
|
1987 |
Hermansky H. Should ASR front‐end be insensitive to fundamental frequency? (perceptual shift of formant position due to fine harmonic structure of voiced speech) The Journal of the Acoustical Society of America. 82: S36-S36. DOI: 10.1121/1.2024778 |
0.384 |
|
1987 |
Hermansky H. Why is the formant frequency difference limen asymmetric? The Journal of the Acoustical Society of America. 81: S18-S18. DOI: 10.1121/1.2024129 |
0.355 |
|
1986 |
Hermansky H, Javkin HR. Evaluation of ASR front ends using synthetic vowel‐like sounds The Journal of the Acoustical Society of America. 80: S18-S18. DOI: 10.1121/1.2023687 |
0.316 |
|
1986 |
Tsuga K, Hermansky H. Effect of the spectral model order in automatic speech recognition The Journal of the Acoustical Society of America. 80: S18-S18. DOI: 10.1121/1.2023684 |
0.352 |
|
1985 |
Hanson BA, Hermansky H, Wakita H. Root‐power sums and spectral slope distortion measures for all‐pole models of speech The Journal of the Acoustical Society of America. 78: S49-S49. DOI: 10.1121/1.2022847 |
0.407 |
|
1985 |
Hermansky H, Hanson BA, Wakita H. Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain Speech Communication. 4: 181-187. DOI: 10.1016/0167-6393(85)90045-7 |
0.451 |
|
1984 |
Hermansky H, Hanson BA, Wakita H. Critical‐band‐weighted linear prediction of speech The Journal of the Acoustical Society of America. 76: S1-S1. DOI: 10.1121/1.2021743 |
0.406 |
|
Low-probability matches (unlikely to be authored by this person) |
2004 |
Misra H, Ikbal S, Bourlard H, Hermansky H. Spectral entropy based feature for robust ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I193-I196. |
0.298 |
|
2003 |
Ikbal S, Hermansky H, Bourlard H. Nonlinear spectral transformations for robust speech recognition 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 393-398. DOI: 10.1109/ASRU.2003.1318473 |
0.297 |
|
2008 |
Pinto J, Hermansky H. Combining evidence from a generative and a discriminative model in phoneme recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2414-2417. |
0.293 |
|
2012 |
Weinshall D, Zweig A, Hermansky H, Kombrink S, Ohl FW, Anemüller J, Bach JH, Van Gool L, Nater F, Pajdla T, Havlena M, Pavel M. Beyond novelty detection: incongruent events, when general and specific classifiers disagree. Ieee Transactions On Pattern Analysis and Machine Intelligence. 34: 1886-901. PMID 22213766 DOI: 10.1109/Tpami.2011.279 |
0.288 |
|
2011 |
Sivaram GSVS, Hermansky H. Multilayer perceptron with sparse hidden outputs for phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5336-5339. DOI: 10.1109/ICASSP.2011.5947563 |
0.288 |
|
1991 |
Morgan N, Wooters C, Hermansky H. Experiments with temporal resolution for continuous speech recognition with multi-layer perceptrons Neural Networks For Signal Processing. 405-410. |
0.288 |
|
2013 |
Clark P, Mallidi SH, Jansen A, Hermansky H. Frequency offset correction in speech without detecting pitch Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7020-7024. DOI: 10.1109/ICASSP.2013.6639023 |
0.287 |
|
2013 |
Hermansky H. Long, deep and wide artificial neural nets for dealing with unexpected noise in machine recognition of speech Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8082: 14-21. DOI: 10.1007/978-3-642-40585-3_2 |
0.284 |
|
2007 |
Motlicek P, Hermansky H, Ganapathy S, Garudadri H. Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4629: 350-357. |
0.283 |
|
2014 |
Kintzley K, Jansen A, Hermansky H. Featherweight phonetic keyword search for conversational speech Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7859-7863. DOI: 10.1109/ICASSP.2014.6855130 |
0.283 |
|
2011 |
Carlin MA, Thomas S, Jansen A, Hermansky H. Rapid evaluation of speech representations for spoken term discovery Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 821-824. |
0.283 |
|
2013 |
Schatz T, Peddinti V, Bach F, Jansen A, Hermansky H, Dupoux E. Evaluating speech features with the minimal-pair ABX task: Analysis of the classical MFC/PLP pipeline Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1781-1785. |
0.273 |
|
2008 |
Sivaram GSVS, Hermansky H. Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition European Signal Processing Conference. |
0.27 |
|
2009 |
Pavel M, Slaney M, Hermansky H. Reconciliation of human and machine speech recognition performance Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1669-1672. DOI: 10.1109/ICASSP.2009.4959922 |
0.268 |
|
2013 |
Hermansky H, Cohen JR, Stern RM. Perceptual properties of current speech recognition technology Proceedings of the Ieee. 101: 1968-1985. DOI: 10.1109/JPROC.2013.2252316 |
0.267 |
|
2003 |
Kajarekar SS, Hermansky H. Analysis of information in speech based on MANOVA Advances in Neural Information Processing Systems. |
0.263 |
|
2006 |
Motlíček P, Hermansky H, Garudadri H, Srinivasamurthy N. Speech coding based on spectral dynamics Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4188: 471-478. |
0.262 |
|
2013 |
Ma J, Zhang B, Matsoukas S, Mallidi SH, Li F, Hermansky H. Improvements in language identification on the RATS noisy speech corpus Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 69-73. |
0.255 |
|
2006 |
Fousek P, Hermansky H. Towards asr based on hierarchical posterior-based keyword recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I433-I436. |
0.252 |
|
2007 |
Valente F, Vepa J, Hermansky H. Multi-stream features combination based on Dempster-Shafer rule for LVCSR system Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 273-276. |
0.251 |
|
2012 |
Ganapathy S, Hermansky H. Robust phoneme recognition using high resolution temporal envelopes 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1826-1829. |
0.249 |
|
2014 |
Schatz T, Peddinti V, Cao XN, Bach F, Hermansky H, Dupoux E. Evaluating speech features with the Minimal-Pair ABX task (II): Resistance to noise Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 915-919. |
0.246 |
|
2012 |
Hirsch HG, Ganapathy S, Hermansky H. Comparison of different approaches for speech recognition in hands-free mode Proceedings of 10th Itg Symposium On Speech Communication. |
0.243 |
|
2013 |
Hermansky H, Variani E, Peddinti V. Mean temporal distance: Predicting ASR error from temporal properties of speech signal Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7423-7426. DOI: 10.1109/ICASSP.2013.6639105 |
0.239 |
|
2009 |
Motlicek P, Ganapathy S, Hermansky H. Arithmetic coding of sub-band residuals in FDLP speech/audio Codec Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2591-2594. |
0.237 |
|
2010 |
Sivaram GSVS, Ganapathy S, Hermansky H. Sparse auto-associative neural networks: Theory and application to speech recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2270-2273. |
0.231 |
|
2003 |
Matějka P, Schwarz P, Hermansky H, Černocky J. Phoneme recognition using temporal patterns Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science). 2807: 198-205. |
0.231 |
|
2002 |
Sivadas S, Hermansky H. Hierarchical tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I/809-I/812. |
0.224 |
|
1998 |
Yegnanarayana B, Avendano C, Murthy PS, Hermansky H. Enhancement of reverberant speech using LP residual Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 405-408. |
0.222 |
|
1989 |
Broad DJ, Hermansky H. The front‐cavity/F2′ hypothesis tested by data on tongue movements The Journal of the Acoustical Society of America. 86: S113-S114. DOI: 10.1121/1.2027307 |
0.218 |
|
2003 |
Sivadas S, Hermansky H. Generalized Tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 56-59. |
0.216 |
|
2010 |
Delbruck T, Koch T, Berner R, Hermansky H. Fully integrated 500uW speech detection wake-up circuit Iscas 2010 - 2010 Ieee International Symposium On Circuits and Systems: Nano-Bio Circuit Fabrics and Systems. 2015-2018. DOI: 10.1109/ISCAS.2010.5537160 |
0.214 |
|
2008 |
White C, Zweig G, Burget L, Schwarz P, Hermansky H. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4085-4088. DOI: 10.1109/ICASSP.2008.4518552 |
0.213 |
|
2000 |
Yang HH, Hermansky H. Search for information bearing components in speech Advances in Neural Information Processing Systems. 803-809. |
0.212 |
|
2007 |
Ketabdar H, Hannemann M, Hermansky H. Detection of out-of-vocabulary words in posterior based ASR International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007. 4: 2772-2775. |
0.208 |
|
2005 |
Hermansky H, Fousek P, Lehtonen M. The role of speech in multimodal human-computer interaction (towards reliable rejection of non-keyword input) Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3658: 2-8. |
0.208 |
|
2023 |
Luo S, Angrick M, Coogan C, Candrea DN, Wyse-Sookoo K, Shah S, Rabbani Q, Milsap GW, Weiss AR, Anderson WS, Tippett DC, Maragakis NJ, Clawson LL, Vansteensel MJ, Wester BA, ... ... Hermansky H, et al. Stable Decoding from a Speech BCI Enables Control for an Individual with ALS without Recalibration for 3 Months. Advanced Science (Weinheim, Baden-Wurttemberg, Germany). e2304853. PMID 37875404 DOI: 10.1002/advs.202304853 |
0.206 |
|
2016 |
Mallidi SH, Ogawa T, Hermansky H. Uncertainty estimation of DNN classifiers 2015 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2015 - Proceedings. 283-288. DOI: 10.1109/ASRU.2015.7404806 |
0.203 |
|
2008 |
Pinto J, Yegnanarayana B, Hermansky H, Magimai-Doss M. Exploiting contextual information for improved phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4449-4452. DOI: 10.1109/ICASSP.2008.4518643 |
0.203 |
|
2008 |
Pinto J, Sivaram GSVS, Hermansky H. Reverse correlation for analyzing MLP posterior features in ASR Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 469-476. DOI: 10.1007/978-3-540-87391-4_60 |
0.201 |
|
2011 |
Zweig G, Nguyen P, Van Compernolle D, Demuynck K, Atlas L, Clark P, Sell G, Wang M, Sha F, Hermansky H, Karakos D, Jansen A, Thomas S, S GSVS, Bowman S, et al. Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5044-5047. DOI: 10.1109/ICASSP.2011.5947490 |
0.198 |
|
2007 |
Motlicek P, Ullal V, Hermansky H. Wide-band perceptual audio coding based on frequency-domain linear prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I265-I268. DOI: 10.1109/ICASSP.2007.366667 |
0.198 |
|
2007 |
Valente F, Vepa J, Plahl C, Gollan C, Hermansky H, Schlüter R. Hierarchical Neural Networks feature extraction for LVCSR system Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 265-268. |
0.195 |
|
2004 |
Sivadas S, Hermansky H. On use of task independent training data in tandem feature extraction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I541-I544. |
0.195 |
|
2012 |
Kintzley K, Jansen A, Hermansky H. MAP estimation of whole-word acoustic models with dictionary priors 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 786-789. |
0.191 |
|
2008 |
Valente F, Hermansky H. On the combination of auditory and modulation frequency channels for ASR applications Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2242-2245. |
0.186 |
|
2008 |
Tošić T, Magimai-Doss M, Hermansky H. Using comparison of parallel phoneme probability streams for OOV word detection European Signal Processing Conference. |
0.181 |
|
2013 |
Li F, Hermansky H. Effect of filter bandwidth and spectral sampling rate of analysis filterbank on automatic phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7121-7124. DOI: 10.1109/ICASSP.2013.6639044 |
0.177 |
|
2012 |
Variani E, Hermansky H. Estimating classifier performance in unknown noise 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 2: 1798-1801. |
0.173 |
|
2012 |
Li F, Mallidi SH, Hermansky H. Phone recognition in critical bands using sub-band temporal modulations 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1814-1817. |
0.169 |
|
2009 |
Ganapathy S, Motlicek P, Hermansky H. MDCT for encoding residual signals in frequency domain linear prediction 127th Audio Engineering Society Convention 2009. 2: 1103-1110. |
0.168 |
|
2010 |
Thomas S, Ganapathy S, Hermansky H. Cross-lingual and multi-stream posterior features for low resource LVCSR systems Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 877-880. |
0.168 |
|
2007 |
Pinto J, Lovitt A, Hermansky H. Exploiting phoneme similarities in hybrid HMM-ANN keyword spotting International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007. 4: 2388-2391. |
0.161 |
|
2007 |
Valente F, Hermansky H. Combination of acoustic classifiers based on Dempster-Shafer theory of evidence Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4: IV1129-IV1132. DOI: 10.1109/ICASSP.2007.367273 |
0.158 |
|
2006 |
Valente F, Hermansky H. Discriminant linear processing of time-frequency plane Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 349-352. |
0.151 |
|
2023 |
Angrick M, Luo S, Rabbani Q, Candrea DN, Shah S, Milsap GW, Anderson WS, Gordon CR, Rosenblatt KR, Clawson L, Maragakis N, Tenore FV, Fifer MS, Hermansky H, Ramsey NF, et al. Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS. Medrxiv : the Preprint Server For Health Sciences. PMID 37425721 DOI: 10.1101/2023.06.30.23291352 |
0.144 |
|
2008 |
Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Autoregressive modelling of Hilbert envelopes for wide-band audio coding Audio Engineering Society - 124th Audio Engineering Society Convention 2008. 3: 1620-1627. |
0.143 |
|
2010 |
Hermansky H. History of modulation spectrum in ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5458-5461. DOI: 10.1109/ICASSP.2010.5494907 |
0.138 |
|
2002 |
Adami AG, Kajarekar SS, Hermansky H. A new speaker change detection method for two-speaker segmentation Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4: IV/3908-IV/3911. |
0.135 |
|
2008 |
Sivaram GSVS, Hermansky H. Emulating temporal receptive fields of higher level auditory neurons for ASR Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 509-516. DOI: 10.1007/978-3-540-87391-4_65 |
0.13 |
|
2013 |
Peddinti V, Hermansky H. Filter-bank optimization for Frequency Domain Linear Prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7102-7106. DOI: 10.1109/ICASSP.2013.6639040 |
0.128 |
|
2013 |
Ogawa T, Li F, Hermansky H. Stream selection and integration in multistream ASR using GMM-based performance monitoring Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 3332-3336. |
0.114 |
|
2011 |
Sivaram GSVS, Thomas S, Hermansky H. Mixture of auto-associative neural networks for speaker verification Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2381-2384. |
0.11 |
|
2008 |
Valente F, Hermansky H. Hierarchical and parallel processing of modulation spectrum for ASR applications Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4165-4168. DOI: 10.1109/ICASSP.2008.4518572 |
0.109 |
|
2001 |
Kajarekar SS, Yegnanarayana B, Hermansky H. A study of two dimensional linear discriminants for ASR Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 137-140. |
0.105 |
|
2009 |
Pinto J, Sivaram GSVS, Hermansky H, Magimai-Doss M. Volterra series for analyzing MLP based phoneme posterior estimator Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1813-1816. DOI: 10.1109/ICASSP.2009.4959958 |
0.099 |
|
2005 |
Hermansky H, Fousek P. Multi-resolution RASTA filtering for TANDEM-based ASR 9th European Conference On Speech Communication and Technology. 361-364. |
0.087 |
|
2012 |
Kintzley K, Jansen A, Church K, Hermansky H. Inverting the point process model for fast phonetic keyword search 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 2437-2440. |
0.084 |
|
2005 |
Verhelst W, Herre J, Kubin G, Hermansky H, Jensen SH. Eurasip Journal on Applied Signal Processing: Editorial Eurasip Journal On Applied Signal Processing. 2005: 1289-1291. DOI: 10.1155/ASP.2005.1289 |
0.084 |
|
2009 |
Stricker C, Wagen JF, Aradilla G, Bourlard H, Hermansky H, Pinto J, Rey PH, Théraulaz J. Intelligent multi-modal interfaces for mobile applications in hostile environment (IM-HOST) Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5440: 71-102. DOI: 10.1007/978-3-642-00437-7_4 |
0.083 |
|
2008 |
Burget L, Schwarz P, Matějka P, Hannemann M, Rastrow A, White C, Khudanpur S, Hermansky H, Černocký J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4081-4084. DOI: 10.1109/ICASSP.2008.4518551 |
0.078 |
|
2011 |
Kintzley K, Jansen A, Hermansky H. Event selection from phone posteriorgrams using matched filters Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1905-1908. |
0.068 |
|
2012 |
Anemüller J, Caputo B, Hermansky H, Ohl FW, Pajdla T, Pavel M, Van Gool L, Vogels R, Wabnik S, Weinshall D. DIRAC: Detection and identification of rare audio-visual events Studies in Computational Intelligence. 384: 3-35. DOI: 10.1007/978-3-642-24034-8_1 |
0.043 |
|
2008 |
Anemüller J, Bach JH, Caputo B, Havlena M, Jie L, Kayser H, Leibe B, Motlicek P, Pajdla T, Pavel M, Torii A, Gool LV, Zweig A, Hermansky H. The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events Icmi'08: Proceedings of the 10th International Conference On Multimodal Interfaces. 289-292. DOI: 10.1145/1452392.1452451 |
0.038 |
|
2010 |
Jansen A, Church K, Hermansky H. Towards spoken term discovery at scale with zero resources Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 1676-1679. |
0.035 |
|
2015 |
Hermansky H, Burget L, Cohen J, Dupoux E, Feldman N, Godfrey J, Khudanpur S, Maciejewski M, Mallidi SH, Menon A, Ogawa T, Peddinti V, Rose R, Stern R, Wiesner M, et al. Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2015: 5009-5013. DOI: 10.1109/ICASSP.2015.7178924 |
0.019 |
|
2000 |
Hermansky H. Preface Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1902: V. |
0.01 |
|