Mark Hasegawa-Johnson - Publications

Affiliations: 
University of Illinois, Urbana-Champaign, IL
Area:
Computer Science, Linguistics Language

82 high-probability publications.

Year Citation  Score
2022 Gao H, Ni J, Zhang Y, Qian K, Chang S, Hasegawa-Johnson M. Domain Generalization for Language-Independent Automatic Speech Recognition. Frontiers in Artificial Intelligence. 5: 806274. PMID 35647534 DOI: 10.3389/frai.2022.806274  0.311
2021 Li J, Hasegawa-Johnson M, McElwain NL. Analysis of acoustic and voice quality features for the classification of infant and mother vocalizations. Speech Communication. 133: 41-61. PMID 36062214 DOI: 10.1016/j.specom.2021.07.010  0.321
2020 Wang L, Hasegawa-Johnson M. Multimodal Word Discovery and Retrieval With Spoken Descriptions and Visual Concepts Ieee/Acm Transactions On Audio, Speech, and Language Processing. 28: 1560-1573. DOI: 10.1109/Taslp.2020.2996082  0.45
2020 Scharenborg O, Besacier L, Black A, Hasegawa-Johnson M, Metze F, Neubig G, Stuker S, Godard P, Muller M, Ondel L, Palaskar S, Arthur P, Ciannella F, Du M, Larsen E, et al. Speech Technology for Unwritten Languages Ieee/Acm Transactions On Audio, Speech, and Language Processing. 28: 964-975. DOI: 10.1109/Taslp.2020.2973896  0.488
2018 He D, Lim BP, Yang X, Hasegawa-Johnson M, Chen D. Acoustic landmarks contain more information about the phone string than other frames for automatic speech recognition with deep neural network acoustic model. The Journal of the Acoustical Society of America. 143: 3207. PMID 29960420 DOI: 10.1121/1.5039837  0.705
2017 He D, Lim BPP, Yang X, Hasegawa-Johnson M, Chen D. Selecting frames for automatic speech recognition based on acoustic landmarks Journal of the Acoustical Society of America. 141: 3468-3468. DOI: 10.1121/1.4987204  0.525
2017 Kong X, Yang X, Hasegawa-Johnson M, Choi J, Shattuck-Hufnagel S. Landmark-based consonant voicing detection on multilingual corpora Journal of the Acoustical Society of America. 141: 3468-3468. DOI: 10.1121/1.4987203  0.743
2016 Chen W, Hasegawa-Johnson M, Chen NF. Mismatched Crowdsourcing based Language Perception for Under-resourced Languages Procedia Computer Science. 81: 23-29. DOI: 10.1016/j.procs.2016.04.025  0.328
2016 Livescu K, Rudzicz F, Fosler-Lussier E, Hasegawa-Johnson M, Bilmes J. Speech Production in Speech Technologies: Introduction to the CSL Special Issue Computer Speech and Language. 36: 165-172. DOI: 10.1016/J.Csl.2015.11.002  0.465
2015 Zhang Y, Ou Z, Hasegawa-Johnson M. Incorporating AM-FM effect in voiced speech for probabilistic acoustic tube model 2015 Ieee Workshop On Applications of Signal Processing to Audio and Acoustics, Waspaa 2015. DOI: 10.1109/WASPAA.2015.7336905  0.415
2015 Huang PS, Kim M, Hasegawa-Johnson M, Smaragdis P. Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation Ieee/Acm Transactions On Speech and Language Processing. 23: 2136-2147. DOI: 10.1109/Taslp.2015.2468583  0.403
2015 Chen K, Hasegawa-Johnson M. Improving the robustness of prosody dependent language modeling based on prosody syntax dependence 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 435-440. DOI: 10.1109/ASRU.2003.1318480  0.335
2015 Pietrowicz M, Hasegawa-Johnson M, Karahalios K. Acoustic correlates for perceived effort levels in expressive speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2015: 3720-3724.  0.35
2015 Jyothi P, Hasegawa-Johnson M. Acquiring speech transcriptions using mismatched crowdsourcing Proceedings of the National Conference On Artificial Intelligence. 2: 1263-1269.  0.336
2015 Jyothi P, Hasegawa-Johnson M. Transcribing continuous speech using mismatched crowdsourcing Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2015: 2774-2778.  0.386
2014 Chen A, Hasegawa-Johnson MA. Mixed stereo audio classification using a stereo-input mixed-to-panned level feature Ieee/Acm Transactions On Speech and Language Processing. 22: 2025-2033. DOI: 10.1109/TASLP.2014.2359628  0.435
2014 Zhang Y, Ou Z, Hasegawa-Johnson M. Improvement of Probabilistic Acoustic Tube model for speech decomposition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7929-7933. DOI: 10.1109/ICASSP.2014.6855144  0.331
2014 Khasanova A, Cole J, Hasegawa-Johnson M. Detecting articulatory compensation in acoustic data through linear regression modeling Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 925-929.  0.308
2014 Jyothi P, Cole J, Hasegawa-Johnson M, Puri V. An investigation of prosody in Hindi narrative speech Proceedings of the International Conference On Speech Prosody. 623-627.  0.398
2013 Yoon S, Pierce L, Huensch A, Juul E, Perkins S, Sproat R, Hasegawa-Johnson M. Construction of a Rated Speech Corpus of L2 Learners' Spontaneous Speech Calico Journal. 26: 662-673. DOI: 10.1558/Cj.V26I3.662-673  0.375
2013 Sharma HV, Hasegawa-Johnson M. Acoustic model adaptation using in-domain background models for dysarthric speech recognition Computer Speech and Language. 27: 1147-1162. DOI: 10.1016/J.Csl.2012.10.002  0.751
2012 Nam H, Mitra V, Tiede M, Hasegawa-Johnson M, Espy-Wilson C, Saltzman E, Goldstein L. A procedure for estimating gestural scores from speech acoustics. The Journal of the Acoustical Society of America. 132: 3980-9. PMID 23231127 DOI: 10.1121/1.4763545  0.497
2012 Rong P, Loucks T, Kim H, Hasegawa-Johnson M. Relationship between kinematics, F2 slope and speech intelligibility in dysarthria due to cerebral palsy. Clinical Linguistics & Phonetics. 26: 806-22. PMID 22876770 DOI: 10.3109/02699206.2012.706686  0.328
2012 Tang H, Chu SM, Hasegawa-Johnson M, Huang TS. Partially supervised speaker clustering. Ieee Transactions On Pattern Analysis and Machine Intelligence. 34: 959-71. PMID 21844626 DOI: 10.1109/Tpami.2011.174  0.357
2012 Mertens R, Huang P, Gottlieb L, Friedland G, Divakaran A, Hasegawa-Johnson M. On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks International Journal of Multimedia Data Engineering and Management. 3: 1-19. DOI: 10.4018/Jmdem.2012070101  0.492
2012 Kim H, Hasegawa-Johnson M. Second-formant locus patterns in dysarthric speech The Journal of the Acoustical Society of America. 132: 2089-2089. DOI: 10.1121/1.4755719  0.379
2012 Ozbek IY, Hasegawa-Johnson M, Demirekler M. On Improving Dynamic State Space Approaches to Articulatory Inversion With MAP-Based Parameter Estimation Ieee Transactions On Audio, Speech, and Language Processing. 20: 67-81. DOI: 10.1109/Tasl.2011.2157496  0.353
2012 Cole J, Hasegawa-Johnson M, Loehr D, Guilder LV, Reetz H, Frisch SA. Corpora, Databases, and Internet Resources: Corpus Phonology with Speech Resources Using The Internet For Collecting Phonological Data Speech Manipulation, Synthesis, and Automatic Recognition in Laboratory Phonology Phonotactic Patterns in Lexical Corpora The Oxford Handbook of Laboratory Phonology. DOI: 10.1093/oxfordhb/9780199575039.013.0017  0.366
2012 Mathur S, Poole MS, Peña-Mora F, Hasegawa-Johnson M, Contractor N. Detecting interaction links in a collaborating group using manually annotated data Social Networks. 34: 515-526. DOI: 10.1016/J.Socnet.2012.04.002  0.32
2012 Kim LH, Hasegawa-Johnson M. Optimal multi-microphone speech enhancement in cars Digital Signal Processing For in-Vehicle Systems and Safety. 195-204. DOI: 10.1007/978-1-4419-9607-7_13  0.344
2011 Kim H, Hasegawa-Johnson M, Perlman A. Vowel contrast and speech intelligibility in dysarthria. Folia Phoniatrica Et Logopaedica : Official Organ of the International Association of Logopedics and Phoniatrics (Ialp). 63: 187-94. PMID 20938200 DOI: 10.1159/000318881  0.364
2011 Hasegawa-Johnson MA, Huang J, King S, Zhou X. Normalized recognition of speech and audio events The Journal of the Acoustical Society of America. 130: 2524-2524. DOI: 10.1121/1.3655075  0.311
2011 Kim H, Hasegawa-Johnson M, Perlman A. Temporal and spectral characteristics of fricatives in dysarthria The Journal of the Acoustical Society of America. 130: 2446-2446. DOI: 10.1121/1.3654821  0.455
2011 Hasegawa-Johnson MA, Huang J, Zhuang X. Semi-supervised learning for speech and audio processing The Journal of the Acoustical Society of America. 130: 2408-2408. DOI: 10.1121/1.3654654  0.628
2011 Ozbek İY, Hasegawa-Johnson M, Demirekler M. Estimation of Articulatory Trajectories Based on Gaussian Mixture Model (GMM) With Audio-Visual Information Fusion and Dynamic Kalman Smoothing Ieee Transactions On Audio, Speech, and Language Processing. 19: 1180-1195. DOI: 10.1109/Tasl.2010.2087751  0.374
2011 Lobdell BE, Allen JB, Hasegawa-Johnson MA. Intelligibility predictors and neural representation of speech Speech Communication. 53: 185-194. DOI: 10.1016/J.Specom.2010.08.016  0.737
2011 Hasegawa-Johnson M, Goudeseune C, Cole J, Kaczmarski H, Kim H, King S, Mahrt T, Huang JT, Zhuang X, Lin KH, Sharma HV, Li Z, Huang TS. Multimodal speech and audio user interfaces for K-12 outreach Apsipa Asc 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011. 526-531.  0.648
2011 Mahrt T, Huang JT, Mo Y, Fleck M, Hasegawa-Johnson M, Cole J. Optimal models of prosodic prominence using the Bayesian information criterion Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2037-2040.  0.336
2010 Kim H, Martin K, Hasegawa-Johnson M, Perlman A. Frequency of consonant articulation errors in dysarthric speech. Clinical Linguistics & Phonetics. 24: 759-70. PMID 20831376 DOI: 10.3109/02699206.2010.497238  0.398
2010 Cole J, Mo Y, Hasegawa-Johnson M. Signal-based and expectation-based factors in the perception of prosodic prominence Laboratory Phonology. 1: 425-452. DOI: 10.1515/Labphon.2010.022  0.445
2010 Tang H, Hasegawa-Johnson M, Huang T. A novel vector representation of stochastic signals based on adapted ergodic HMMs Ieee Signal Processing Letters. 17: 715-718. DOI: 10.1109/Lsp.2010.2051945  0.39
2010 Zhuang X, Zhou X, Hasegawa-Johnson MA, Huang TS. Real-world acoustic event detection Pattern Recognition Letters. 31: 1543-1551. DOI: 10.1016/J.Patrec.2010.02.005  0.633
2010 Kim LH, Kim KT, Hasegawa-Johnson M. Robust automatic speech recognition with decoder oriented ideal binary mask estimation Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2066-2069.  0.324
2009 Huang TS, Hasegawa-Johnson MA, Chu SM, Zeng Z, Tang H. Sensitive Talking Heads Ieee Signal Processing Magazine. 26: 67-72. DOI: 10.1109/Msp.2009.932562  0.307
2009 Huang JT, Zhou X, Hasegawa-Johnson M, Huang T. Kernel metric learning for phonetic classification Proceedings of the 2009 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2009. 141-145. DOI: 10.1109/ASRU.2009.5373389  0.392
2009 Sharma HV, Hasegawa-Johnson M. Universal access: Speech recognition for talkers with spastic dysarthria Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1451-1454.  0.737
2008 Kim LH, Hasegawa-Johnson M, Lim JS, Sung KM. Acoustic model for robustness analysis of optimal multipoint room equalization. The Journal of the Acoustical Society of America. 123: 2043-53. PMID 18397012 DOI: 10.1121/1.2837285  0.535
2008 Tang H, Fu Y, Tu J, Hasegawa-Johnson M, Huang TS. Humanoid audio-visual avatar with emotive text-to-speech synthesis Ieee Transactions On Multimedia. 10: 969-981. DOI: 10.1109/Tmm.2008.2001355  0.335
2008 Kantor A, Hasegawa-Johnson M. Stream weight tuning in dynamic Bayesian networks Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4525-4528. DOI: 10.1109/ICASSP.2008.4518662  0.565
2008 Zhuang X, Zhou X, Huang TS, Hasegawa-Johnson M. Feature analysis and selection for acoustic event detection Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 17-20. DOI: 10.1109/ICASSP.2008.4517535  0.382
2008 Zhou X, Zhuang X, Liu M, Tang H, Hasegawa-Johnson M, Huang T. HMM-based acoustic event detection with adaboost feature selection Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4625: 345-353. DOI: 10.1007/978-3-540-68585-2_33  0.345
2008 Zhuang X, Hasegawa-Johnson M. Towards interpretation of creakiness in switchboard Proceedings of the 4th International Conference On Speech Prosody, Sp 2008. 37-40.  0.328
2008 Kim H, Hasegawa-Johnson M, Perlman A, Gunderson J, Huang T, Watkin K, Frame S. Dysarthric speech database for universal access research Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1741-1744.  0.379
2008 Lobdell BE, Hasegawa-Johnson MA, Allen JB. Human speech perception and feature extraction Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1797-1800.  0.726
2007 Zhu W, Hasegawa-Johnson M, Kantor A, Roth D, Park Y, Yang L. E-coder for Automatic Scoring Physical Activity Diary Data Medicine & Science in Sports & Exercise. 39: S190. DOI: 10.1249/01.Mss.0000273709.05036.8D  0.543
2007 Hasegawa-Johnson M. A multi-stream approach to audiovisual automatic speech recognition 2007 Ieee 9th International Workshop On Multimedia Signal Processing, Mmsp 2007 - Proceedings. 328-331. DOI: 10.1109/MMSP.2007.4412884  0.378
2007 Zhou X, Fu Y, Liu M, Hasegawa-Johnson M, Huang TS. Robust analysis and weighting on MFCC components for speech recognition and speaker identification Proceedings of the 2007 Ieee International Conference On Multimedia and Expo, Icme 2007. 188-191.  0.327
2006 Zhu W, Hasegawa-Johnson M, Roth D, Kantor A, Gao Y, Gandhi MA, Park Y, Yang L. Validation of an E-diary System for Assessing Physical Activities Medicine & Science in Sports & Exercise. 38: S102-S103. DOI: 10.1249/00005768-200605001-01354  0.521
2006 Chen K, Hasegawa-Johnson M, Cohen A, Borys S, Kim SS, Cole J, Choi JY. Prosody dependent speech recognition on radio news corpus of American English Ieee Transactions On Audio, Speech and Language Processing. 14: 232-244. DOI: 10.1109/Tsa.2005.853208  0.611
2006 Zhang T, Hasegawa-Johnson M, Levinson SE. Cognitive state classification in a spoken tutorial dialogue system Speech Communication. 48: 616-632. DOI: 10.1016/J.Specom.2005.09.006  0.462
2006 Zhang T, Hasegawa-Johnson M, Levinson SE. Extraction of pragmatic and semantic salience from spontaneous spoken English Speech Communication. 48: 437-462. DOI: 10.1016/J.Specom.2005.07.007  0.49
2005 Hasegawa-Johnson M, Baker J, Borys S, Chen K, Coogan E, Greenberg S, Juneja A, Kirchhoff K, Livescu K, Mohan S, Muller J, Sonmez K, Wang T. LANDMARK-BASED SPEECH RECOGNITION: REPORT OF THE 2004 JOHNS HOPKINS SUMMER WORKSHOP. Proceedings of the ... Ieee International Conference On Acoustics, Speech, and Signal Processing / Sponsored by the Institute of Electrical and Electronics Engineers Signal Processing Society. Icassp (Conference). 1: 1213-1216. PMID 19212454 DOI: 10.1109/ICASSP.2005.1415088  0.581
2005 Choi JY, Hasegawa-Johnson M, Cole J. Finding intonational boundaries using acoustic cues related to the voice source. The Journal of the Acoustical Society of America. 118: 2579-87. PMID 16266178 DOI: 10.1121/1.2010288  0.342
2005 Hasegawa-Johnson M, Chen K, Cole J, Borys S, Kim SS, Cohen A, Zhang T, Choi JY, Kim H, Yoon T, Chavarria S. Simultaneous recognition of words and prosody in the Boston University Radio Speech Corpus Speech Communication. 46: 418-439. DOI: 10.1016/J.Specom.2005.01.009  0.634
2005 Borys S, Hasegawa-Johnson M. Distinctive feature based SVM discriminant features for improvements to phone recognition on telephone band speech 9th European Conference On Speech Communication and Technology. 697-700.  0.349
2004 Omar M, Hasegawa-Johnson M. Model Enforcement: A Unified Feature Transformation Framework for Classification and Recognition Ieee Transactions On Signal Processing. 52: 2701-2710. DOI: 10.1109/Tsp.2004.834344  0.594
2004 Kim SS, Hasegawa-Johnson M, Chen K. Automatic recognition of pitch movements using multilayer perceptron and time-delay recursive neural network Ieee Signal Processing Letters. 11: 645-648. DOI: 10.1109/Lsp.2004.830114  0.341
2004 Chen K, Hasegawa-Johnson M, Cohen A. An automatic prosody labeling system using ANN-based syntactic-prosodic model and GMM-based acoustic-prosodic model Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1.  0.382
2004 Zheng Y, Hasegawa-Johnson M. Formant tracking by mixture state particle filter Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1.  0.419
2003 Hasegawa-Johnson M, Pizza S, Alwan A, Cha JS, Haker K. Vowel category dependence of the relationship between palate height, tongue height, and oral area. Journal of Speech, Language, and Hearing Research : Jslhr. 46: 738-53. PMID 14697000 DOI: 10.1044/1092-4388(2003/059)  0.53
2003 Zheng Y, Hasegawa-Johnson M, Pizza S. Analysis of the three-dimensional tongue shape using a three-index factor analysis model. The Journal of the Acoustical Society of America. 113: 478-86. PMID 12558285 DOI: 10.1121/1.1520538  0.443
2003 Lee B, Hasegawa-Johnson MA, Goudeseune C. Open‐loop dereverberation of multichannel room impulse responses The Journal of the Acoustical Society of America. 113: 2202-2203. DOI: 10.1121/1.4780198  0.509
2003 Omar MK, Hasegawa-Johnson M. Approximately Independent Factors of Speech Using Nonlinear Symplectic Transformation Ieee Transactions On Speech and Audio Processing. 11: 660-671. DOI: 10.1109/Tsa.2003.814457  0.637
2003 Zheng Y, Hasegawa-Johnson M. Particle filtering approach to Bayesian formant tracking Ieee Workshop On Statistical Signal Processing Proceedings. 2003: 601-604. DOI: 10.1109/SSP.2003.1289549  0.49
2003 Omar MK, Hasegawa-Johnson M. Strong-sense class-dependent features for statistical recognition Ieee Workshop On Statistical Signal Processing Proceedings. 2003: 490-493. DOI: 10.1109/SSP.2003.1289454  0.556
2003 Hasegawa-Johnson M. Bayesian learning for models of human speech perception Ieee Workshop On Statistical Signal Processing Proceedings. 2003: 408-411. DOI: 10.1109/SSP.2003.1289432  0.389
2003 Zheng Y, Hasegawa-Johnson M. Acoustic segmentation using switching state Kalman filter Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 752-755.  0.533
2002 Omar MK, Hasegawa-Johnson M. Maximum mutual information based acoustic-features representation of phonological features for speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1.  0.619
2002 Jing Z, Hasegawa-Johnson M. Auditory-modeling inspired methods of feature extraction for robust automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4.  0.355
2001 Beauchamp JW, Taube H, Tipei S, Wyatt SA, Haken L, Hasegawa-Johnson M. Acoustics, Audio, and Music Technology Education at the University of Illinois at Urbana‐Champaign The Journal of the Acoustical Society of America. 110: 2626-2626. DOI: 10.1121/1.4776867  0.367
2001 Omar MK, Hasegawa-Johnson M, Levinson S. Gaussian mixture models of phonetic boundaries for speech recognition 2001 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2001 - Conference Proceedings. 33-36. DOI: 10.1109/ASRU.2001.1034582  0.652
2001 Gunawan W, Hasegawa-Johnson M. PLP coefficients can be quantized at 400 BPS Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 77-80.  0.368