Professor Naomi Harte
Professor in Speech Technology, Electronic & Elect. Engineering
Biography
Naomi is Professor in Speech Technology in the School of Engineering in Trinity College. She is Co-PI and a founding member of the ADAPT SFI Centre. In ADAPT, she has led a major Research Theme centred on Multimodal Interaction involving researchers from universities across Ireland and was instrumental in developing the future vision for the Centre for 2021-2026. She is also a lead academic of the hugely successful Sigmedia Research Group in the School of Engineering. She was appointed as an SFI Engineering Initiative Lecturer in Digital Media in TCD in 2008 (Stokes Programme). Prior to returning to academia, Naomi worked in high-tech start-ups in the field of DSP Systems Development, including her own company. She also previously worked in McMaster University in Canada. She was a Visiting Professor at ICSI in 2015, and became a Fellow of TCD in 2017. She earned a Google Faculty Award in 2018 and was shortlisted for the AI Ireland Awards in 2019. She currently serves on the Editorial Board of Computer Speech and Language and was General Chair of INTERSPEECH 2023 in Dublin.
Naomi's research centres around Human Speech Communication. She likes to consider speech as something we both hear and see, with a strong multimodal aspect to her work. Her research involves the design and application of mathematical algorithms to enhance or augment speech communication between humans and technology. Much of that work is underpinned by signal processing and machine learning, but also requires an understanding of how humans interact. Her current research projects include audio-visual speech recognition, speech synthesis evaluation, multimodal speech analysis, and birdsong. Her industrial background brings a real-world approach to her research.
Publications and Further Research Outputs
Peer-Reviewed Publications
Sébastien Le Maguer, Simon King, Naomi Harte, The limits of the Mean Opinion Score for speech synthesis evaluation, Computer Speech and Language, 84, 2024
Kotey, S., Dahyot, R., Harte, N., Fine Grained Spoken Document Summarization Through Text Segmentation, 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings, 2023, p647-654
Kotey, S., Dahyot, R., Harte, N., Query Based Acoustic Summarization for Podcasts, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p1483-1487
Pandey, A., Edlund, J., Le Maguer, S., Harte, N., Listener sensitivity to deviating obstruents in WaveNet, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p1080-1084
Le Maguer, S., Anderson, M., Harte, N., Sp1NY: A Quick and Flexible Speech visualisation Tool in Python, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p2012-2013
Anderson, M., Kinnunen, T., Harte, N., Learnable Frontends That Do Not Learn: Quantifying Sensitivity To Filterbank Initialisation, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2023-June, 2023
Gonzales, M.G., Corcoran, P., Harte, N., Schukat, M., Joint Speech-Text Embeddings with Disentangled Speaker Features, 2023 34th Irish Signals and Systems Conference, ISSC 2023, 2023
A Karaali and N Harte and CR Jung, Deep Multi-Scale Feature Learning for Defocus Blur Estimation, IEEE Transactions on Image Processing, 2022
Pandey, A., Le Maguer, S., Carson-Berndsen, J., Harte, N., Production characteristics of obstruents in WaveNET and older TTS systems, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022-September, 2022, p2373-2377
Le Maguer, S., King, S., Harte, N., Back to the Future: Extending the Blizzard Challenge 2013, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022-September, 2022, p2378-2382
Ilaria Torre, Simon Holk, Elmira Yadollahi, Iolanda Leite, Rachel McDonnell, Naomi Harte, Smiling in the Face and Voice of Avatars and Robots: Evidence for a smiling McGurk Effect, IEEE Transactions on Affective Computing, 2022, p1-12
Reverdy, J., O'Connor Russell, S., Duquenne, L., Garaialde, D., Cowan, B., Harte, N., RoomReader: A Multimodal Corpus of Online Multiparty Conversational Interactions, 2022 Language Resources and Evaluation Conference, LREC 2022, 2022, p2517-2527
Sterpu, G., Harte, N., Taris: An online speech recognition framework with sequence to sequence neural networks for both audio-only and audio-visual speech, Computer Speech and Language, 74, 2022
Anderson, M., Harte, N., Learnable Acoustic Frontends in Bird Activity Detection, International Workshop on Acoustic Signal Enhancement, IWAENC 2022 - Proceedings, 2022
Jassim, W.A., Harte, N., Comparison of discrete transforms for deep-neural-networks-based speech enhancement, IET Signal Processing, 16, (4), 2022, p438-448
Torre, I. and Deichler, A. and Nicholson, M. and McDonnell, R. and Harte, N., To smile or not to smile: The effect of mismatched emotional expressions in a Human-Robot cooperative task, 2022, pp8-13
G Sterpu and C Saam and N Harte, Learning to count words in fluent speech enables online speech recognition, 2021 IEEE Spoken Language Technology Workshop (SLT), 2021, pp38-45
M Anderson and N Harte, Bioacoustic Event Detection with prototypical networks and data augmentation, 2021
Torre, Ilaria and Carrigan, Emma and Domijan, Katarina and McDonnell, Rachel and Harte, Naomi, Dimensional perception of a 'smiling McGurk effect', 9th International Conference on Affective Computing and Intelligent Interaction (ACII), 2021, pp1-8
Mark Anderson, John Kennedy, Naomi Harte, Low Resource Species Agnostic Bird Activity Detection, 2021 IEEE Workshop on Signal Processing Systems (SiPS), 2021, pp34-39
Ilaria Torre, Emma Carrigan, Katarina Domijan, Rachel McDonnell, Naomi Harte, The Effect of Audio-Visual Smiles on Social Influence in a Cooperative Human-Agent Interaction Task, ACM Transactions on Computer-Human Interaction (TOCHI), 28, (6), 2021, p1-38
Ayushi Pandey, Sébastien Le Maguer, Julie Berndsen, Naomi Harte, Mind your p's and k's--Comparing obstruents across TTS voices of the Blizzard Challenge 2013, Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 2021, pp166-171
Le Maguer, Sebastien and Harte, Naomi, Investigation of Auditory Nerve Model Based Analysis for Vocoded Speech Synthesis, 2020, pp1--6
Jassim, Wissam A and Harte, Naomi, Estimation of a priori signal-to-noise ratio using neurograms for speech enhancement, The Journal of the Acoustical Society of America, 147, (6), 2020, p3830--3848
Sterpu, G., Saam, C., Harte, N., Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-October, 2020, p3506-3509
Sterpu G., Saam C., Harte N., How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition, IEEE/ACM Transactions on Audio Speech and Language Processing, 28, 2020, p1052 - 1064
Roddy, Matthew and Harte, Naomi, Neural Generation of Dialogue Response Timings, Annual Conference of the Association for Computational Linguistics (ACL), 2020, pp2442-2452
Le Maguer, S., Harte, N., Can auditory nerve models tell us what's different about wavenet vocoded speech?, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-October, 2020, p230-234
Fernandez-Lopez, Adriana and Karaali, Ali and Harte, Naomi and Sukno, Federico M, ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp6294--6298
Ilaria Torre, Emma Carrigan, Rachel McDonnell, Katarina Domijan, Killian McCabe, Naomi Harte, The effect of multimodal emotional expression and agent appearance on trust in human-agent interaction, Proceedings - MIG 2019: ACM Conference on Motion, Interaction, and Games, 2019, pp1-6
Clark, L. and Cowan, B.R. and Edwards, J. and Edlund, J. and Szekely, E. and Munteanu, C. and Murad, C. and Healey, P. and Aylett, M. and Harte, N. and Torre, I. and Moore, R.K. and Doyle, P., Mapping theoretical and methodological perspectives for understanding speech interface interactions, CHI EA '19 Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems , (3299009), 2019
Ilaria Torre, Emma Carrigan, Killian McCabe, Rachel McDonnell, Naomi Harte, Survival at the museum: A cooperation experiment with emotionally expressive virtual characters, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction , 2018, pp423-427
Cullen, A. and Harte, N., A longitudinal database of Irish political speech with annotations of speaker ability, Language Resources and Evaluation, 52, (2), 2018, p401-432
Edmonds, C.J., Harte, N., Gardner, M., How does drinking water affect attention and memory? The effect of mouth rinsing and mouth drying on children's performance, Physiology and Behavior, 194, 2018, p233-238
Roddy, M. and Skantze, G. and Harte, N., Multimodal continuous turn-taking prediction using multiscale Rnns, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction, 2018, pp186-190
Laura Dungan, Ali Karaali, Naomi Harte, The Impact Of Reduced Video Quality On Visual Speech Recognition, IEEE International Conference on Image Processing, Athens, Greece, 2018
Sterpu, G. and Saam, C. and Harte, N., Attention-based audio-visual fusion for robust automatic speech recognition, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction, 2018, pp111-115
Sterpu, G. and Saam, C. and Harte, N., Can DNNs Learn to Lipread Full Sentences?, 2018 25th IEEE International Conference on Image Processing (ICIP), (8451388), 2018, pp16-20
Roddy, M. and Skantze, G. and Harte, N., Investigating speech features for continuous turn-taking prediction using LSTMs, Proc. Interspeech 2018, Interspeech 2018, 2018-September, 2018, pp586-590
Cullen, A. and Hines, A. and Harte, N., Perception and prediction of speaker appeal - A single speaker study, Computer Speech and Language, 52, 2018, p23-40
Dungan, L. and Karaali, A. and Harte, N., The impact of reduced video quality on visual speech recognition, 2018 25th IEEE International Conference on Image Processing (ICIP), (8451754), 2018, pp2560-2564
G Sterpu and N Harte, Towards Lipreading Sentences with Active Appearance Models, arXiv preprint arXiv:1805.11688, 2018
O'Reilly, C. and Analuddin, K. and Kelly, D.J. and Harte, N., Measuring vocal difference in bird population pairs, Journal of the Acoustical Society of America, 143, (3), 2018, p1658-1671
Wissam A. Jassim and Naomi Harte, Voice Activity Detection Using Neurograms, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Alberta, Canada, 15-20 April 2018, 2018
A Cullen and N Harte, Thin slicing to predict viewer impressions of TED Talks, ... of the 14th International Conference on ..., 2017
Jassim, W.A. and Paramesran, R. and Harte, N., Speech emotion classification using combined neurogram and INTERSPEECH 2010 paralinguistic challenge features, IET Signal Processing, 11, (5), 2017, p587-595
Roddy, M. and Harte, N., Detecting conversational gaze aversion using unsupervised learning, 2017-January, (8081172), 2017, pp76-80
C O'Reilly and N Harte, Pitch tracking of bird vocalizations and an automated process using YIN-bird, Cogent Biology, 2017
M Roddy and N Harte, Towards predicting dialog acts from previous speakers' non-verbal cues, BIBTEX 2017, 2017, pp1--
Sloan, C. and Harte, N. and Kelly, D. and Kokaram, A.C. and Hines, A., Objective Assessment of Perceptual Audio Quality Using ViSQOLAudio, IEEE Transactions on Broadcasting, 63, (4), 2017, p693-705
O'Reilly, C. and Kokuer, M. and Jancovic, P. and Drennan, R. and Harte, N., Automatic frequency feature extraction for bird species delimitation, 2017-January, (8081511), 2017, pp1759-1763
N Harte and P Jancovic and Karl-L. Schuchmann, Interspeech 2016 Special Session on Bird and Animal Vocalisations Organisers, In:Interspeech 2016, 2016
Jan Skoglund, Andrew J. Hines, Naomi A. Harte, Anil Kokaram, 'Objective speech quality metric', US, US20150199959A1, 2016, Google LLC
AJ Hines and J Skoglund and N Harte and A Kokaram, Detection of chopped speech, US Patent 9,263,061, 2016
O'Reilly C, Marples N.M, Kelly D.J, Harte N, YIN-bird: Improved pitch tracking for bird vocalisations, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2016, 08-12-September-2016, 2016, pp2641 - 2645
Andrew J. Hines, Jan Skoglund, Naomi Harte, Anil Kokaram, 'Detection of chopped speech', US, 2016, Google LLC
Hines A, Skoglund J, Kokaram A.C, Harte N, Monitoring voip speech quality for chopped and clipped speech, Komunikacie, 18, (1), 2016, p3 - 10
Sloan C, Harte N, Kelly D, Kokaram A.C, Hines A, Bitrate classification of twice-encoded audio using objective quality features, 2016 8th International Conference on Quality of Multimedia Experience, QoMEX 2016, 2016, 2016, pp7498956-
Hines A, Skoglund J, Kokaram A.C, Harte N, ViSQOL: an objective speech quality model, Eurasip Journal on Audio, Speech, and Music Processing, 2015, (1), 2015, p13-
Harte N, Gillen E, Hines A, TCD-VoIP, a research database of degraded speech for assessing quality in VoIP applications, 7th International Workshop on Quality of Multimedia Experience, QoMEX 2015, 26-29 May 2015 , IEEE, 2015, 7148100-
Hines A, Gillen E, Kelly D, Skoglund J, Kokaram A, Harte N, ViSQOLAudio: An objective audio quality metric for low bitrate codecs, Journal of the Acoustical Society of America, 137, (6), 2015, pEL449 - EL455
C. O'Reilly, D. J. Kelly, N. M. Marples and N. Harte , Quantifying difference in vocalizations of bird populations, Proceedings of Interspeech 2015, 2015, 2015, p3417 - 3421
Harte N, Gillen E, TCD-TIMIT: An audio-visual corpus of continuous speech, IEEE Transactions on Multimedia, 17, (5), 2015, p603 - 615
Hines A, Gillen E, Harte N, Measuring and monitoring speech quality for voice over IP with POLQA, ViSQOL and P.563, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2015, 2015-January, 2015, pp438 - 442
Kelly F, Harte N, Forensic comparison of ageing voices from automatic and auditory perspectives, International Journal of Speech, Language and the Law, 22, (2), 2015, p167 - 202
Pitie, F., Kelly, D., Foucu, T., Harte, N., Kokaram, A., Assessment of Audio/Video synchronisation in streaming media, 2014 6th International Workshop on Quality of Multimedia Experience, QoMEX 2014, 2014, pp171-176
Cullen, Ailbhe, Hines, Andrew and Harte, Naomi, Building a Database of Political Speech - Does culture matter in charisma annotations?, 4th International Workshop on Audio/Visual Emotion Challenge, AVEC 2014, AVEC'14: 4th International Audio/Visual Emotion Challenge and Workshop, Orlando, FL, 2014, pp27 - 31
Francois Pitie and Damien Kelly and Thierry Foucu and Naomi Harte and Anil C. Kokaram , Assessment of Audio/Video synchronisation in streaming media., International Workshop on Quality of Multimedia Experience, Singapore, 2014, pp171 - 176
Finnian Kelly, Rahim Saeidi, Naomi Harte, David van Leeuwen, Effect of long-term ageing on i-vector speaker verification, Computer Speech & Language, InterSpeech, Singapore, 2014, pp1068 - 1084
Andrew Hines, Eoin Gillen, Jan Skoglund, Damien Kelly, Anil Kokaram and Naomi Harte, Perceived Audio Quality for Streaming Stereo Music. , ACM Multimedia, Orlando, FL, USA, 2014, pp1173 - 1176
Ailbhe Cullen and Naomi Harte, Late Integration of Features for Acoustic Emotion Recognition, European Signal Processing Conference (EUSIPCO)., 2013, pp1 - 5
Finnian Kelly and Naomi Harte, Auditory detectability of vocal ageing and its effect on forensic automatic speaker recognition, InterSpeech, Lyon, France, 2013, pp2846 - 2850
K Pan and F Kelly and N Harte and S Murphy and DJ Kelly and ..., Shape Models for Image Segmentation in Microscopy, mee.tcd.ie, 2013
Sooknanan, Ken, Doyle, Jennifer, Wilson, James, Harte, Naomi, Kokaram, Anil and Corrigan, David, Mosaics For Burrow Detection in Underwater Surveillance Video, IEEE Oceans 2013, San Diego, USA, 2013, pp9 - 12
Finnian Kelly, Niko Brummer and Naomi Harte, Eigenageing Compensation for Speaker Verification. , InterSpeech , Lyon, France, 2013, pp1624 - 1628
Finnian Kelly, Andrzej Drygajlo and Naomi Harte , Speaker verification in score-ageing-quality classification space, Computer Speech & Language, 27, (5), 2013, p1068-1084
Hines, A., Skoglund, J., Kokaram, A., Harte, N., Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2013, pp3697-3701
Kelly, Finnian and Harte, Naomi in, editor(s)Michael Fairhurst , Age Factors in Biometric Processing, IET, 2013, [Kelly, Finnian and Harte, Naomi]
A Hines, J Skoglund, A Kokaram, N Harte, Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA, IEEE International Conference on Acoustics, Speech, and Signal Processing, Vancouver, Canada, 2013, pp3697 - 3701
Harte, Naomi, Murphy, Sadhbh, Kelly, David J. and Marples, Nicola M., Identifying new bird species from differences in birdsong. , INTERSPEECH, Lyon France., 2013, pp2900-2904
Ailbhe Cullen, John Kane, Thomas Drugman, and Naomi Harte , Creaky Voice and the Classification of Affect, Workshop on Affective Social Speech Signals (WASSS), Grenoble, France, 2013
A Hines, J Skoglund, A Kokaram, N Harte, Monitoring the Effects of Temporal Clipping on VoIP Speech Quality, Interspeech, Lyon, France, 2013, 2013, pp1188 - 1192
F Kelly and N Harte and M Fairhurst, The impact of ageing on speech-based biometric systems, Age Factors in Biometric Processing, 2013
K. Sooknanan, A. Kokaram, D. Corrigan, G. Baugh, N. Harte and J. Wilson, Indexing and Selection of Well-Lit Details in Underwater Video Mosaics Using Vignetting Estimation, Program Book - OCEANS 2012 MTS/IEEE Yeosu: The Living Ocean and Coast - Diversity of Resources and Sustainable Activities, International OCEANS Conference, Yeosu, South Korea, May, IEEE, 2012, Article number 6263541
Andrew Hines, Naomi Harte, Speech Intelligibility prediction using a Neurogram Similarity Index Measure, Speech Communication, 54, (2), 2012, p306-320
Andrew Hines, Naomi Harte, Improved Speech Intelligibility with a Chimaera Hearing Aid Algorithm, Interspeech, Portland, OR, ISCA, 2012, pp1 - 4
K. Sooknanan, A. Kokaram, D. Corrigan, G. Baugh, J. Wilson and N. Harte, Improving Underwater Visibility Using Vignetting Correction, Proceedings of SPIE - The International Society for Optical Engineering, Visual Information Processing and Communication, Burlingame, California, USA, January, 8305, SPIE, 2012, Article number 83050M
Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte, ViSQOL: The Virtual Speech Quality Objective Listener, The International Workshop on Acoustic Signal Enhancement (IWAENC), Aachen, Germany, 4-6 Sept. 2012, 2012, pp1 - 4
Cappelletta, L., Harte, N., Phoneme-to-viseme mapping for visual speech recognition, ICPRAM 2012 - Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods, 2, 2012, p322-329
Corrigan, D. ; Kokaram, A. ; Harte, N. , Algorithms for the Digital Restoration of Torn Film , Image Processing, IEEE Transactions on, 21, (2), 2012, p573-587
Hines, A., Skoglund, J., Kokaram, A., Harte, N., VISQOL: The virtual speech quality objective listener, International Workshop on Acoustic Signal Enhancement, IWAENC 2012, 2012
Kelly, F., Drygajlo, A., Harte, N., Compensating for ageing and quality variation in speaker verification, 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, 1, 2012, p498-501
F. Kelly , A. Drygajlo and N. Harte, Speaker Verification with Long-Term Ageing Data , International Conference on Biometrics (ICB), New Delhi, 2012, pp478 - 483
L Cappelletta and N Harte, Non Phonetic Viseme Definition for Visual-Only Speech Recognition, 2012, -
Luca Cappelletta and Naomi Harte, Viseme Definitions Comparison for Visual-Only Speech Recognition, European Signal Processing Conference (Eusipco), 2011, pp2109 - 2113
Finnian Kelly, Naomi Harte, Effects of Long-Term Ageing on Speaker Verification, Proceedings of the COST 2101 European conference on Biometrics and ID management, Springer-Verlag, 2011, pp113--124
C Berry and A Kokaram and N Harte, An extended multiresolution approach to mouth specific aam fitting for speech recognition, 2011 19th European Signal ..., 2011
Andrew Hines and Naomi Harte , Simulated performance intensity functions , Engineering in Medicine and Biology Society Conference (EMBC), EMBS (IEEE). , 2011, pp7139 - 7142
Andrew Hines and Naomi Harte, Comparing hearing aid algorithm performance using Simulated Performance Intensity Functions , Speech perception and auditory disorders, Int. Symposium on Audiological and Auditory Research (ISAAR), 2011
Craig Berry, Anil Kokaram, Naomi Harte, An Extended Multiresolution Approach to Mouth Specific AAM Fitting for Speech Recognition. , European Signal Processing Conference (Eusipco), 2011
Andrew Hines and Naomi Harte, Speech intelligibility from image processing, Speech Communication, 52, (9), 2010, p736 - 752
Finnian Kelly and Naomi Harte, Auditory Features Revisited for Robust Speech Recognition. , International Conference on Pattern Recognition (ICPR). , Istanbul, Turkey, Aug 2010, 2010, pp4456 - 4459
Finnian Kelly and Naomi Harte, Training GMMs for Speaker Verification. , IET Irish Signals and Systems Conference, Cork, Ireland, June 2010, 2010, pp163 - 168
Finnian Kelly and Naomi Harte, A Comparison of Auditory Features for Robust Speech Recognition. , European Signal Processing Conference (EUSIPCO 2010). , Aalborg, Denmark, August 2010, 2010
Luca Cappelletta and Naomi Harte, Nostril Detection for Robust Mouth Tracking, Irish Signals and Systems Conference, Cork, Ireland, 2010, pp239 - 244
A Hines and N Harte, Reproduction of the Performance/Intensity Function using image processing and an auditory nerve computational model, 2010
F Kelly and N Harte, A comparison of auditory features for robust speech recognition, presentation, 18th European Signal Processing ..., 2010
Andrew Hines and Naomi Harte, Evaluating Sensorineural Hearing Loss With An Auditory Nerve Model Using A Mean Structural Similarity Measure. , European Signal Processing Conference (EUSIPCO '10). , Aalborg, Denmark, 2010
Andrew Hines, Naomi Harte, Error Metrics for Impaired Auditory Nerve Responses of Different Phoneme Groups, Interspeech 2009, Brighton, 2009, 2009, pp1119 - 1122
Naomi Harte, Daire Lennon, and Anil Kokaram, On Parsing Visual Sequences with the Hidden Markov Model, EURASIP Journal on Image and Video Processing , Volume 2009, 2009
Craig Berry, Naomi Harte, Region of Interest Extraction using Colour Based Methods on the CUAVE Database , IET Irish Signals and Systems Conference ISSC, Dublin, 10-12 June , 2009
N Hurley and N Harte and C Fearon and S Rickard, Speech Source Separation in Hardware, 2009, -
Andrew Hines, Naomi Harte , Measurement of phonemic degradation in sensorineural hearing loss using a computational model of the auditory periphery , IET Irish Signals and Systems Conference ISSC 2009, UCD, June 10-11, 2009, pp1-6
Harte, N., Lennon, D., Kokaram, A., On parsing visual sequences with the hidden markov model, Eurasip Journal on Image and Video Processing, 2009, 2009
David Corrigan, Naomi Harte, Anil Kokaram, Pathological Motion Detection for Robust Missing Data Treatment, EURASIP Journal on Advances in Signal Processing, 2008, 2008, Article ID 542436
Action Recognition in Multimedia Streams in, editor(s)Petros Maragos, Alexandros Potamianos, Patrick Gros , Multimodal Processing and Interaction, Springer Verlag. , 2008, pp127 - 142, [Daire Lennon, Naomi Harte, and Anil Kokaram, Rozenn Dahyot, Francois Pitie]
D Lennon and N Harte and A Kokaram, Rotation detection using the curl equation, 2007 IEEE International ..., 2007
Harte, Naomi; Rankin, Andrew; Baugh, Gary; Kokaram, Anil;, Detection of Illegal Dumping from CCTV at Recycling Centres, International Machine Vision and Image Processing, International Machine Vision and Image Processing Conference, Kildare, Ireland , 2007, (5-7 Sept. ), 2007, pp204
Corrigan, David; Harte, Naomi; Kokaram, Anil;, Automated Segmentation of Torn Frames using the Graph Cuts Technique, Image Processing, IEEE International Conference on Image Processing, 2007. ICIP 2007., San Antonio, TX, USA , 2007, (Sept. 16-Oct. 19), 2007, pp557-560
D Lennon and N Harte and A Kokaram and E Doyle and ..., A hmm framework for motion based parsing for video from observational psychology, IEEE Irish Machine Vision ..., 2006
Daire Lennon, Naomi Harte, Anil Kokaram, Erika Doyle, Ray Fuller, A HMM Framework for Motion based parsing for video from Observational Psychology, IEEE Irish Machine Vision and Image Processing Conference, Irish Machine Vision and Image Processing Conference , 2006
Corrigan, D. Harte, N. and Kokaram, A. , Pathological motion detection for robust missing data treatment in degraded archived media, Image Processing, IEEE International Conference on Image Processing 2006, Atlanta, GA , 8-11 Oct. 2006 , 2006, pp621 - 624
Naomi Harte and Anil Kokaram, Automated Removal of Overshoot Artefact from Images, EUSIPCO , European Signal Processing Conference , 2006
Naomi Harte, Shahab U. Ansari, Ian Bruce, Exploiting Voicing Cues for Contrast Enhanced Frequency Shaping of Speech for Impaired Listeners, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, 14-19 May 2006 , 5, IEEE, 2006, ppV
Naomi Harte, Niall Hurley, Conor Fearon, Scott Rickard., Towards a Hardware Realization of Time-Frequency Source Separation of Speech, Proceedings of IEEE European Conference on Circuit Theory and Design, IEEE European Conference on Circuit Theory and Design, 28 Aug -2 Sept. 2005, IEEE, 2005
Ansari, S., Harte, N., and Bruce, I., , Efficiently combining improved contrast-enhancing frequency shaping and multiband compression to enhance speech intelligibility in hearing aids, Lake Ontario Auditory Neuroscience (LOAN) Meeting, Hamilton, Canada, 2005
Niall Hurley, Naomi Harte, Conor Fearon, Scott Rickard,, Blind Source Separation of Speech in Hardware, Workshop on Signal Processing Systems, Nov 2005, IEEE, 2005, pp442- 445
N.Harte, S. Bates, B. Murray, The IntelliRate Oversampling Architecture for a Gigabit Ethernet Transceiver, Proceedings of Irish Signals and Systems Conference , Irish Signals and Systems Conference , 2002
McCourt, P., Harte, N., Vaseghi, S., Discriminative Multi-Resolution Sub-Band and Segmental Phonetic Model Combination, Electronics Letters, 36, (3), 2000, p270-271
Paul McCourt, Naomi Harte, Saeed Vaseghi, Combined Temporal and Spectral Multi-Resolution Phonetic Modelling, Proc. Eurospeech, Eurospeech, Budapest, Hungary, September 5-9, 1999, 1999, pp1111-1114
McCourt, P., Harte, N., Vaseghi, S., COMBINED TEMPORAL AND SPECTRAL MULTI-RESOLUTION PHONETIC MODELLING, 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, 1999, p1111-1114
NA Harte, Segmental phonetic features and models for speech recognition., ethos.bl.uk, 1999
McMahon, P.; Harte, N.; Vaseghi, S.; McCourt, P, Discriminative spectral-temporal multiresolution features for speech recognition, IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 1999, vol.2, 1999, pp581-584
P.Hanna, N.Harte, J. Ming, S.Vaseghi, F.J.Smith, Variation of features of interframe dependent HMM for speech recognition, IEE Electronic Letters, Apr., 1998, p858-859
N.Harte, S.Vaseghi, B.Milner, Joint Recognition and Segmentation using Phonetically Derived Features and a Hybrid Phoneme Model, Proc International Conference on Spoken Language Processing, Proc International 5th International Conference on Spoken Language Processing, Sydney, Australia, Nov 30 - Dec 4, 1998
P.McCourt, S.Vaseghi, N.Harte, Multi-Resolution Cepstral Features for Phoneme Recognition Across Speech Sub-Bands, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, International Conference on Acoustics, Speech, and Signal Processing, Seattle, USA, 12-15 May 1998, 1, IEEE, 1998, pp557-560
N.Harte, S.Vaseghi, P.McCourt, A Novel Model for Phoneme Recognition using Phonetically Derived Features, Proceedings of European Signal Processing Conference (EUSIPCO), , European Signal Processing Conference (EUSIPCO), , 1998, pp1485 - 1488
Harte, N., Vaseghi, S., McCourt, P., A novel model for phoneme recognition using phonetically derived features, European Signal Processing Conference, 1998-January, 1998
Harte, N., Vaseghi, S., Milner, B., JOINT RECOGNITION AND SEGMENTATION USING PHONETICALLY DERIVED FEATURES AND A HYBRID PHONEME MODEL, 5th International Conference on Spoken Language Processing, ICSLP 1998, 1998
S Vaseghi, N Harte, B Milner, MULTI-RESOLUTION PHONETIC/SEGMENTAL FEATURES AND MODELS FOR HMM-BASED SPEECH RECOGNITION, 1997 IEEE International Conference ..., 1997
S.Vaseghi, N.Harte, B.Milner, Multi-Resolution Phonetic/Segmental Features and Models for HMM-Based Speech Recognition, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2, 1997, pp1263
N. Harte ; S. Vaseghi ; B. Milner , Dynamic features for segmental speech recognition, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, p933--
N.Harte, S.Vaseghi, B.Milner, Dynamic Features for Segmental Speech Recognition, Proc International Conference on Spoken Language Processing, International Conference on Spoken Language Processing, Philadelphia, 3-6 Oct 1996, 1996, pp933-936
Non-Peer-Reviewed Publications
Dr. Silvia Giordani, Poster Making and Presentation, TCD, Chemistry Dept, 2007
Fine-Davis, M., Welcome Address, Mental Health and the Workplace: Challenges and Opportunities, Trinity College, Dublin, 13 March, 2000
Research Expertise
Projects
- Title
- Dynamic Visual Features and Improved Audio-Visual Fusion for Automatic Speech Recognition
- Summary
- Human speech is bimodal in nature. Incorporating visual features in Automatic Speech Recognition systems can improve performance in real environments. This work addresses core challenges in audio-visual speech recognition. It will develop new dynamic visual features that better capture the correlations in key mouth movements used by humans in lipreading. This is crucial in improving Hidden Markov Model performance. It will also explore a new audio-visual fusion strategy, motivated by the differing visibility of visemes, that allows the influence of the audio and video streams to change over time.
- Funding Agency
- SFI
- Date From
- Oct. 2009
- Date To
- Sept. 2013
- Title
- Robust Speaker Verification
- Summary
- Biometrics involves the use of intrinsic physical or behavioural traits of humans to verify their identity. Traits used in biometrics typically include face, fingerprints, hand geometry, handwriting, iris, retina, vein, and voice. Many are concerned that these technologies are potentially invasive and open to fraud. Speaker verification, using voice or voice and video, has been recognised as an important alternative in the world of biometrics. It is less invasive and requires less expensive installations than iris and fingerprint authentication systems. The changes that occur in the human voice due to ageing have been well documented. The impact of these changes on speaker verification is less clear. In this work, we examine the effect of long-term vocal ageing on speaker verification systems.
- Funding Agency
- IRCSET
- Date From
- 2009
- Date To
- 2012
- Title
- Audio-Visual Fusion for Human Computer Interaction
- Summary
- This project focuses on key challenges in Audio-Visual Speech Recognition: (1) Given state-of-the-art audio and visual features, do early or late integration strategies work better? (2) How well does such an integration scheme translate to less controlled situations, where the speech is less constrained, intonation or prosody is more natural, or the speech is emotionally influenced? (3) Can these algorithms work on a real handheld device?
- Funding Agency
- IRCSET
- Date From
- 2011
- Date To
- 2014
- Title
- Speech Quality for VoIP
- Summary
- This project is developing new metrics to measure speech quality for VoIP applications, particularly Google Chrome WebRTC.
- Funding Agency
- Google Inc
- Date From
- April 2011
- Date To
- April 2012
- Title
- Advanced Metrics for Audio-Visual Signal Quality in Internet Communications
- Funding Agency
- Enterprise Ireland/Google
- Date From
- Sept 2013
- Date To
- Dec 2014
Recognition
Representations
International Expert Reviewer for Swiss National Science Foundation (SNSF)
Peer reviewing for top conferences and journals, e.g.: IEEE ICASSP, Interspeech, ACM ICMI, EUSIPCO, IEEE ASRU, IEEE ICIP, ACL, Speech Communication, JASA, IEEE Trans Multimedia
Senior Technical Program Committee for ACM ICMI
TCD Representative to MIDAS (MicroElectronics Design Association of Ireland)
Irish representative to the EU COST Action 2101 entitled "Biometrics for Identity Documents and Smart cards"
Regular Session Chair at Interspeech
Irish representative to the EU COST Action IC1006 Integrating Biometrics and Forensics for the Digital Age
ICT Evaluator for FP6 ICT Call FP6-2004-SME-COOP in Co-operative research (Research involving SMEs, Universities and research organisations). Acted as Group Rapporteur.
Expert Evaluator for FP7 Call FP7-REGIONS-2012-2013-1 in Transnational cooperation between regional research-driven clusters
PhD External Examiner, University of Cambridge
PhD External Examiner, Victoria University, New Zealand
PhD External Examiner, University of York
PhD External Examiner, Athlone Institute of Technology
PhD External Examiner, University of East Anglia
Awards and Honours
AI Awards (Shortlisted in Best Application of AI in an Academic Research Body)
Google Faculty Award
Fellow of Trinity College Dublin
Cognitec Best Student Paper Award for PhD Student Finnian Kelly, International Conference on Biometrics (ICB)
Shortlisted for Provost Teaching Award
British Telecom Research Scholarship
IEE Leslie H. Paddle Scholarship
Glen Dimplex British Council Chevening Scholarship
Awarded a Gold Medal for Distinction in Engineering upon graduation.
Maurice F. Fitzgerald Prize - first overall in the Engineering Faculty in the Degree exams.
David Clark Prize - first place in the Microelectronic and Electrical Engineering Degree exams.
Memberships
IEEE (Institute of Electrical and Electronics Engineers)
ISCA (International Speech Communication Association)
IEEE Women in Engineering
IEEE Signal Processing Society