Skip to main content

Trinity College Dublin, The University of Dublin

Trinity Menu Trinity Search



Naomi Harte
, Electronic & Elect. Engineering

Biography

Dr. Harte is an Assistant Professor in Digital Media Systems in the School of Engineering. She was appointed as an SFI Engineering Initiative Lecturer in Digital Media in 2008. Prior to returning to academia, Dr. Harte worked in high-tech start-ups in the field of DSP Systems Development, including her own company founded in 2002. She also previously worked in McMaster University in Canada.

Dr. Harte's specialist area is Human Speech Communication. Her industrial background brings a real-world approach to her research. Her work involves the design and application of mathematical algorithms to enhance or augment speech communication between humans and technology. Since her appointment, she has established a strong international reputation in the speech processing community.

Dr. Harte's research simultaneously represents academic excellence and industrial relevance. She has published over 60 peer reviewed papers in her specialist areas. For the past two years, Dr. Harte has been involved in a major collaboration with Google Chrome and YouTube, leading to multiple patent applications and publications.

Publications and Further Research Outputs

Peer-Reviewed Publications

Clark, L. and Cowan, B.R. and Edwards, J. and Edlund, J. and Szekely, E. and Munteanu, C. and Murad, C. and Healey, P. and Aylett, M. and Harte, N. and Torre, I. and Moore, R.K. and Doyle, P., Mapping theoretical and methodological perspectives for understanding speech interface interactions, CHI EA '19 Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems , (3299009), 2019 Conference Paper, 2019 DOI

Roddy, M. and Skantze, G. and Harte, N., Multimodal continuous turn-taking prediction using multiscale Rnns, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction, 2018, pp186-190 Conference Paper, 2018 DOI

Sterpu, G. and Saam, C. and Harte, N., Can DNNs Learn to Lipread Full Sentences?, 2018 25th IEEE International Conference on Image Processing (ICIP), (8451388), 2018, pp16-20 Conference Paper, 2018 DOI

O'Reilly, C. and Analuddin, K. and Kelly, D.J. and Harte, N., Measuring vocal difference in bird population pairs, Journal of the Acoustical Society of America, 143, (3), 2018, p1658-1671 Journal Article, 2018 DOI

Wissam A. Jassim and Naomi Harte, Voice Activity Detection Using Neurograms, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Alberta, Canada, 15-20 April 2018, 2018 Conference Paper, 2018

Cullen, A. and Harte, N., A longitudinal database of Irish political speech with annotations of speaker ability, Language Resources and Evaluation, 52, (2), 2018, p401-432 Journal Article, 2018 DOI

Laura Dungan, Ali Karaali, Naomi Harte, The Impact Of Reduced Video Quality On Visual Speech Recognition, IEEE International Conference on Image Processing, 2018 Conference Paper, 2018 DOI

Torre, I. and Carrigan, E. and McCabe, K. and McDonnell, R. and Harte, N., Survival at the museum: A cooperation experiment with emotionally expressive virtual characters, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction , 2018, pp423-427 Conference Paper, 2018 DOI

CJ Edmonds and N Harte and M Gardner, How does drinking water affect attention and memory? The effect of mouth rinsing and mouth drying on children's performance, Physiology \& behavior, 2018 Journal Article, 2018

Cullen, A. and Hines, A. and Harte, N., Perception and prediction of speaker appeal â€" A single speaker study, Computer Speech and Language, 52, 2018, p23-40 Journal Article, 2018 DOI

Roddy, M. and Skantze, G. and Harte, N., Investigating speech features for continuous turn-taking prediction using LSTMs, Proc. Interspeech 2018, Interspeech 2018, 2018-September, 2018, pp586-590 Conference Paper, 2018 DOI

Dungan, L. and Karaali, A. and Harte, N., The impact of reduced video quality on visual speech recognition, 2018 25th IEEE International Conference on Image Processing (ICIP), 2018 25th IEEE International Conference on Image Processing (ICIP), (8451754), 2018, pp2560-2564 Conference Paper, 2018 DOI

G Sterpu and N Harte, Towards Lipreading Sentences with Active Appearance Models, arXiv preprint arXiv:1805.11688, 2018 Journal Article, 2018

Sterpu, G. and Saam, C. and Harte, N., Attention-based audio-visual fusion for robust automatic speech recognition, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction , 20th ACM International Conference on Multimodal Interaction , 2018, pp111-115 Conference Paper, 2018 DOI

V Leong and E Byrne and K Clackson and N Harte and S Lam and ..., Infants' neural oscillatory processing of theta-rate speech patterns exceeds adults', bioRxiv, 2017 Journal Article, 2017

O'Reilly, C. and Kokuer, M. and Jancovic, P. and Drennan, R. and Harte, N., Automatic frequency feature extraction for bird species delimitation, 2017-January, (8081511), 2017, pp1759-1763 Conference Paper, 2017 DOI

Jassim, W.A. and Paramesran, R. and Harte, N., Speech emotion classification using combined neurogram and INTERSPEECH 2010 paralinguistic challenge features, IET Signal Processing, 11, (5), 2017, p587-595 Journal Article, 2017 DOI

A Cullen and N Harte, Thin slicing to predict viewer impressions of TED Talks, 
 of the 14th International Conference on 
, 2017 Journal Article, 2017

Sloan, C. and Harte, N. and Kelly, D. and Kokaram, A.C. and Hines, A., Objective Assessment of Perceptual Audio Quality Using ViSQOLAudio, IEEE Transactions on Broadcasting, 63, (4), 2017, p693-705 Journal Article, 2017 DOI

C O'Reilly and N Harte, Pitch tracking of bird vocalizations and an automated process using YIN-bird, Cogent Biology, 2017 Journal Article, 2017

Roddy, M. and Harte, N., Detecting conversational gaze aversion using unsupervised learning, 2017-January, (8081172), 2017, pp76-80 Conference Paper, 2017 DOI

Hines A, Skoglund J, Kokaram A.C, Harte N, Monitoring voip speech quality for chopped and clipped speech, Komunikacie, 18, (1), 2016, p3 - 10 Journal Article, 2016 URL

Sloan C, Harte N, Kelly D, Kokaram A.C, Hines A, Bitrate classification of twice-encoded audio using objective quality features, 2016 8th International Conference on Quality of Multimedia Experience, QoMEX 2016, 2016, 2016, pp7498956- Conference Paper, 2016 URL DOI

O'Reilly C, Marples N.M, Kelly D.J, Harte N, YIN-bird: Improved pitch tracking for bird vocalisations, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2016, 08-12-September-2016, 2016, pp2641 - 2645 Conference Paper, 2016 DOI URL

AJ Hines and J Skoglund and N Harte and A Kokaram, Detection of chopped speech, US Patent 9,263,061, 2016 Journal Article, 2016

Hines A, Gillen E, Kelly D, Skoglund J, Kokaram A, Harte N, ViSQOLAudio: An objective audio quality metric for low bitrate codecs, Journal of the Acoustical Society of America, 137, (6), 2015, pEL449 - EL455 Journal Article, 2015 DOI URL

Harte N, Gillen E, TCD-TIMIT: An audio-visual corpus of continuous speech, IEEE Transactions on Multimedia, 17, (5), 2015, p603 - 615 Journal Article, 2015 URL DOI

Hines A, Skoglund J, Kokaram A.C, Harte N, ViSQOL: an objective speech quality model, Eurasip Journal on Audio, Speech, and Music Processing, 2015, (1), 2015, p13- Journal Article, 2015 URL DOI TARA - Full Text

Kelly F, Harte N, Forensic comparison of ageing voices from automatic and auditory perspectives, International Journal of Speech, Language and the Law, 22, (2), 2015, p167 - 202 Journal Article, 2015 DOI URL

Hines A, Gillen E, Harte N, Measuring and monitoring speech quality for voice over IP with POLQA, ViSQOL and P.563, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2015, 2015-January, 2015, pp438 - 442 Conference Paper, 2015 URL

Harte N, Gillen E, Hines A, TCD-VoIP, a research database of degraded speech for assessing quality in VoIP applications, 7th International Workshop on Quality of Multimedia Experience, QoMEX 2015, 26-29 May 2015 , IEEE, 2015, 7148100- Conference Paper, 2015 DOI

C. O'Reilly, D. J. Kelly, N. M. Marples and N. Harte , Quantifying difference in vocalizations of bird populations, Proceedings of Interspeech 2015, 2015, 2015, p3417 - 3421 Journal Article, 2015

Pitie, F., Kelly, D., Foucu, T., Harte, N., Kokaram, A. , Assessment of Audio/Video synchronisation in streaming media, 2014 6th International Workshop on Quality of Multimedia Experience, QoMEX 2014, 2014 6th International Workshop on Quality of Multimedia Experience, QoMEX 2014, 2014, pp171-176 Conference Paper, 2014 DOI

Francois Pitie and Damien Kelly and Thierry Foucu and Naomi Harte and Anil C. Kokaram , Assessment of Audio/Video synchronisation in streaming media., International Workshop on Quality of Multimedia Experience, Singapore, 2014, pp171 - 176 Conference Paper, 2014

Cullen, Ailbhe, Hines, Andrew and Harte, Naomi, Building a Database of Political Speech - Does culture matter in charisma annotations? , 1 4th International Workshop on Audio/Visual Emotion Challenge, AVEC 2014, AVEC'14: 4th International Audio/Visual Emotion Challenge and Workshop., Orlando, FL., 2014, pp27 - 31 Conference Paper, 2014 DOI

Andrew Hines, Eoin Gillen, Jan Skoglund, Damien Kelly, Anil Kokaram and Naomi Harte, Perceived Audio Quality for Streaming Stereo Music. , ACM Multimedia, Orlando, FL, USA, 2014, pp1173 - 1176 Conference Paper, 2014 DOI

Finnian Kelly, Rahim Saeidi, Naomi Harte, David van Leeuwen, Effect of long-term ageing on i-vector speaker verification, Computer Speech & Language, InterSpeech, Singapore, 2014, pp1068 - 1084 Conference Paper, 2014

Finnian Kelly and Naomi Harte, Auditory detectability of vocal ageing and its effect on forensic automatic speaker recognition, InterSpeech, Lyon, France, 2013, pp2846 - 2850 Conference Paper, 2013

Sooknanan, Ken, Doyle, Jennifer, Wilson, James, Harte, Naomi, Kokaram, Anil and Corrigan, David, Mosaics For Burrow Detection in Underwater Surveillance Video, IEEE Oceans 2013, San Diego, USA, 2013, pp9 - 12 Conference Paper, 2013

Kelly, Finnian and Harte, Naomi in, editor(s)Michael Fairhurst , Age Factors in Biometric Processing, IET, 2013, [Kelly, Finnian and Harte, Naomi] Book Chapter, 2013

A Hines, J Skoglund, A Kokaram, N Harte, Monitoring the Effects of Temporal Clipping on VoIP Speech Quality, Interspeech, Lyon, France, 2013, 2013, pp1188 - 1192 Conference Paper, 2013

Ailbhe Cullen, John Kane, Thomas Drugman, and Naomi Harte , Creaky Voice and the Classification of Affect, Workshop on Affective Social Speech Signals (WASSS), Grenoble, France, 2013 Conference Paper, 2013 DOI

Finnian Kelly, Niko Brummer and Naomi Harte, Eigenageing Compensation for Speaker Verification. , InterSpeech , Lyon, France, 2013, pp1624 - 1628 Conference Paper, 2013

K Pan and F Kelly and N Harte and N Harte and S Murphy and DJ Kelly and ..., Shape Models for Image Segmentation in Microscopy, mee.tcd.ie, 2013 Book, 2013

Harte, Naomi, Murphy, Sadhbh, Kelly, David J. and Marples, Nicola M., Identifying new bird species from differences in birdsong. , INTERSPEECH, Lyon France., 2013, pp2900-2904 Conference Paper, 2013

F Kelly and N Harte and M Fairhurst, The impact of ageing on speech-based biometric systems, Age Factors in Biometric Processing, 2013 Journal Article, 2013

A Hines, J Skoglund, A Kokaram, N Harte, Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA, IEEE International Conference on Acoustics, Speech, and Signal Processing, Vancouver, Canada, 2013, pp3697 - 3701 Conference Paper, 2013

Ailbhe Cullen and Naomi Harte, Late Integration of Features for Acoustic Emotion Recognition, European Signal Processing Conference (EUSIPCO)., 2013, pp1 - 5 Conference Paper, 2013

Finnian Kelly, Andrzej Drygajlo and Naomi Harte , Speaker verification in score-ageing-quality classification space, Computer Speech & Language, 27, (5), 2013, p1068-1084 Journal Article, 2013

Hines, A., Skoglund, J., Kokaram, A., Harte, N. , Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2013, pp3697-3701 Conference Paper, 2013 DOI

Andrew Hines, Naomi Harte, Improved Speech Intelligibility with a Chimaera Hearing Aid Algorithm, Interspeech, Portland, OR, ISCA, 2012, pp1 - 4 Conference Paper, 2012

Corrigan, D. ; Kokaram, A. ; Harte, N. , Algorithms for the Digital Restoration of Torn Film , Image Processing, IEEE Transactions on, 21, (2), 2012, p573-587 Journal Article, 2012 DOI

L Cappelletta and N Harte, Non Phonetic Viseme Definition for Visual-Only Speech Recognition, 2012, - Miscellaneous, 2012

Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte, ViSQOL: The Virtual Speech Quality Objective Listener, The International Workshop on Acoustic Signal Enhancement (IWAENC), Aachen, Germany, 4-6 Sept. 2012, 2012, pp1 - 4 Conference Paper, 2012 TARA - Full Text

K. Sooknanan, A. Kokaram, D. Corrigan, G. Baugh, N. Harte and J. Wilson, Indexing and Selection of Well-Lit Details in Underwater Video Mosaics Using Vignetting Estimation, Program Book - OCEANS 2012 MTS/IEEE Yeosu: The Living Ocean and Coast - Diversity of Resources and Sustainable Activities, International OCEANS Conference, Yeosu, South Korea, May, IEEE, 2012, ppArticle number 6263541 Conference Paper, 2012 DOI

K. Sooknanan, A. Kokaram, D. Corrigan, G. Baugh, J. Wilson and N. Harte , Improving Underwater Visibility Using Vignetting Correction, Proceedings of SPIE - The International Society for Optical Engineering, Visual Information Processing and Communication, Burlingame, California, USA, January, 8305, SPIE, 2012, ppArticle number 83050M Conference Paper, 2012 DOI

F. Kelly , A. Drygajlo and N. Harte, Speaker Verification with Long-Term Ageing Data , International Conference on Biometrics (ICB), New Delhi, 2012, pp478 - 483 Conference Paper, 2012

Andrew Hines, Naomi Harte, Speech Intelligibility prediction using a Neurogram Similarity Index Measure, Speech Communication, 54, (2), 2012, p306-320 Journal Article, 2012 TARA - Full Text

Andrew Hines and Naomi Harte , Simulated performance intensity functions , Engineering in Medicine and Biology Society Conference (EMBC), EMBS (IEEE). , 2011, pp7139 - 7142 Conference Paper, 2011

Andrew Hines and Naomi Harte, Comparing hearing aid algorithm performance using Simulated Performance Intensity Functions , Speech perception and auditory disorders, Int. Symposium on Audiological and Auditory Research (ISAAR), 2011 Conference Paper, 2011

C Berry and A Kokaram and N Harte, An extended multiresolution approach to mouth specific aam fitting for speech recognition, 2011 19th European Signal 
, 2011 Journal Article, 2011

Craig Berry, Anil Kokaram, Naomi Harte, An Extended Multiresolution Approach to Mouth Specific AAM Fitting for Speech Recognition. , European Signal Processing Conference (Eusipco), 2011 Conference Paper, 2011 DOI

Finnian Kelly, Naomi Harte, Effects of Long-Term Ageing on Speaker Verification, Proceedings of the COST 2101 European conference on Biometrics and ID management, Springer-Verlag, 2011, pp113--124 Conference Paper, 2011

Luca Cappelletta and Naomi Harte, Viseme Definitions Comparison for Visual-Only Speech Recognition, European Signal Processing Conference (Eusipco), 2011, pp2109 - 2113 Conference Paper, 2011

Luca Cappelletta and Naomi Harte, Nostril Detection for Robust Mouth Tracking, Irish Signals and Systems Conference, Cork, Ireland, 2010, pp239 - 244 Conference Paper, 2010

Finnian Kelly and Naomi Harte, A Comparison of Auditory Features for Robust Speech Recognition. , European Signal Processing Conference (EUSIPCO 2010). , Aalborg, Denmark, August 2010, 2010 Conference Paper, 2010 DOI

Finnian Kelly and Naomi Harte, Auditory Features Revisited for Robust Speech Recognition. , International Conference on Pattern Recognition (ICPR). , Istanbul, Turkey, Aug 2010, 2010, pp4456 - 4459 Conference Paper, 2010

Andrew Hines and Naomi Harte, Speech intelligibility from image processing, Speech Communication, 52, (9), 2010, p736 - 752 Journal Article, 2010

Finnian Kelly and Naomi Harte, Training GMMs for Speaker Verification. , IET Irish Signals and Systems Conference, Cork, Ireland, June 2010, 2010, pp163 - 168 Conference Paper, 2010

Andrew Hines and Naomi Harte, Evaluating Sensorineural Hearing Loss With An Auditory Nerve Model Using A Mean Structural Similarity Measure. , European Signal Processing Conference (EUSIPCO '10). , Aalborg, Denmark, 2010 Conference Paper, 2010 TARA - Full Text

K Finnian and N Harte, A comparison of auditory features for robust speech recognition, presentation, 18th European Signal Processing 
, 2010 Journal Article, 2010

Naomi Harte, Daire Lennon, and Anil Kokaram, On Parsing Visual Sequences with the Hidden Markov Model, EURASIP Journal on Image and Video Processing , Volume 2009, 2009 Journal Article, 2009 DOI

Andrew Hines, Naomi Harte, Error Metrics for Impaired Auditory Nerve Responses of Different Phoneme Groups, Interspeech 2009, Brighton, 2009, 2009, pp1119 - 1122 Conference Paper, 2009 TARA - Full Text

Craig Berry, Naomi Harte, Region of Interest Extraction using Colour Based Methods on the CUAVE Database , IET Irish Signals and Systems Conference ISSC, Dublin, 10-12 June , 2009 Conference Paper, 2009 TARA - Full Text

Andrew Hines, Naomi Harte , Measurement of phonemic degradation in sensorineural hearing loss using a computational model of the auditory periphery , IET Irish Signals and Systems Conference ISSC 2009, UCD, June 10-11, 2009, pp1-6 Conference Paper, 2009 URL TARA - Full Text

David Corrigan, Naomi Harte, Anil Kokaram, Pathological Motion Detection for Robust Missing Data Treatment, EURASIP Journal on Advances in Signal Processing, 2008, 2008, pArticle ID 542436 Journal Article, 2008 DOI

Action Recognition in Multimedia Streams in, editor(s)Petros Maragos, Alexandros Potamianos, Patrick Gros , Multimodal Processing and Interaction, Springer Verlag. , 2008, pp127 - 142, [Daire Lennon, Naomi Harte, and Anil Kokaram, Rozenn Dahyot, Francois Pitie] Book Chapter, 2008

Corrigan, David; Harte, Naomi; Kokaram, Anil;, Automated Segmentation of Torn Frames using the Graph Cuts Technique, Image Processing, IEEE International Conference on Image Processing, 2007. ICIP 2007., San Antonio, TX, USA , 2007, (Sept. 16-Oct. 19), 2007, pp557-560 Conference Paper, 2007 DOI TARA - Full Text URL

Harte, Naomi; Rankin, Andrew; Baugh, Gary; Kokaram, Anil;, Detection of Illegal Dumping from CCTV at Recycling Centres, International Machine Vision and Image Processing, International Machine Vision and Image Processing Conference, Kildare, Ireland , 2007, (5-7 Sept. ), 2007, pp204 Conference Paper, 2007 URL TARA - Full Text

D Lennon and N Harte and A Kokaram, Rotation detection using the curl equation, 2007 IEEE International 
, 2007 Journal Article, 2007

D Lennon and N Harte and A Kokaram and E Doyle and ..., A hmm framework for motion based parsing for video from observational psychology, IEEE Irish Machine Vision 
, 2006 Journal Article, 2006

Corrigan, D. Harte, N. and Kokaram, A. , Pathological motion detection for robust missing data treatment in degraded archived media, Image Processing, IEEE International Conference on Image Processing 2006, Atlanta, GA , 8-11 Oct. 2006 , 2006, pp621 - 624 Conference Paper, 2006 DOI TARA - Full Text URL

Naomi Harte and Anil Kokaram, Automated Removal of Overshoot Artefact from Images, EUSIPCO , European Signal Processing Conference , 2006 Conference Paper, 2006 URL

Naomi Harte, Shahab U. Ansari, Ian Bruce, Exploiting Voicing Cues for Contrast Enhanced Frequency Shaping of Speech for Impaired Listeners, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, 14-19 May 2006 , 5, IEEE, 2006, ppV Conference Paper, 2006 DOI TARA - Full Text URL

Daire Lennon, Naomi Harte, Anil Kokaram, Erika Doyle, Ray Fuller, A HMM Framework for Motion based parsing for video from Observational Psychology, IEEE Irish Machine Vision and Image Processing Conference, Irish Machine Vision and Image Processing Conference , 2006 Conference Paper, 2006 TARA - Full Text URL

Ansari, S., Harte, N., and Bruce, I., , Efficiently combining improved contrast-enhancing frequency shaping and multiband compression to enhance speech intelligibility in hearing aids, Lake Ontario Auditory Neuroscience (LOAN) Meeting, Hamilton, Canada, 2005 Conference Paper, 2005

Naomi Harte, Niall Hurley, Conor Fearon, Scott Rickard., Towards a Hardware Realization of Time-Frequency Source Separation of Speech, Proceedings of IEEE European Conference on Circuit Theory and Design, IEEE European Conference on Circuit Theory and Design, 28 Aug -2 Sept. 2005, IEEE, 2005 Conference Paper, 2005 TARA - Full Text DOI URL

Niall Hurley, Naomi Harte, Conor Fearon, Scott Rickard,, Blind Source Separation of Speech in Hardware, Workshop on Signal Processing Systems, Nov 2005, IEEE, 2005, pp442- 445 Conference Paper, 2005 TARA - Full Text URL DOI

N.Harte, S. Bates, B. Murray, The IntelliRate Oversampling Architecture for a Gigabit Ethernet Transceiver, Proceedings of Irish Signals and Systems Conference , Irish Signals and Systems Conference , 2002 Conference Paper, 2002

McCourt, P. Harte, N. Vaseghi, S. , Discriminitive Multi-Resolution Sub-Band and Segmental Phonetic Model Combination, Electronics Letters, 36, (3), 2000, p270-271 Journal Article, 2000 URL TARA - Full Text DOI

Paul McCourt, Naomi Harte, Saeed Vaseghi, Combined Temporal and Spectral Multi-Resolution Phonetic Modelling, Proc. Eurospeech, Eurospeech, Budapest, Hungary, September 5-9, 1999, 1999, pp1111-1114 Conference Paper, 1999 URL

NA Harte, Segmental phonetic features and models for speech recognition., ethos.bl.uk, 1999 Book, 1999

McMahon, P.; Harte, N.; Vaseghi, S.; McCourt, P, Discriminative spectral-temporal multiresolution features for speech recognition, IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 1999, vol.2, 1999, pppp.581-584 Conference Paper, 1999

N.Harte, S.Vaseghi, P.McCourt, A Novel Model for Phoneme Recognition using Phonetically Derived Features, Proceedings of European Signal Processing Conference (EUSIPCO), , European Signal Processing Conference (EUSIPCO), , 1998, pp1485 - 1488 Conference Paper, 1998

P.McCourt, S.Vaseghi, N.Harte, Multi-Resolution Cepstral Features for Phoneme Recognition Across Speech Sub-Bands, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, International Conference on Acoustics, Speech, and Signal Processing, Seattle, USA, 12-15 May 1998, 1, IEEE, 1998, pp557-560 Conference Paper, 1998 TARA - Full Text DOI URL

P.Hanna, N.Harte, J. Ming, S.Vaseghi, F.J.Smith, Variation of features of interframe dependent HMM for speech recognition, IEE Electronic Letters, Apr., 1998, p858-859 Journal Article, 1998 URL TARA - Full Text

N.Harte, S.Vaseghi, B.Milner, Joint Recognition and Segmentation using Phonetically Derived Features and a Hybrid Phoneme Model, Proc International Conference on Spoken Language Processing, Proc International 5th International Conference on Spoken Language Processing, Sydney, Australia, Nov 30 - Dec 4, 1998 Conference Paper, 1998 URL

S.Vaseghi, N.Harte, B.Milner, Multi-Resolution Phonetic/Segmental Features and Models for HMM-Based Speech Recognition, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2, 1997, pp1263 Conference Paper, 1997 URL

SVNHB Milner, MULTI-RESOLUTION PHONETIC/SEGMENTAL FEATURES AND MODELS FOR HMM-BASED SPEECH RECOGNITION, 1997 IEEE International Conference 
, 1997 Journal Article, 1997

SVNHB Milner, MULTI-RESOLUTION PHONETIC/SEGMENTAL FEATURES AND MODELS FOR HMM-BASED SPEECH RECOGNITION, 1997 IEEE International Conference 
, 1997 Journal Article, 1997

N.Harte, S.Vaseghi, B.Milner, Dynamic Features for Segmental Speech Recognition, Proc International Conference on Spoken Language Processing, International Conference on Spoken Language Processing, Philadelphia, 3-6 Oct 1996, 1996, pp933-936 Conference Paper, 1996 DOI TARA - Full Text URL

M Roddy and N Harte, Towards predicting dialog acts from previous speakers' non-verbal cues, mmsym.org Journal Article,

N Harte and P Jancovic, Interspeech 2016 Special Session on Bird and Animal Vocalisations Organisers, mee.tcd.ie Journal Article,

NHSV BenMilner, DYNAMIC FEATURES FOR SEGMENTAL SPEECH RECOGNITION., asel.udel.edu Journal Article,

D Bailey and J Barron and A Bouridane and D Braggins and ..., IMVIP 2009, ieeexplore.ieee.org Journal Article,

A Hines and N Harte, Reproduction of the Performance/Intensity Function using image processing and an auditory nerve computational model, mee.tcd.ie Journal Article,

N Hurley and N Harte and C Fearon and S Rickard, Speech Source Separation in Hardware, - Miscellaneous,

NHSV BenMilner, DYNAMIC FEATURES FOR SEGMENTAL SPEECH RECOGNITION., asel.udel.edu Journal Article,

Non-Peer-Reviewed Publications

Dr. Silvia Giordani, Poster Making and Presentation, TCD, Chemistry Dept, 2007 Poster, 2007

John Toland, Gordon Stein, ed., Unbelief in the Enlightenment, Prometheus Press, 1985, pp666 - 670, [D. Berman] Item in dictionary or encyclopaedia, etc, 1985

Research Expertise

Projects

  • Title
    • Dynamic Visual Features and Improved Audio-Visual Fusion for Automatic Speech Recognition
  • Summary
    • Human speech is bimodal in nature. Incorporating visual features in Automatic Speech Recognition systems can improve performance in real environments. This work addresses core challenges in audio-visual speech recognition. It will develop new dynamic visual features that better capture the correlations in key mouth movements used by humans in lipreading. This is crucial in improving Hidden Markov Model performance. It will explore a new audio-fusion strategy motivated by the differing visibility of visemes allowing the influence of the audio and video stream to change over time.
  • Funding Agency
    • SFI
  • Date From
    • Oct. 2009
  • Date To
    • Sept. 2013
  • Title
    • Robust Speaker Verification
  • Summary
    • Biometrics involves the use of intrinsic physical or behavioural traits of humans to verify their identity. Traits used in biometrics typically include face, fingerprints, hand geometry, handwriting, iris, retinal, vein, and voice. Many are concerned that these technologies are potentially invasive and open to fraud. Speaker verification, using voice or voice and video, has been recognised as an important alternative in the world of biometrics. It is less invasive and requires less expensive installations that iris and fingerprint authentication systems. The changes that occur in the human voice due to ageing have been well documented. The impact of these changes on speaker verification is less clear. In this work, we examine the effect of long-term vocal ageing on a speaker verification systems.
  • Funding Agency
    • IRCSET
  • Date From
    • 2009
  • Date To
    • 2012
  • Title
    • Audio-Visual Fusion for Human Computer Interaction.
  • Summary
    • This project will thus focus on key challenges in Audio Visual Speech Recognition: . Given state of the art audio and visual features, do early or late integration strategies work better? . How well does such an integration scheme translate to less controlled situations, where the speech is less constrained, intonation or prosody is more natural, or the speech is emotionally influenced? . Can these algorithms work on a real handheld device?
  • Funding Agency
    • IRCSET
  • Date From
    • 2011
  • Date To
    • 2014
  • Title
    • Speech Quality for VoIP
  • Summary
    • This project is developing new metrics to measure speech quality for VoIP applications, particularly Google Chrome WebRTC
  • Funding Agency
    • Google Inc
  • Date From
    • April 2011
  • Date To
    • April 2012
  • Title
    • Advanced Metrics for Audio-Visual Signal Quality in Internet Communications
  • Funding Agency
    • Enterprise Ireland/Google
  • Date From
    • Sept 2013
  • Date To
    • Dec 2014

Keywords

Audio-visual speech processing; Birdsong Analysis; Emotion in Speech; Human-Computer Interaction; Information/Communication Systems; Multimedia; Signal Processing; Speaker Recognition; SPEECH; Speech Biometrics; Speech processing/technology; Speech Quality; SPEECH RECOGNITION

Recognition

Representations

. TCD Representative to MIDAS (MicroElectronics Design Association of Ireland) . Irish representative to the EU COST Action 2101 entitled "Biometrics for Identity Documents and Smart cards" . Irish representative to the EU COST Action IC1006 Integrating Biometrics and Forensics for the Digital Age . ICT Evaluator for FP6 ICT Call FP6-2004-SME-COOP in Co-operative research (Research involving SMEs, Universities and research organisations). Acted as Group Rapporteur. . Expert Evaluator for FP7 Call FP7-REGIONS-2012-2013-1 in Transnational cooperation between regional research-driven clusters (Feb & March 2012)

Awards and Honours

British Telecom Research Scholarship 1997-1999

IEE Leslie H. Paddle Scholarship 1995-1998

Glen Dimplex British Council Chevening Scholarship 1995-1996

Awarded a Gold Medal for Distinction in Engineering upon graduation. 1995

Maurice F. Fitzgerald Prize - first overall in the Engineering Faculty in the Degree exams. 1995

David Clark Prize - first place in the Microelectronic and Electrical Engineering Degree exams. 1995

Memberships

IEEE (Institute of Electrical and Electronics Engineers) ISCA (International Speech Communication Association) EURASIP (European Association for Signal Processing)