Elaine Uí Dhonnchadha
Assistant Professor/Head of Discipline, C.L.C.S.

Biography

Dr Elaine Uí Dhonnchadha is Assistant Professor in Computational Linguistics and Head of Department for the Centre for Language and Communication Studies. Her research interests include corpus linguistics, morphology and syntax, and the development of language processing tools for Irish, e.g. morphological analysers and generators, rule-based part-of-speech taggers, chunkers and parsers, and machine translation language transfer rules. Such language-specific rule-based resources are essential components of many language technology applications including search engines, spelling and grammar checkers, speech synthesis and speech recognition, computer-aided language learning, machine translation systems, and many others. Prior to joining Trinity College, Elaine worked as a researcher in Institiúid Teangeolaíochta Éireann (The Linguistics Institute of Ireland), and as a Systems Analyst and Programmer in a number of software development companies.

Publications and Further Research Outputs

Peer-Reviewed Publications

Uí Dhonnchadha, E; Scannell, K; Ó hUiginn, R; Ní Mhearraí, E; Nic Mhaoláin, M; Ó Raghallaigh, B; Toner, G; Mac Mathúna, S; D'Auria, D; Ní Ghallchobhair, E; O'Leary, N, Corpas na Gaeilge 1882-1926: Integrating Historical and Modern Irish Texts, LREC 2014 Workshop LRT4HDA: Language Resources and Technologies for Processing and Linking Historical Documents and Archives - Deploying Linked Open Data in Cultural Heritage, Reykjavik, Iceland, May, 2014, edited by Kristín Bjarnadóttir, 2014, pp12-18 Conference Paper, 2014

Quizzes on Tap: Exporting a Test Generation System from One Less-Resourced Language to Another in, editor(s) Vetulani, Zygmunt; Mariani, Joseph, Human Language Technology Challenges for Computer Science and Linguistics, Springer, 2014, pp502-514, [Montse Maritxalar, Elaine Uí Dhonnchadha, Jennifer Foster, and Monica Ward] Book Chapter, 2014

Judge, J., Ní Chasaide, A., Ní Dhubhda, R., Scannell, K., Uí Dhonnchadha, E., The Irish Language in the Digital Age: An Ghaeilge sa Ré Dhigiteach, Heidelberg, Springer, 2012 Book, 2012

Teresa Lynn, Jennifer Foster, Mark Dras and Elaine Uí Dhonnchadha, Active Learning and the Irish Treebank, Proceedings of the Australasian Language Technology Association Workshop 2012, ALTA 2012: Australasian Language Technology Workshop, Dunedin, New Zealand, 4-6 December 2012, edited by Paul Cook and Scott Nowson, 10, 2012, pp23-32 Conference Paper, 2012

Uí Dhonnchadha, E., Frenda, A., Vaughan, B., Issues in Designing a Corpus of Spoken Irish, LREC-2012: SALTMIL-AfLaT Workshop on "Language technology for normalisation of less-resourced languages", Istanbul, May 2012, edited by G. De Pauw, G-M de Schryver, M. Forcada, K. Sarasola, F. Tyers, P. Waiganjo Wagacha, 2012, pp1-6 Conference Paper, 2012

Lynn, Teresa; Cetinoglu, Ozlem; Foster, Jennifer; Uí Dhonnchadha, Elaine; Dras, Mark and van Genabith, Josef, Irish Treebanking and Parsing: A Preliminary Evaluation, Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), LREC 2012, Istanbul, May 2012, edited by Nicoletta Calzolari et al, 2012, pp1939-1946 Conference Paper, 2012

Frenda, A; Uí Dhonnchadha, E; Welby, P., Not missing the bád: a spoken language corpus for Irish, Tionól 2011, Dublin Institute for Advanced Studies, Dublin, November 18-19, 2011, 2011 Conference Paper, 2011

Welby, P., Frenda, A., Uí Dhonnchadha, E., Talkin' 'bout a revolution: Irish spontaneous speech corpora and phonetic analysis, 14th International Congress of Celtic Studies, NUI Maynooth, Ireland, 1-5 August 2011, 2011 Conference Paper, 2011

Uí Dhonnchadha, E and Van Genabith, J, Partial Dependency Parsing for Irish, LREC2010: Language Resources and Evaluation Conference, Malta, 17-23 May 2010, 2010 Conference Paper, 2010

Scaling an Irish FST morphology engine for use on unrestricted text in, editor(s) A. Yli-Jyrä, L. Karttunen, J. Karhumäki, Lecture Notes in Artificial Intelligence (LNAI): Proceedings of the FSMNLP 2005 Finite-State Methods in Natural Language Processing, Berlin, Springer-Verlag, 2006, pp247-258, [Uí Dhonnchadha, E., van Genabith, J.] Book Chapter, 2006

Uí Dhonnchadha, E., Part-of-Speech Tagging and Partial Parsing for Irish using Finite-State Transducers and Constraint Grammar, DCU, 2009 Thesis, 2009

Uí Dhonnchadha, E & Van Genabith, J., A Part-of-Speech Tagger for Irish using Finite State Morphology and Constraint Grammar Disambiguation, LREC 2006, Genoa, May, 2006 Conference Paper, 2006

Kilgarriff, A., Rundell, M., Uí Dhonnchadha, E., Efficient corpus creation for lexicography, Language Resources and Evaluation Journal, 40, (2), 2006 Journal Article, 2006

Keogh, K., Koller, T., Uí Dhonnchadha, E., van Genabith, J., Ward, M., CL for CALL in the Primary School, Proc. eLearning for Computational Linguistics and Computational Linguistics for eLearning, COLING 2004 Workshop: eLearning for Computational Linguistics and Computational Linguistics for eLearning, Geneva, 2004 Conference Paper, 2004

Kilgarriff, A., Rychly, P., Chu-Ren, H., Smith, S., Tugwell, D., Uí Dhonnchadha, E, Word sketches for Irish and Chinese, Corpus Linguistics 2005, Birmingham, July, 2005 Conference Paper, 2005

Kilgarriff, A., Rundell, M., Uí Dhonnchadha, E., Corpus creation for lexicography, Proceedings of AsiaLex 2005, AsiaLex 2005, Singapore, Nov, 2005 Conference Paper, 2005

Uí Dhonnchadha, E., Van Genabith, J., Nic Pháidín, C., Design, Implementation and Evaluation of an Inflectional Morphology Finite-State Transducer for Irish, Machine Translation Journal, 2003 Journal Article, 2003

Uí Dhonnchadha, E., Finite-State Morphology and Irish, Proceedings of the Workshop on Finite-State Methods in Natural Language Processing, 10th Conference of the European Chapter of the Association for Computational Linguistics, Budapest, 2003 Conference Paper, 2003

Uí Dhonnchadha, E, Two-level Finite-State Morphology for Irish, LREC 2002 3rd International Conference on Language Resources and Evaluation, Gran Canaria, 2002 Conference Paper, 2002

Kilgarriff, A., Rundell, M., Uí Dhonnchadha, E., Efficient corpus development for lexicography: Building the New Corpus for Ireland, Language Resources and Evaluation, 40, (2), 2006, pp127-152 Journal Article, 2006

Ailbhe Ní Chasaide, Neasa Ní Chiaráin, Christoph Wendler, Harald Berthelsen, Amelia Kelly, Emer Gilmartin, Elaine Ní Dhonnchadha, Christer Gobl, Towards personalised, synthesis-based content in Irish (Gaelic) language education, SLaTE2011, Venice, Italy, 2011, pp4 Conference Paper, 2011

O'Regan, J., Scannell, K. and Uí Dhonnchadha, E., lemonGAWN: WordNet Gaeilge as Linked Data, Proceedings of the LREC 2016 Workshop "LDL 2016 - 5th Workshop on Linked Data in Linguistics: Managing, Building and Using Linked Language Resources", LREC - LDL 2016 5th Workshop on Linked Data in Linguistics: Managing, Building and Using Linked Language Resources, Portorož, Slovenia, 24 May 2016, edited by John P. McCrae, Christian Chiarcos, Elena Montiel Ponsoda, Thierry Declerck, Petya Osenova, Sebastian Hellmann, 2016, pp36-41 Conference Paper, 2016

Non-Peer-Reviewed Publications

Uí Dhonnchadha, E. & Frenda, A., Comhrá: Corpas na Gaeilge Labhartha, Trinity College, 2013 Dataset, 2013

Uí Dhonnchadha, E, Digital Support for Irish: Corpus Development, Language Resources for the Celtic Languages, University of Wales, Bangor, Dec 2005, 2005, LexiCelt Invited Talk, 2005

Uí Dhonnchadha, E. & Judge, J., Developments in Machine Translation, Indigenous, Minority and Lesser Used (IML) Languages: Promoting our Languages through Technology, Chester Beatty Library, Dublin Castle, 9-10 November, 2015, British-Irish Council Invited Talk, 2015

Uí Dhonnchadha, E., Foclóir Stairiúil na Nua-Ghaeilge II: digitiú, caighdeánú, leamú an chorpais, Teangeolaíocht na Gaeilge XVI [Linguistics of the Gaelic Languages XVI], NUI Maynooth, 9-10 May 2014, 2014 Oral Presentation, 2014

Uí Dhonnchadha, Elaine, A Review of the Passive and Autonomous in Modern Irish, The Linguistics of the Gaelic Languages XVII, NUI Maynooth, 10-11 April 2015, 2015 Oral Presentation, 2015

Uí Dhonnchadha, Elaine, An Introduction to Celtic Language Technology, CLTW14 (Celtic Languages Technology Workshop), Dublin City University, Dublin, 23 August, 2014, COLING: 25th International Conference on Computational Linguistics, Local Committee DCU Invited Talk, 2014

Fransen, T. & Uí Dhonnchadha, E., Digital Support for Historical Periods of Irish, 15th International Congress of Celtic Studies, Glasgow, Scotland, 13-17 July, edited by Sims-Williams, P. & Ahlqvist, A., 2015 Conference Paper, 2015

Research Expertise

Projects

  • Title
    • Rule-based Machine Translation for Irish-English
  • Funding Agency
    • Department of Arts, Heritage and the Gaeltacht
  • Date From
    • 2015
  • Date To
    • 2017
  • Title
    • Plean Digiteach don Ghaeilge [Digital Plan for Irish Language Technology]
  • Funding Agency
    • Department of Arts, Heritage and the Gaeltacht
  • Date From
    • 2015
  • Date To
    • 2017
  • Title
    • Linguistic Annotation of Royal Irish Academy Corpus of Modern Irish (1600-2000)
  • Summary
    • Corpus preparation and automatic linguistic analysis of mainly historical texts to facilitate the diachronic study of Modern Irish. It is envisaged that the corpus linguistic methodology will greatly reduce the time frame for producing the RIA Historical Dictionary of Modern Irish.
  • Funding Agency
    • pro bono
  • Date From
    • 2013
  • Date To
    • 2017
  • Title
    • Linguistic Annotation of the Corpus of Bardic Texts (1200-1600)
  • Summary
    • This project involves linguistically annotating the TCD Irish Department's corpus of Bardic poetry, which dates from 1200 to 1600 AD. In collaboration with Dr. Eoin Mac Cárthaigh, Roinn na Gaeilge, we are automatically lemmatizing and part-of-speech tagging each word in the corpus using language processing tools for Irish. This will facilitate the production of a bardic poetry dictionary. To date, approximately half of the tokens in the corpus have been automatically recognised using the pre-existing Irish language tools. We are seeking funding to finish customising the tools for linguistic annotation of this period and to check the annotated text by 2020.
  • Funding Agency
    • n/a
  • Date From
    • 2017
  • Date To
    • 2020
  • Title
    • Corpas na Gaeilge Labhartha [Corpus of Spoken Irish]
  • Summary
    • To design and collect a corpus of transcribed spontaneous spoken Irish.
      GaLa Project
  • Funding Agency
    • Foras na Gaeilge
  • Date From
    • 2010
  • Date To
    • 2012
  • Title
    • WISPR: Speech Processing Resources for Welsh and Irish
  • Funding Agency
    • EU INTERREG IIIA Community Initiative Programme
  • Date From
    • 2003
  • Date To
    • 2005

Keywords

Computational linguistics; Corpus linguistics; Education and minority language; Human computer interactions; Information technology in education; Linguistics; Morphology; Natural language processing; Speech processing/technology

Recognition

Memberships

Coiste Bainistíochta [Management Committee] Fhoclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann [Royal Irish Academy] 2012 – present

NSAI representative on the International Organization for Standardization (ISO) Language Resource Management sub-committee of Standardisation of Terminology and Language and Content Resources (TC37/SC4) 2005 – 2011

National Centre for Language Technology (NCLT), Dublin City University 2000 – 2009

Fo-choiste Ríomhaireachta den Choiste Téarmaíochta [Computing Terminology Sub-committee], Foras na Gaeilge 1998 – 2000