Publications

  • 2023
    • Peng Jin, Yunfang Wu, Xuefeng Zhu, Diana McCarthy, Weiguang Qu, and Shiwen Yu (2023) Chapter 11 PKUSenseCor: A Large-Scale Word Sense Annotated Chinese Corpus Measuring Context-Word Biases In Chinese Language Resources Chu-Ren Huang, Shu-Kai Hsieh and Peng Jin (Eds)
  • 2022
    • Qianchu Liu, Diana McCarthy, Anna Korhonen (2022) Measuring Context-Word Biases in Lexical Semantic Datasets In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP) pdf available
  • 2021
    • Qianchu Liu, Edoardo M. Ponti, Diana McCarthy, Ivan Vulić, Anna Korhonen (2021) AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP) pdf available
  • 2020
    • Olga Majewska, Ivan Vulić, Diana McCarthy, and Anna Korhonen (2020) Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis. In Proceedings of the 28th International Conference on Computational Linguistics (COLING) 2020. pdf available
    • Qianchu Liu, Diana McCarthy and Anna Korhonen (2020) Towards Better Context-aware Lexical Semantics:Adjusting Contextualized Representations through Static Anchors. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing pdf available
    • Olga Majewska, Diana McCarthy, Jasper van den Bosch, Nikolaus Kriegeskorte, Ivan Vulić, and Anna Korhonen. Spatial Multi-Arrangement for Clustering and Multi-way Similarity Dataset Construction Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC) 2020. pdf available
  • 2019
    • Qianchu Liu, Diana McCarthy, Ivan Vulić and Anna Korhonen (2019) Investigating Cross-lingual Alignment Methods for Contextualized Embeddings with Token-level Evaluation In Proceedings of the SIGNLL Conference on Computational Natural Language Learning (CONLL 2019)pdf available
    • Qianchu Liu, Diana McCarthy, Anna Korhonen (2019) Second-order contexts from lexical substitutes for few-shot learning of word representations In Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM 2019)pdf available
  • 2018
    • Olga Majewska, Diana McCarthy, Ivan Vulić and Anna Korhonen (2018) Acquiring Verb Classes Through Bottom-Up Semantic Verb Clustering In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC) pdf available, Data available
  • 2017
    • Olga Majewska, Ivan Vulić, Diana McCarthy, Yan Huang, Akira Murakami, Veronika Laippala, Anna Korhonen (2017) Investigating the cross-lingual translatability of VerbNet-style classification. In Language Resources and Evaluation pdf available DOI: https://doi.org/10.1007/s10579-017-9403-x
  • 2016
    • Andrew Bennett, Timothy Baldwin, Jey Han Lau, Diana McCarthy and Francis Bond (2016) LexSemTM: A Semantic Dataset Based on All-words Unsupervised Sense Distribution Learning. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), Berlin, Germany. pdf available and code and data available here.
    • Diana McCarthy, Marianna Apidianaki, and Katrin Erk (2016) Word sense clustering and clusterability. Computational Linguistics, 42 (2) pp 245-275 pdf available
    • Roger Evans, Alexander Gelbukh, Gregory Grefenstette, Patrick Hanks, Miloš Jakubíček, Diana McCarthy, Martha Palmer, Ted Pedersen, Michael Rundell, Pavel Rychlý, Serge Sharoff and David Tugwell (2016) Adam Kilgarriff’s Legacy to Computational Linguistics and Beyond, In Proceedings of CICLing 2016. Springer
  • 2015
    • Diana McCarthy, Adam Kilgarriff, Miloš Jakubícek and Siva Reddy (2015) Semantic Word Sketches. In 8th International Corpus Linguistics Conference (CL 2015) pdf available
    • Jane Oakhill, Kate Cain and Diana McCarthy (2015) Inference processing in children: the contribution of depth and breadth of vocabulary knowledge. In Inferences during Reading. E O'Brien, A. E. Cook and R. F. Lorch Jr. (Eds), Cambridge University Press. pp 140-159
  • 2014
    • Paul Cook, Jey Han Lau, Diana McCarthy and Timothy Baldwin (2014) Novel word-sense identification, In Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014), Dublin, Ireland pdf available here.
    • Jey Han Lau, Paul Cook, Diana McCarthy, Spandana Gella and Timothy Baldwin (2014) Learning Word Sense Distributions, Detecting Unattested Senses and Identifying Novel Senses Using Topic Models, In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), Baltimore, USA. pdf available here.
    • Martha Palmer, Claire Bonial and Diana McCarthy (2014) SemLink+: FrameNet, VerbNet and Event Ontologies, In Proceedings of Frame Semantics in NLP. A Workshop in Honor of Chuck Fillmore (1929–2014). ACL 2014. pdf available here.
    • Marianna Apidianaki, Emilia Verzeni and Diana McCarthy (2014) Semantic clustering of pivot paraphrases. Proceedings of the Ninth Language Resources and Evaluation Conference (LREC-2014), 26-31 May, Reykjavik, Iceland. pdf available here.
  • 2013
    • Paul Cook, Jey Han Lau, Michael Rundell, Diana McCarthy and Timothy Baldwin (2013) A Lexicographic Appraisal of an Automatic Approach for Detecting New Word Senses, In Proceedings of eLex 2013, Tallinn, Estonia. pdf available here.
    • Katrin Erk, Diana McCarthy and Nick Gaylord (2013) Measuring Word Meaning in Context. Computational Linguistics 39 (3) pp 511-554 DOI 10.1162/COLI_a_00142 article available here and data available here.
    • Diana McCarthy, Ravi Sinha and Rada Mihalcea (2013) The Cross-Lingual Lexical Substitution Task. In Language Resources and Evaluation 47 (3) pp 607-638 DOI 10.1007/s10579-012-9202-3 Article available here and task website with resources here.
    • Lin Sun, Diana McCarthy and Anna Korhonen. (2013) Diathesis alternation approximation for verb clustering. In Proceedings of ACL 2013, Sofia, Bulgaria pp 736-741 Paper available here
    • Kate Wild, Andrew Church, Diana McCarthy and Jacquelin Burgess (2013) Quantifying Lexical Usage: Vocabulary Pertaining to Ecosystems and the Environment. Corpora 8 (1) pp 53-79 (doi: 10.3366/cor.2013.0034) Article available here or prepublication version available here
    • Jane Oakhill, Kate Cain, Diana McCarthy and Zoe Field (2013) Making the link between vocabulary knowledge and comprehension skill In Reading: From Words to Multiple Texts. M.A. Britt, S.R. Goldman and J-F Rouet (Eds), Routledge, Taylor and Francis Group. pp 101-114
  • 2012
    • Marco Lui, Timothy Baldwin and Diana McCarthy (2012) Unsupervised Estimation of Word Usage Similarity, In Proceedings of the 2012 Australasian Language Technology Workshop (ALTW 2012), Dunedin, New Zealand. pdf version available
    • Avinesh PVS, Diana McCarthy, Dominic Glennon and Jan Pomikálek (2012) Domain Specific Corpora from the Web In Proceedings of the 15th EURALEX International Congress. Oslo, Norway.
    • Peng Jin, John Carroll, Yunfang Wu and Diana McCarthy (2012) Distributional similarity for Chinese: Exploiting Characters and Radicals In Mathematical Problems in Engineering. Selected Papers from the 7th International Conference on Computational Intelligence and Security (CIS'2011) Volume 2012, Article ID 347257. DOI: 10.1155/2012/347257
    • Diana McCarthy, Spandana Gella and Siva Reddy (2012) DSS: Text Similarity Using Lexical Alignments of Form, Distributional Semantics and Grammatical Relations In Proceedings of SemEval 2012 Montreal, Canada. pdf available
    • Andrew MacKinlay, Rebecca Dridan, Diana McCarthy and Timothy Baldwin (2012) The Effects of Semantic Annotations on Precision Parse Ranking, In Proceedings of the First Joint Conference on Lexical and Computational Semantics (*SEM 2012), Montreal, Canada. pdf available
    • Jey Han Lau, Paul Cook, Diana McCarthy, David Newman and Timothy Baldwin (2012) Word Sense Induction for Novel Sense Detection, In Proceedings of the 13th Conference of the European Chapter of the Association for computational Linguistics (EACL 2012), Avignon, France. pdf available
    • Vojtěch Kovář and Diana McCarthy (2012) New Learner Corpus Functionality in the Sketch Engine, In Proceedings of the 2012 Asia Pacific Corpus Linguistics Conference
  • 2011
    • Peng Jin, John Carroll, Yunfang Wu and Diana McCarthy (2011) Improved word similarity computation for Chinese using sub-word information. In Proceedings of the 7th International Conference on Computational Intelligence and Security, Sanya, Hainan, China. 459-462.
    • Li Wang, Diana McCarthy and Timothy Baldwin (2011) Predicting Thread Linking Structure by Lexical Chaining, In Proceedings of the 2011 Australasian Language Technology Workshop (ALTW 2011), Canberra, Australia. pdf available
    • Siva Reddy, Diana McCarthy and Suresh Manandhar (2011) An Empirical Study on Compositionality in Compound Nouns In Proceedings of the International Joint Conference on Natural Language Processing 2011 (IJCNLP-2011), Thailand pdf available
    • Siva Reddy, Ioannis Klapaftis, Diana McCarthy and Suresh Manandhar (2011) Dynamic and Static Prototype Vectors for Semantic Composition. Proceedings of the International Joint Conference on Natural Language Processing 2011 (IJCNLP-2011), Thailand Best Paper Award. pdf available
    • Siva Reddy, Diana McCarthy, Suresh Manandhar and Spandana Gella (2011) Exemplar-based word-space model for compositionality detection: shared task system description. In Proceedings of DISCo-2011 in conjunction with ACL 2011 Note Our system was ranked first in two evaluation categories and second in two other evaluation categories. pdf available
    • Kate, Wild, Diana McCarthy, Andrew Church, and Jacqueline Burgess (2011) A Corpus Linguistics Analysis of Ecosystems Vocabulary in the Public Sphere. In Corpus Linguistics 2011: Discourse and Corpus Linguistics conference Birmingham, UK.
    • Iztok Kosem, Milos Husak and Diana McCarthy (2011) GDEX for Slovene. In proceedings of eLex2011 Bled, Slovenia. pp 151-159
    • Diana McCarthy (2011) Exploiting distributional similarity for lexical acquisition. In Proceedings of Dialogue International Conference in Computational Linguistics Bekasovo Russia 25-29th May. Conference website (Invited Paper)
    • Diana McCarthy (2011) Measuring similarity of word meaning in context with lexical substitutes and translations In Computational Linguistics and Intelligent Text Processing 12th International Conference, CICLing 2011, Tokyo, Japan, February 20-26, 2011. Proceedings, Part I. Lecture Notes in Computer Science Volume 6608, 2011, DOI: 10.1007/978-3-642-19400-9 pp238-252 (Invited Paper) slides available here and Data and R code available here
  • 2010
    • Diana McCarthy (2010) DANTE: a new resource for research at the syntax-semantics interface. In Proceedings of Interdiciplinary Workshop on Verbs Pisa, Italy. (Invited paper) PDF version available and DANTE available here
    • Miloš Jakubíček, Adam Kilgarriff, Diana McCarthy and Pavel Rychlý (2010) Syntactic searching in very large corpora for many languages. In Proceedings of Workshop on Advanced Corpus Solutions, PACLIC 24 PDF version available
    • Rada Mihalcea, Ravi Sinha and Diana McCarthy (2010) SemEval-2010 Task 2: Cross-Lingual Lexical Substitution In Proceedings of SemEval-2010: 5th International Workshop on Semantic Evaluations ACL 2010, Uppsala, Sweden. PDF version available. Visit our task website and download our resources
    • Siva Reddy, Abilash Inumella, Diana McCarthy and Mark Stevenson (2010) IIITH: Domain Specific Word Sense Disambiguation. In Proceedings of SemEval-2010: 5th International Workshop on Semantic Evaluations PDF version available. Download our resources ACL 2010, Uppsala, Sweden.
    • Diana McCarthy, Bill Keller and Roberto Navigli (2010) Getting synonym candidates from raw data in the English lexical substitution task. In Proceedings of the 14th EURALEX International Congress. Leeuwarden, The Netherlands. PDF version available
  • 2009
    • Katrin Erk and Diana McCarthy (2009) Graded word sense assignment. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2009). Singapore PDF version available
    • Katrin Erk, Diana McCarthy and Nick Gaylord (2009) Investigations on Word Senses and Word Usages In Proceedings of the Joint conference of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing ACL-IJCNLP Singapore PDF version available Data and Annotator Instructions are available here
    • Jane Oakhill, Diana McCarthy, Kate Cain and Zoe Nightingale (2009) The relation between speed of semantic access, semantic knowledge, and aspects of reading ability. Presented at The Annual Meeting of the Society for Scientific Studies of Reading , Boston, June 25-27th
    • Ravi Sinha, Diana McCarthy and Rada Mihalcea (2009) SemEval-2010 Task 2: Cross-Lingual Lexical Substitution. In Proceedings of the NAACL-HLT 2009 Workshop: SEW-2009 - Semantic Evaluations PDF version available
    • Peng Jin, Diana McCarthy, Rob Koeling and John Carroll (2009) Estimating and exploiting the entropy of sense distributions. In Proceedings of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 Conference, Boulder, Colorado. PDF version available
    • Diana McCarthy, and Roberto Navigli (2009) The English Lexical Substitution Task, In Language Resources and Evaluation 43 (2) Special Issue on Computational Semantic Analysis of Language: SemEval-2007 and Beyond, Agirre, E., Màrquez, L. and Wicentowksi, R. (Eds). pp 139-159 Springer. PDF version available
    • Diana McCarthy (2009) Word Sense Disambiguation: An Overview. In Language and Linguistics Compass 3 (2) pp 537-558 DOI: 10.1111/j.1749-818X.2009.00131.x Blackwell PDF version available
  • 2008
    • Rob Koeling and Diana McCarthy. (2008) From Predicting Predominant Senses to Using Local Context for Word Sense Disambiguation. In Semantics in Text Processing. STEP 2008 Conference Proceedings Bos, J. and Delmontee, R.. College Publications pp 129-138 PDF version available
    • Diana McCarthy (2008) Lexical substitution as a Framework for Multiword Evaluation. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008) PDF version available
    • Ryu Iida, Diana McCarthy and Rob Koeling (2008) Gloss-Based Semantic Similarity Metrics for Predominant Sense Acquisition. In Proceedings of the Third International Joint Conference on Natural Language Processing pp 561-568 PDF version available
  • 2007
    • Diana McCarthy (2007) Computers getting the drift. In Christmas Issue of Philosophical Transactions of the Royal Society A: Physical, Mathematical and Engineering Sciences 365 (1861) pp 3019-3031 electronic versions available
    • Diana McCarthy, Rob Koeling, Julie Weeds and John Carroll, (2007) Unsupervised Acquisition of Predominant Word Senses. Computational Linguistics, 33 (4) pp 553-590 electronic versions available
    • Diana McCarthy (2007) Book review of "Word Sense Disambiguation: Algorithms and Applications" Eneko Agirre and Philip Edmonds (Eds.) Computational Linguistics 33 (2) pp 255-258 pdf available
    • Diana McCarthy and Roberto Navigli (2007) SemEval-2007 Task 10: English Lexical Substitution Task In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic pp 48-53 PDF version available Resources available to download here
    • Rob Koeling and Diana McCarthy (2007) Sussx: WSD using Automatically Acquired Predominant Senses, In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval-2007) , Prague, Czech Republic pp 314-317 PDF version available
    • Diana McCarthy, Sriram Venkatapathy and Aravind K. Joshi (2007) Detecting Compositionality of Verb-Object Combinations using Selectional Preferences In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2007) pp 369-379 PDF version available
    • Rob Koeling, Diana McCarthy and John Carroll (2007) Text categorization for improved priors of word meaning In Proceedings of the Eighth International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2007), Mexico City, Mexico, 3rd Best Paper Award. PDF version available
  • 2006
    • Diana McCarthy (2006) Relating WordNet senses for word sense disambiguation In Proceedings of the ACL Workshop on Making Sense of Sense: Bringing Psycholinguistics and Computational Linguistics Together , Trento, Italy pp 17-24 PDF version available Gold Standard Data avaliable here
    • Diana McCarthy (2006) Lexical Acquisition. In Keith Brown (ed.), Encyclopedia of Language and Linguistics (2nd edn).  Elsevier: Oxford.
      Volume 7 page 61-68 ISBN 0-08-044299-4
  • 2005
    • Rob Koeling, Diana McCarthy, and John Carroll (2005) Domain-Specific Sense Distributions and Predominant Sense Acquisition. In Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing. HLT/EMNLP 2005 pp 419-426. PDF version available Gold Standard Data avaliable here
    • Aline Villavicencio, Francis Bond, Anna Korhonen and Diana McCarthy (2005) Introduction to the special issue on multiword expressions: having a crack at a hard nut. in Computer Speech and Language 19(4). pp 365-377.
  • 2004
    • Diana McCarthy, Rob Koeling, Julie Weeds and John Carroll (2004) Automatic identification of infrequent word senses. In Proceedings of the 20th International Conference of Computational Linguistics, COLING-2004. Geneva, Switzerland. pp 1220-1226 PDF version available
    • Julie Weeds, David Weir and Diana McCarthy (2004) Characterising measures of lexical distributional similarity. In Proceedings of the 20th International Conference of Computational Linguistics, COLING-2004. Geneva, Switzerland. pp 1015-1021 PDF version available
    • Diana McCarthy, Rob Koeling, Julie Weeds and John Carroll, (2004) Finding predominant senses in untagged text. In Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics. Barclona, Spain. pp 280-287 PDF version available ACL Best Paper Award.
    • Diana McCarthy, Rob Koeling, Julie Weeds and John Carroll (2004) Using automatically acquired predominant senses for word sense disambiguation. Accepted for publication in Proceedings of the ACL SENSEVAL-3 workshop. Barclona, Spain. pp 151-154 PDF version available
    • Luís Villarejo, Lluis Márquez, Eneko Agirre, David Martínez, Bernardo Magnini, Carlo Strapparava, Diana McCarthy, Andrés Monotoyo, and Armando Suárez (2004) The 'meaning' system on the English all words task. In Proceedings of the ACL SENSEVAL-3 workshop. pp 253-256 Barclona, Spain. pdf available
    • Geoffrey Sampson and Diana McCarthy (eds.) (2004) Corpus Linguistics: Readings in a Widening Discipline Continuum International, London and New York.
    • Diana McCarthy, Rob Koeling and Julie Weeds (2004) Ranking WordNet Senses Automatically. Technical Report. CSRP 569. Department of Informatics, University of Sussex. PDF version available
    • Diana McCarthy (2004) Book review of "Word Sense Disambiguation: The Case for Combinations of Knowledge Sources" by Mark Stevenson Natural Language Engineering 10 (2) 196-200
  • 2003
    • Diana McCarthy, and John Carroll (2003) Disambiguating nouns, verbs and adjectives using automatically acquired selectional preferences, Computational Linguistics, 29(4). pp 639-654. PDF version available
    • Diana McCarthy, Bill Keller, and John Carroll (2003) Detecting a Continuum of Compositionality in Phrasal Verbs. In Proceedings of the ACL-SIGLEX Workshop on Multiword Expressions: Analysis, Acquisition and Treatment , Sapporo, Japan. Postscript version available Gold Standard Data avaliable here
  • 2002
    • Diana McCarthy (2002) Lexical Substitution as a Task for WSD Evaluation, In Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions , Philadelphia, USA. PDF version available
  • 2001
    • Diana McCarthy, John Carroll and Judita Preiss (2001) Disambiguating noun and verb senses using automatically acquired selectional preferences, In Proceedings of the SENSEVAL-2 Workshop at ACL/EACL'01 , Toulouse, France. Postscript version available
    • Diana McCarthy (2001) PhD Thesis Lexical Acquisition at the Syntax-Semantics Interface: Diathesis Alternations, Subcategorization Frames and Selectional Preferences. Gzipped postscript version available. My supervisor was Gerald Gazdar
  • 2000
    • Anna Korhonen, Genevieve Gorrell and Diana McCarthy (2000) Statistical Filtering and Subcategorization Frame Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. Pdf version available
    • Diana McCarthy (2000) Using Semantic Preferences to Identify Verbal Participation in Role Switching Alternations. Proceedings of the first Conference of the North American Chapter of the Association for Computational Linguistics. (NAACL) , Seattle, WA. Pdf version available
    • John Carroll and Diana McCarthy (2000) Word sense disambiguation using automatically acquired verbal preferences. In Computers and the Humanities. Senseval Special Issue, Vol 34, No 1-2 Postscript version available
  • Last Century
    • Diana McCarthy and Anna Korhonen (1998) Detecting verbal participation in diathesis alternations In Proceedings of the 36th Annual Meeting of the Association for Computational Linguists., Montreal. Vol 2. pp 1493-1495 Pdf version available
    • Diana McCarthy (1998) Book Review of "The Balancing Act", Klavans and Resnik (Eds.) Journal of Logic Language and Information, 7:2 pp223-227
    • Diana McCarthy (1997) Word Sense Disambiguation for Acquisition of Selectional Preferences. In Proceedings of the ACL/EACL 97 Workshop Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications, Madrid, Spain. pp 52-61 Pdf version available
    • Diana McCarthy (1997) Estimation of a probability distribution over a hierarchical classification. In The Tenth White House Papers COGS - CSRP 440 Postscript version available