Michael Zock

Groupe TALEP
Aix-Marseille Université
Case 901 - 163 Avenue de Luminy

CogALex   Cognitive Aspects of the Lexicon
(COLING workshop).
Research   For a summary see here below.
Short bio   see here.
E-mail   michael.zock (@) lif.univ-mrs.fr
Phone   +33 (0)4 86 09 06 85
Fax   +33 (0)4 91 82 92 75
My heartfelt thanks to all those colleagues and friends who have contributed to this Festschrift (1). For more on this, read here and here below.

'If you talk to a man in a language he understands, that goes to his head.
If you talk to him in his language, that goes to his heart.' (Nelson Mandela)

Communication relies on knowledge: knowledge of language, knowledge about the world around us, and, of course, knowledge concerning people: what do they know, feel and believe in? Since learning to communicate is an empirical, lifelong process, it is never too early to start practising, be it for acquiring the skill or for understanding the logic behind it.

When my grandchild Augusta had grasped the relationship between the book, her and me, she became interested in it. Yet, having her own way of making sense of the world, she tends to 'read' books upside down. Far from being a handicap, this may turn out to be an asset, helping her to become proficient in scripts other than that of her mother tongue.

This being said, she is a lot nicer (1), and so much more fun to be with when she is not in the 'study mode' (2, 3, 4, 5). Note that these pictures nicely illustrate the difference between 'giving' and 'taking', 'impression' and 'expression', the latter being my concern.

Background + current situation

After having completed my PhD in experimental psychology (work in which I tried to show that schools sometimes prevent kids from learning to speak), I was appointed by the French National Research Centre (CNRS) to work at LIMSI, an AI lab close to Paris (Orsay). I stayed there for 20 years before moving in 2006 to southern France (Marseille) to join the NLP group of the LIF (Aix-Marseille Université). If you'd like to see what the area looks like, take a look here.

Currently I am emeritus research director at the CNRS and Honorary Professor at the Research Institute of Information and Language Processing (RIILP), University of Wolverhampton, UK (1).

Motivations + approach

Speaking and writing are resource-intensive processes whose success depends not only on knowledge, but also on our momentary ability or skill to access and use it (synthesis). Yet these three conditions are not always met. Hence, our success in producing language lies somewhere between two extremes: full access to the needed resources, or more or less limited access, yielding sub-optimal performance revealed by gaps, errors, disfluencies, etc. This being so, it makes sense to create assistive technologies (1, 2, 3, 4, 5), both for supporting the mother tongue (authoring aids) and a foreign language. One may even wonder whether this is not (also) one of the missions of computational linguistics. However, to build such tools and get them used in the real world (desktop, classroom), true interdisciplinary work is needed, not only discreetly (not to say 'shamefully') in the backyard, but also, more deliberately and clearly visible, at centre stage. For additional information concerning the mindset of my approach, see 'Interactive Natural Language Generation' (INLG) here below.

Research summary

My research interests lie in communication, cognitive science and, more broadly, language production (language generation). Starting from user needs and empirical findings (psycholinguistics, neurosciences), I try to build tools that help people acquire the skill of speaking or writing in a foreign language and in their mother tongue. My current research deals with the following four topics:

  1. Message-planning: creation of an interface (i.e. linguistically motivated ontology augmented with a graph generator), to support conceptual authoring (message composition);

  2. Outline-planning: help authors to perceive possible links between their ideas to produce well-organized thoughts, i.e. coherent discourse. Writing is thinking, which also implies linking hitherto unconnected thoughts;

  3. Lexical access: words being a major gateway to the mind, we must learn how to store, use and access them. My goal is to help authors to overcome the tip-of-the-tongue problem. To this end I take into account certain features of the human mind (distribution of knowledge, sometimes accessible only in fragmentary chunks) and the mental lexicon (association, spreading activation). Of course, words in books, computers or the human brain are not the same, be it for their storage (organization), access or representation (holistic vs. decomposed). Still, it does make sense to take inspiration from the mental lexicon to see whether, functionally speaking, we can achieve something equivalent. This means in our case that access or retrieval consists in navigating a huge semantic network where all words are linked via associations. The search space can be kept small, because users are generally able to activate at least some of the target's (lemon) direct neighbors (yellow, acid fruit). Often they are even able to name the relationship between the two (synonym, hypernym, ...), and if none of this holds, one can still try to organize the output, and present the direct neighbors of the input (the word coming to the user's mind when looking for a target) in the form of a categorial tree.

  4. Acquisition of basic speaking skills: help students to become quickly fluent in a foreign language (both western and oriental) by learning the basic vocabulary and syntactic structures (learn words in context). The scope is the survival level, and the method used is to build an open, possibly self-extending, multilingual phrasebook augmented with a parametrizable exercise generator.
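The lexical-access idea of topic 3 above (navigating an association network from the words a user can still activate) can be illustrated with a minimal Python sketch. It is not the actual tool: the tiny network, the relation labels and the function names (build_network, candidate_targets, neighbours_by_relation) are all hypothetical, and links are stored symmetrically, so the direction of a relation is lost.

```python
from collections import defaultdict

# Toy association network (hypothetical data, for illustration only).
# Each edge is (word, relation, neighbour); links are treated as symmetric.
EDGES = [
    ("lemon", "colour", "yellow"),
    ("lemon", "taste", "acid"),
    ("lemon", "hypernym", "fruit"),
    ("banana", "colour", "yellow"),
    ("banana", "hypernym", "fruit"),
    ("vinegar", "taste", "acid"),
    ("canary", "colour", "yellow"),
]


def build_network(edges):
    """Return {word: {neighbour: relation}}, storing each link in both directions."""
    network = defaultdict(dict)
    for word, relation, neighbour in edges:
        network[word][neighbour] = relation
        network[neighbour][word] = relation
    return network


def candidate_targets(network, cues):
    """Rank words by the number of cue words they are directly linked to."""
    scores = defaultdict(int)
    for cue in cues:
        for neighbour in network.get(cue, {}):
            if neighbour not in cues:
                scores[neighbour] += 1
    # Highest score first; ties are broken alphabetically for a stable order.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


def neighbours_by_relation(network, word):
    """Group the direct neighbours of `word` by relation (a one-level category tree)."""
    groups = defaultdict(list)
    for neighbour, relation in network.get(word, {}).items():
        groups[relation].append(neighbour)
    return {relation: sorted(words) for relation, words in groups.items()}


NETWORK = build_network(EDGES)
# The user cannot recall the target but can activate three of its neighbours.
print(candidate_targets(NETWORK, ["yellow", "acid", "fruit"]))
# If only one word comes to mind, show its neighbours grouped by relation.
print(neighbours_by_relation(NETWORK, "lemon"))
```

With this toy data, 'lemon' is ranked first because it is the only word linked to all three cues. A real resource would need weighted associations and a far larger network; the sketch only shows why a few activated neighbours are enough to keep the search space small.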

Here is a summary of my research expressed in a few words, or simply by a wordle.

Organisation of some workshops

Electronic dictionaries and the mental lexicon

1.1 CogALex (Cognitive Aspects of the Lexicon), workshop series co-located with COLING:

2016 (cfps: workshop, shared task), Osaka, Japan
2014 (cfps: workshop, shared task)
2010, 2008 and 2004, a forerunner.

1.2 ICCS (International Conference on Cognitive Science), Beijing, 2010.

1.3 RLTLN (Lexical graphs and NLP), TALN workshop, Marseille, 2014.

Natural Language Processing and Cognitive Science (NLPCS)

NLPCS-2013: to access the proceedings, talks and tutorials of this workshop, click here.
Prior events: 2012, 2011, 2010, 2009, 2008, 2007.

European Workshop on Natural Language Generation (1995, 1993, 1991, 1989).

Tools for Authoring Aids (cfp and proceedings), LREC, Marrakech, 2008.

Some invited talks

1° Some thoughts concerning the future of the discipline

1.1 RING panel: debate with Eduard Hovy (slides), COLING, 2010, Beijing.

1.2 'AI + NLP': abstract + slides (in French), Paris, 2012.

2° Electronic dictionaries and the mental lexicon

2.1 Roget, WordNet and beyond. RANLP-2015, Hissar, Bulgaria.

2.2 Needles in a haystack and methods to find them. Can neuroscientists, psychologists and computational linguists help us (to build a tool) to overcome the Tip of the Tongue problem? NetWordS conference 'Word Knowledge and Word Usage: Representations and Processes in the Mental Lexicon', Pisa, Italy.

2.3 Wheels for the mind of the language producer: microscopes, macroscopes, semantic maps and a good compass. LREC, Malta, 2010.

2.4 The mental lexicon, blueprint of the dictionaries of tomorrow: linguistic, computational and psychological aspects of a highly valuable resource? ESSLLI, Toulouse, 2009.

2.5 How to help authors to overcome the Tip-Of-the-Tongue problem? Lexical graphs, associative networks, and some of their inherent problems. Toulouse (abstract, slides).

2.6 Do you (still) love me? A crash course on telling and recognizing lies. Eurolan-2007 (abstract, slides), Iași, Romania.

2.7 If all roads lead to Rome, they are not all alike. (abstract, in French), BLRI, Marseille, 2012.


Here is a list of my publications, and here are the links to a book on lexical resources (Gala, N. & Zock, M. (Eds). Ressources Lexicales, John Benjamins, Amsterdam), as well as to some special issues devoted to 'Cognition and the Lexicon' (2015 and 2011).

Natural Language Generation systems

List of Natural Language Generators. For a more recent version see here.

Many thanks for this Festschrift, a gift that I highly value
both as a token of your friendship and appreciation of my work.

I would like to thank you for your empathy for my approach, even though it is not a major concern for most of our colleagues. I have always been interested in people and their ways of expressing feelings and thoughts (communication). Although I deal mainly with language production, I am also interested in its correlate, interpretation. If we want to be understood, we must not only speak correctly, we must also be able to understand (to some extent) the person we are talking to.

Interactive NLG (INLG), i.e. natural language generation mediated by machines, is meant to help people to communicate or to acquire the skills of speaking and writing. This implies finding relevant topics (messages) and then expressing them in a way likely to be understood. Various processes are involved in this (organization of thoughts, finding the relevant words, articulation, ...), processes which are only partially understood. Hence gaining a better understanding of each one of them and building tools to support their learning are important goals. Yet to allow for this we need to integrate, right from the start, researchers with diverse expertise (psychologists, linguists, computer scientists, teachers) as well as the final user, an actor all too often overlooked. While this may look like a detour, it can be an asset, i.e. a potential accelerant to progress. Indeed, one may well make good progress and move ahead without necessarily dashing along the highway, travelling quietly instead on cross-disciplinary country roads, using the fertility of the soil and the 'gained' time to find new solutions to old problems while planting the seeds of future harvests.

