In this course students will become familiar with how to represent
word meanings computationally and a variety of methods for
automatically determining the meaning of words. These include
dictionary based methods, such as LESK, graph based methods such as
UKB, supervised methods such as sequence classification and best paths
as well as vector space methods. We will finish with a discussion of
how to provide feedback from word sense disambiguation to meaning
representation.
The course is a seminar course, with participants choosing and
presenting papers each week. Because of this, content varies from
year to year depending on the composition of the class.
The course includes a strong computational component, students
will experiment with implementing systems and algorithms, in addition
to learning about them. By the end of this course, graduate students
will have an advanced knowledge of current computational approaches to
lexical meaning. Students should be able to evaluate lexical resources
for coverage and quality as well as to design and implement systems
for determining meaning in text.
Final Deadline 2014-12-01
PROJECT TWO DUE 2014-12-01 11:59 (email is fine)
Assessment
Presentation One (20%): assigned subject
Present a paper (or series of papers), along with an extended
motivation/background and some discussion of applications. We will
discuss both the content of the paper, and also the form of your
presentation. One goal of the course is to learn how to present
information effectively.
Presentations should take an hour, with 30-40 minutes of
presentation and 30-20 minutes discussion. You should give me the
corrected slides within a week of the presentation, and I will add
them to the web page.
- Seminar 2: Rose Chen; Huizhen Wang
- Seminar 3: Egor, Giulia
- Seminar 4: Yukun, Luis
- Seminar 5: David, Tuan Anh
Presentation Two (20%): your choice (must be OKed: should fit
in with your research)
Present a paper (or series of papers), along with an extended
motivation/background and some discussion of applications. Can be
your own work (if it is ready).
- Seminar 7: Tuan Anh, Mike
- Seminar 8: David
- Seminar 9: ???
- Seminar 11: ???
Project One (30%): assigned topic
Sample: critically evaluate one lexical resource (such as WordNet
and tagged corpus). The evaluation will include writing code to
summarize its properties and compare it to other resources.
The project must include a computational component, a presentation,
and a written component. In the computational component the student
will be expected to write to code analyse data and solve problems. In
the presentation, they will present the results to their lecturer and
peers. In the written component they will explain their results in the
form of a short paper. They should incorporate any feedback from their
presentation in the final paper. The final paper should follow
the submission
guidelines for TACL.
- Tuan Anh: Extended Simplified LESK with the wordnet gloss corpus
- Egor: Examining compositionally with vector-spaces
OR phrase clustering with
- Giulia: Cross-lingual mapping: projection vs intersection
- David: ? identifying and exploiting definitions in Indonesian
and maybe other languages
- Yukun: Named entity
Project Two (30\%): your choice (must be OKed: should fit in with your research)
Sample: implement (or extend an existing implementation of) a word
sense disambiguation system.
Recommended Readings
As there is no textbook which covers the topics to the depth required, we
will rely mainly on readings. the following textbooks are recommended for
background reading.
Course Outline
Sample topics
- Representing Word Meaning
- Word Sense Disambiguation
- Dictionary based methods
- Supervised Methods
- Cross lingual disambiguation
- Graph based methods
- Vector Space Methods
- Method Combination
- WSD and applications
- Machine Translation
- Intelligent Indexing and the Semantic Web
Sample Discussions
- Computational Lexical Semantics Relational meaning and WordNets; Vector Spaces; Ontologies (Stevenson, 2003, ch 3); (Miller, 1998), (Vossen, 1998, ch2)
- Splitting or Lumping: Granularity; Meaning across languages; Multiword expressions (Navigli, 2006),(Stevenson, 2003, ch 2)
- Dictionary based methods for WSD LESK, ALT (Lesk, 1986; Ikehara et al., 1996; Baldwin et al., 2010)
- Supervised methods for WSD. CL Special Issue on WSD (www.aclweb.org/anthology-new/J/J98/)
- Cross lingual disambiguation for WSD (Dagan and Pereira, 1994; Zhong and Ng, 2009)
- Graph based methods for WSD (Agirre and Soroa, 2009; Agirre et al., 2009)
- Vector space methods for WSD (Widdows, 2004, ch 2–4)
- Method Combination (Stevenson, 2003, ch 6)
- Machine Translation (Carpuat et al., 2006; Chan et al., 2007)
- Intelligent Indexing and the Semantic Web (Shadbolt et al., 2006)
- WSD and the Representation of Meaning (Stevenson, 2003, ch 7)
References
- Agirre, Eneko and Aitor Soroa. (2009) .Personalizing pagerank for word sense disambiguation. In 12th conference of the European chapter of the Association for Computational Linguistics (EACL-2009), Greece.
- Agirre, Eneko, Oier Lopez de Lacalle, and Aitor Soroa. (2009). Knowledge-based WSD and specific domains: performing over supervised WSD. In International Joint Conference on Artificial Intelligence (IJCAI-2009), Pasadena.
- Baldwin, Timothy, Su Nam Kim, Francis Bond, Sanae Fujita, David Martinez, and Takaaki Tanaka. (2010). A reexamination of MRD-based word sense disambiguation. ACM Transactions on Asian Language Information Processing (TALIP). (to appear).
- Chan, Yee Seng, Hwee Tou Ng, and David Chiang. (2007).Word sense disambiguation improves statistical machine translation. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 33–40, Prague, Czech Republic. Association for Computational Linguistics. URL http://www.aclweb.org/anthology/P/P07/P07-0005.
- Carpuat, Marine, Pascale Fung, and Grace Ngai. (2006). Aligning word senses using bilingual corpora. ACM Transactions on Asian Language Information Processing (TALIP), 5(2):89–120, ISSN 1530-0226. doi: http://doi.acm.org/10.1145/1165255.1165256.
- Dagan, Ido and Fernando Pereira. (1994). Similarity-based estimation of word cooccurrence probabilities. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, pages 272–278, Las Cruces, New Mexico, USA. Association for Computational Linguistics. doi: 10.3115/981732.981770. URL http://www.aclweb.org/anthology/P94-1038.
- Ikehara, Satoru, Satoshi Shirai, and Francis Bond. (1996). Approaches to disambiguation in ALT-J/E. In International Seminar on Multimodal Interactive Disambiguation: MIDDIM-96, pages 107–117, Grenoble.
- Lesk, Michael (1986). Automatic sense disambiguation: How to tell a pine cone from an ice cream cone. In Proceedings of the 1986 SIGDOC Conference, pages 24–26, New York, ACM.
- Miller, George. (1998). Nouns in WordNet. In Christine Fellbaum, editor, WordNet: An Electronic Lexical Database, chapter 1, pages 23–46. MIT Press.
- Navigli, Roberto. (2006). Meaningful clustering of senses helps boost word sense disambiguation performance. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pages 105–112, Sydney, Australia.
- Shadbolt, Nigel, Wendy Hall, and Tim Berners-Lee. (2006). The semantic web revisited. IEEE Intelligent Systems, pages 1541–1672. (http://eprints.ecs.soton.ac.uk/12614/1/Semantic_Web_Revisted.pdf).
- Stevenson, Mark. (2003). Word Sense Disambiguation. CSLI Publications.
- Vossen, Piek, editor. (1998). Euro WordNet. Kluwer.
- Widdows, Dominic. (2004). Geometry and Meaning. CSLI Publications.
- Zhong, Zhi and Hwee Tou Ng. (2009). Word sense disambiguation for all words without hard labor. In International Joint Conference on Artificial Intelligence (IJCAI-2009), pages 1616–162.
Motivation
One of the core areas of study in computational linguistics is the
study of how words and sentence meaning is represented. This course
covers both linguistics and computational issues, with an emphasis on
the latter. The course also offers a grounded introduction to some
general natural language processing techniques, such as sequence
labeling, graph search, comparison of similarity and method
combination.
Aims and objectives
This course is designed to provide an advanced knowledge of current
computational approaches to lexical meaning. Students should be able
to evaluate lexical resources for coverage and quality; design and
implement systems for determining meaning in text; and present their
results effectively. Students will do two small research projects
during the course.
On completion of this course, the students should be able to:
- Critically evaluate lexical resources for cover and accuracy.
- Understand the main approaches to representing word meaning computationally
- Read, understand and present other people's research
- Be able to implement or enhance an existing method for word sense disambiguation
- Conduct independent research on word sense disambiguation
Francis Bond
<bond@ieee.org>
Computational Linguistics Lab
Division of Linguistics and Multilingual Studies
Nanyang Technological University
Level 3, Room 55, 14 Nanyang Drive, Singapore 637332
Tel: (+65) 6592 1568; Fax: (+65) 6794 6303