This course will introduce students to the basics of natural language processing (NLP), a field which combines insights from linguistics and computer science to produce applications like machine translation, information retrieval, and spell checking.
We will cover a range of topics that will help students understand how current NLP technology works and will provide students with a platform for future study and research. We will move from simple representations of language, such as finite-state techniques and n-gram analysis, to more advanced representations, such as those found in context-free and unification-based parsing.
Students who take this course will gain a thorough understanding of the fundamental methods used in natural language understanding, along with an ability to assess the strengths and weaknesses of natural language technologies based on these methods.
| M | 2:00-3:00pm |
| R | 10:30-11:30am |
| or by appointment |
| Participation | 10% | |
| Homeworks | 60% | (=6@10% each) |
| Project | 30% |
You will have some programming exercises included in your homeworks, but this course will not be programming-intensive. We will instead be focusing on the concepts behind the programs.
I rely on the Academic Resource Center for assistance in verifying the need for accommodations and developing accommodation strategies. Students who have not previously contacted the Academic Resource Center are encouraged to do so (202-687-8354; http://ldss.georgetown.edu/index.html).
| Month | Week | Day | Date | Topic | Reading | Assignments |
| Aug. | 1 | W | 31 | Intro to class | ||
| Sep. | 2 | M | 5 | LABOR DAY, NO CLASS | ||
| W | 7 | Overview (.ppt) | ch. 1 | |||
| 3 | M | 12 | Regular expressions & Automata (.ppt) | ch. 2 | ||
| W | 14 | Regular expressions & Automata | ||||
| 4 | M | 19 | Morphology & Finite-State Transducers (FSTs) (.ppt) | ch. 3 | ||
| W | 21 | Morphology & FSTs | HW1 due | |||
| 5 | M | 26 | Spelling checking (.pdf) | ch. 5 | ||
| W | 28 | Spelling/N-grams (.pdf) | ch. 6 | |||
| Oct. | 6 | M | 3 | N-grams | ||
| W | 5 | Part-of-speech (POS) tagging (.ppt) | ch. 8 | HW2 due | ||
| 7 | M | 10 | COLUMBUS DAY, NO CLASS | |||
| W | 12 | POS tagging | app. D | |||
| 8 | M | 17 | POS tagging | |||
| W | 19 | POS tagging | HW3 due | |||
| 9 | M | 24 | Context-Free Grammars (CFGs) (.pdf) | ch. 9 | ||
| W | 26 | CFGs | ||||
| 10 | M | 31 | CFGs & Parsing (.ppt) | ch. 10 | ||
| Nov. | W | 2 | CFGs & Parsing | HW4 (Perl) due | ||
| 11 | M | 7 | Unification-based parsing (.ppt) | ch. 11 | ||
| W | 9 | Unification-based parsing | ||||
| 12 | M | 14 | Probabilistic parsing (.ppt) | ch. 12 | ||
| W | 16 | Probabilistic parsing | HW5 due | |||
| 13 | M | 21 | Semantics/meaning (.pdf) | ch. 14 | ||
| W | 23 | Semantic analysis | ch. 15 | |||
| 14 | M | 28 | Word Sense Disambiguation (.pdf) | ch. 17 | ||
| W | 30 | Word Sense Disambiguation | HW6 due | |||
| Dec. | 15 | M | 5 | Wrap-up | ||
| W | 7 | Project Presentations (Description) | ||||
| 16 | F | 16 | Written projects due |
This document was generated using the LaTeX2HTML translator Version 2002-2-1 (1.70)
Copyright © 1993, 1994, 1995, 1996,
Nikos Drakos,
Computer Based Learning Unit, University of Leeds.
Copyright © 1997, 1998, 1999,
Ross Moore,
Mathematics Department, Macquarie University, Sydney.
The command line arguments were:
latex2html -split 0 syllabus.tex
The translation was initiated by Markus Dickinson on 2005-08-30