Office Hours: Wednesdays 8-9:50, FLB 4016D
Roughly speaking, the first half of the course will be devoted to
categorical methods, and the second mostly to statistical methods,
which have become very popular over the past fifteen years. Note that
many of the statistical methods can be thought of as algorithmic
extensions of the more traditional non-statistical approaches: for
example, probabilistic context free grammars might be implemented
using a chart parsing algorithm, extended to include computations of
the production probabilities. So learning the more traditional
approaches is really a prerequisite for understanding the statistical
approaches in many cases.
Syllabus
| Week | Topic | Reading | Homework |
| 1: 1/14, 1/16 |
|
HW 1 | |
| 2: 1/23 |
|
||
| 3: 1/28, 1/30 |
|
HW 2 | |
| 4: 2/4, 2/6 |
|
||
| 5: 2/11, 2/13 |
|
HW 3 | |
| 6: 2/18, 2/20 |
|
||
| 7: 2/25, 2/27 |
|
|
|
| 8: 3/3, 3/5 |
|
MIDTERM
|
|
| 9: 3/10, 3/12 |
|
||
| Spring Recess | |||
| 10: 3/24, 3/26 |
|
HW 4 | |
| 11: 3/31, 4/2 |
|
|
|
| 12: 4/7, 4/9 |
|
||
| 13: 4/14, 4/16 |
|
|
HW 5 |
| 14: 4/21, 4/23 |
|
|
HW 6 |
| 15: 4/28, 4/30 |
|
It is assumed that you will have access to a computer from which you
can access the internet, and do some of the simple computer
programming exercises that will be assigned. Students for whom this
will be a problem should let me know right away. (There is a Linux
server in FLB where we can set up an account for you if you need it.)
Texts
There is one required book for this course:
The following book is recommended. We won't use it a lot, but
it is a good book to have if you are serious about this area:
Also, we will be making heavy use of Jurafsky and Martin's Speech
and Language Processing. There is a print edition, but it is soon
to be obsolete once the new edition is out, so I don't recommend
buying it. For this semester, we will make use of the online MS, which
until recently was found here here.
Fortunately I have a backup copy: here. (NB: This is
password protected: I'll give out the password in class.)