Catalog description
Machine understanding of human language. Computational semantics
(determination of sense, event structure, thematic role, time,
aspect, synonymy/meronymy, causation, compositional semantics,
treatment of scopal operators), and computational pragmatics and
discourse (coherence relations, anaphora resolution, information
packaging, generation). Theoretical issues, online resources, and
relevance to applications including question answering,
summarization, and textual inference. Prerequisites: one of
LING180, CS224N, CS224S; and knowledge of logic (LING130A or B,
CS157, or PHIL159).
Attendance will be taken daily, with one point assigned for
each class attended. Class will begin on time and end on time; we
are obliged to finish on time, and you are obliged to arrive on
time.
We would like everyone to ask questions, offer ideas, etc., in
class. Questions and ideas sent via email to the course address
also count as participation, though we would prefer it if everyone
got involved during our class meetings.
There are eight homeworks, due before the start of meetings 2-9
of the term. (After that point, the assignments are oriented
towards final projects.)
The homeworks will depend on materials from the readings, so
you should do the readings before starting the homeworks. With the
reading done, each homework should take you 15-20 minutes (longer
if you decide to pursue the issues in greater depth, perhaps as a
lead-in to a project).
Our goals for the homeworks: (i) to raise important questions,
(ii) to foster common ground for the in-class discussions, and
(iii) to help you master central NLU concepts.
All homeworks are due by the start of class on their due date.
Submit them by email to the course address.
The final project is the main assignment of the second half of
the course. Final projects can be done in groups of 1-3 people.
They must be related in a substantive way to at least one of the
central topics of the course. The main components are
as follows:
Literature review paper (due Feb 14, 11:59 pm): a short
paper (6 pages, single-spaced) summarizing and synthesizing 5
papers in the area of your final project. Groups of two
should review 7 papers, and groups of three should review
9. The ideal is to have the same topic for your lit review and
final project, but it's possible that you'll discover in the
lit review that you hate the topic, so you can switch topics
(or groups) for the final project; your lit review will be
graded on its own terms. Tips on major things to include:
General problem/task definition: What are these
papers trying to solve? Why?
Concise summaries of the articles: Do not
simply copy the article text in full. We can read them
ourselves. Put in your own words the major contributions of
each article.
Compare and contrast: Point out the similarities and
differences of the papers. Do they agree with each other?
Are results seemingly in conflict? If the papers address
different subtasks, how are they related? (If they are not
related, then you may have made poor choices for a lit
review...). This section is probably the most
valuable for the final project.
Future work: Make several suggestions for how
the work can be extended. Are there open questions to
answer? This would presumably include how the papers relate
to your final project idea.
Project milestone: a brief report covering the following:
A summary of previous approaches (drawing on the lit review).
A summary of the current approach.
A summary of progress so far: what you have done,
what you still need to do, and any obstacles or concerns
that might prevent your project from coming to
fruition.
The final paper will generally be one of two kinds:
Research papers: These are papers where you
attempted some new research idea. This doesn't have to be
publishable research; it's totally great to do a
replication of a result you read about. Such papers
should contain clear sections describing (i) the problem
you are addressing; (ii) your hypothesis or proposed
solution (and if you are implementing someone else's
solution, where you got the idea from); (iii) alternative
solutions, or at least a baseline that you are comparing
your solution to; (iv) your methodology; (v) your
evaluation; and (vi) some discussion of what your results
imply for your hypothesis/problem.
Implementation papers: These are papers where
you code up a version of someone else's algorithm just to
learn the details of the algorithm, or do a big semantic
data labeling project. Here you want clear sections
describing (i) the task that you are replicating, the
algorithm you are implementing, or the data you are
labeling; (ii) your methodology (what you did, how you did
it); (iii) an evaluation, i.e., the experimental results;
and (iv) a discussion of what you learned.
Each student will have a total of 4 free late (calendar) days
applicable to any assignment (including the lit review and project
milestone) except the final project paper. These can be used at
any time, no questions asked. Each 24 hours or part thereof that
a homework is late uses up one full late day. Once these late days
are exhausted, any homework turned in late will be penalized 20%
per late day. Late days are not applicable to final projects. If a
group's assignment is late n days, then each group member
is charged n late days.
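To make the arithmetic concrete, here is a minimal sketch in Python of the late-day accounting described above. It is illustrative only, not official course code; the function and constant names are our own.

    import math

    FREE_LATE_DAYS = 4           # per student, for the whole term
    PENALTY_PER_LATE_DAY = 0.20  # applied once free late days run out

    def late_days_used(hours_late):
        """Each 24 hours, or part thereof, counts as one full late day."""
        return math.ceil(hours_late / 24) if hours_late > 0 else 0

    def grade_multiplier(hours_late, free_days_remaining):
        """Return (score multiplier, free late days left) for one submission.
        For a group submission, every member is charged the same number
        of late days."""
        days = late_days_used(hours_late)
        covered = min(days, free_days_remaining)
        uncovered = days - covered
        multiplier = max(0.0, 1.0 - PENALTY_PER_LATE_DAY * uncovered)
        return multiplier, free_days_remaining - covered

    # Example: a homework turned in 30 hours late by a student with one
    # free late day left costs 2 late days in total: the remaining free
    # day plus one 20% penalty day.
    print(grade_multiplier(30, 1))  # -> (0.8, 0)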
On the one hand, we want to encourage you to pursue unified
interdisciplinary projects that weave together themes from
multiple classes. On the other hand, we need to ensure that final
projects for this course are original and involve a substantial
new effort.
To try to meet both these demands, we are adopting the
following policy on joint submission: if your final project for
this course is related to your final project for another course,
you are required to submit both projects to us by our final
project due date. If we decide that the projects are too similar,
your project will receive a failing grade. To avoid this extreme
outcome, we strongly encourage you to stay in close communication
with us if your project is related to another you are submitting
for credit, so that there are no unhappy surprises at the end of
the term. Since there is no single objective standard for what
counts as "different enough", it is better to play it
safe by talking with us.
Fundamentally, we are saying that combining projects is not a
shortcut. In a sense, we are in the same position as professional
conferences and journals, which also need to watch out for
multiple submissions. You might have a look
at the current
ACL/NAACL policy, which strives to ensure that any two papers
submitted to those conferences make substantially different
contributions; that is our goal here as well.
Students who may need an academic accommodation based on the
impact of a disability must initiate the request with the Student
Disability Resource Center (SDRC) located within the Office of
Accessible Education (OAE). SDRC staff will evaluate the request
with required documentation, recommend reasonable accommodations,
and prepare an Accommodation Letter for faculty dated in
the current quarter in which the request is being made. Students
should contact the SDRC as soon as possible since timely notice is
needed to coordinate accommodations. The OAE is located at 563
Salvatierra Walk (phone: 723-1066).
Readings
Ferrucci, David; Eric Brown; Jennifer Chu-Carroll;
James Fan; David Gondek; Aditya A. Kalyanpur;
Adam Lally; J. William Murdock; Eric Nyberg; John Prager;
Nico Schlaefer; and Chris Welty. 2010.
Building Watson: an overview of the DeepQA project.
AI Magazine 31(3): 59-79.
Optional advanced reading:
McCarthy, Diana; Rob Koeling; Julie Weeds; and John Carroll. 2004.
Finding predominant word senses in untagged text.
In Proceedings of ACL,
279-286.
Barcelona, Spain: ACL.
de Marneffe, Marie-Catherine; Bill MacCartney; and Christopher D. Manning. 2006.
Generating typed dependency parses from phrase structure parses.
In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006),
449-454.
Genoa, Italy: ELRA.
Optional
de Marneffe, Marie-Catherine and Christopher Manning. 2008.
The Stanford typed dependencies representation.
In Proceedings of the COLING 2008 Workshop on Cross-Framework and Cross-Domain Parser Evaluation,
1-8.
ACL.
Banko, Michele; Michael J. Cafarella; Stephen Soderland; Matt Broadhead; and Oren Etzioni. 2007.
Open information extraction from the web.
In Proceedings of IJCAI,
2670-2676.
Optional
Fillmore, Charles J. and B. T. Atkins. 1992.
Towards a frame-based lexicon: the case of RISK.
In Adrienne Lehrer and Eva F. Kittay, eds., Frames, Fields, and Contrasts,
75-102.
Hillsdale, NJ: Erlbaum Publishers.
Incredibly useful general resource
Pang, Bo and Lillian Lee. 2008.
Opinion mining and sentiment analysis.
Foundations and Trends in Information Retrieval 2(1-2):1-135.
Tan, Chenhao; Lillian Lee; Jie Tang; Long Jiang; Ming Zhou; and Ping Li. 2011.
User-level sentiment analysis incorporating social networks.
In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1397-1405.
San Diego, CA: ACM.
Jurafsky, Daniel and James H. Martin. 2009.
Speech and Language Processing, 2nd edition.
Chapter 21, Computational discourse, pp. 1-15.
Prasad, Rashmi; Nikhil Dinesh; Alan Lee; Eleni Miltsakaki; Livio Robaldo; Aravind Joshi; and Bonnie Webber. 2008.
The Penn Discourse Treebank 2.0. In Nicoletta Calzolari,
Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, and Daniel Tapias, eds.,
Proceedings of the Sixth International Language Resources and Evaluation (LREC'08).
Marrakech, Morocco: European Language Resources Association (ELRA).
DeVault, David and Matthew Stone. 2009.
Learning to interpret utterances using dialogue history.
In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), 184-192.
Athens: Association for Computational Linguistics.
Allen, James; Nathanael Chambers; George Ferguson; Lucian Galescu; Hyuckchul Jung; Mary Swift; and William Taysom. 2007.
PLOW: a collaborative task learning agent.
In Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 1514-1519.
Vancouver: AAAI Press.