CS224U: Natural Language Understanding

Course information

Time

MW 4:15pm - 5:30pm

Location

320-105

Staff

Instructor	Bill MacCartney	Office Hours: MW 3:00pm - 4:00pm, Bytes Café
Instructor	Chris Potts	Office Hours: Thu/Fri 11:30am - 12:30pm, 460-101
TA	Sam Bowman	Office Hours: Thu 3:00pm - 4:00pm, 460-030A
TA	Milind Ganjoo	Office Hours: Wed 11:00am - 12:00pm, Bytes Café
TA	Andy Mai	Office Hours: Tue 11:00am - 12:00pm, Bytes Café

All of us

(This address should be used for all course correspondence, including assignments.)

Discussion forum

http://www.piazza.com/stanford/spring2014/cs224u

Catalog description

Machine understanding of human language. Computational semantics (determination of sense, event structure, thematic role, time, aspect, synonymy/meronymy, causation, compositional semantics, treatment of scopal operators), and computational pragmatics and discourse (coherence relations, anaphora resolution, information packaging, generation). Theoretical issues, online resources, and relevance to applications including question answering, summarization, and textual inference. Prerequisites: one of LING180, CS224N, CS224S; and knowledge of logic (LING130A or B, CS157, or PHIL159).

Requirements

Class participation

Attendance will be taken daily, with one point assigned for each class attended. Class will begin on time and end on time; we are obliged to finish on time, and you are obliged to arrive on time.

We would like everyone to ask questions, offer ideas, etc., in class. Questions and ideas sent to the course email address also count as participation, though we would prefer that everyone get involved during our class meetings.

Homeworks

There are seven weekly homeworks, due at the beginning of class on Wednesdays of weeks 2 through 8. The homeworks will depend on materials from the readings, so you should do the readings before starting the homeworks. With the reading done, each homework should take you about 30-40 minutes (longer if you decide to pursue the issues in greater depth, perhaps as a lead-in to a project).

Our goals for the homeworks: (i) to raise important questions, (ii) to foster common ground for the in-class discussions, and (iii) to help you master central NLU concepts.

- Homeworks are due by the start of class on their due date.
- Submit all homeworks by email to the course address.
- Acceptable formats: txt, rtf, doc, docx, pdf.
- Make the subject of your submission email "SUNetID: homework #1", with the appropriate SUNetID and homework number.

Final project

The final project is the main assignment of the second half of the course. Final projects can be done in groups of 1-3 people. They are required to be related in a substantive way to at least one of the central topics of the course. The main components are as follows:

Literature review paper (due May 5, 11:59pm): a short 6-page single-spaced paper summarizing and synthesizing several papers on the area of your final project. Groups of one should review 5 papers, groups of two should review 7 papers, and groups of three should review 9. The ideal is to have the same topic for your lit review and final project, but it's possible that you'll discover in the lit review that you hate the topic, so you can switch topics (or groups) for the final project; your lit review will be graded on its own terms. Tips on major things to include:
- General problem/task definition: What are these papers trying to solve? Why?
- Concise summaries of the articles: Do not simply copy the article text in full. We can read them ourselves. Put in your own words the major contributions of each article.
- Compare and contrast: Point out the similarities and differences of the papers. Do they agree with each other? Are results seemingly in conflict? If the papers address different subtasks, how are they related? (If they are not related, then you may have made poor choices for a lit review...). This section is probably the most valuable for the final project.
- Future work: Make several suggestions for how the work can be extended. Are there open questions to answer? This would presumably include how the papers relate to your final project idea.
Project milestone (due May 19, 11:59pm): a short overview of your project including at least the following information:
1. A statement of the project's goals.
2. A summary of previous approaches (drawing on the lit review).
3. A summary of the current approach.
4. A summary of progress so far: what you have done, what you still need to do, and any obstacles or concerns that might prevent your project from coming to fruition.
Presentations (June 2 & 4): We'll use the last two sessions of the course for in-class final project presentations.
Final paper (due at the end of our scheduled exam period: Tuesday, June 10, 3:15 pm): The paper should be 8 pages long, in ACL submission format. Here are the LaTeX and Word templates for the current ACL style. Please email the paper as a PDF file to the course address. What to put in a final project paper:
- Research papers: These are papers where you attempted some new research idea. This doesn't have to be publishable research; it's totally great to do a replication of a result you read about. Such papers should contain clear sections describing (i) the problem you are addressing; (ii) your hypothesis or proposed solution (and if you are implementing someone else's solution, where you got the idea from); (iii) alternative solutions, or at least a baseline that you are comparing your solution to; (iv) your methodology; (v) your evaluation; and (vi) some discussion of what your results imply for your hypothesis/problem.
- Implementation papers: These are papers where you code up a version of someone else's algorithm just to learn the details of the algorithm, or do a big semantic data labeling project. Here you want clear sections describing (i) the task that you are replicating, the algorithm you are implementing, or the data you are labeling; (ii) your methodology (what you did, how you did it); (iii) an evaluation, i.e., the experimental results; and (iv) a discussion of what you learned.

Policies

Grading

Your grade is determined based on:

Class participation: 10%
Homeworks: 30%
Literature review: 15%
Project milestone: 10%
Final presentation of project: 5%
Final project paper: 30%
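
As an illustration of how these weights combine, here is a minimal Python sketch. It assumes each component is first scored as a percentage from 0 to 100; that convention, the component keys, and the `course_score` helper are hypothetical and are not part of the official grading procedure.

```python
# Component weights in percent, taken from the list above.
WEIGHTS = {
    "participation": 10,
    "homeworks": 30,
    "lit_review": 15,
    "milestone": 10,
    "presentation": 5,
    "final_paper": 30,
}


def course_score(scores):
    """Return the weighted course percentage.

    `scores` maps each component name to a percentage in [0, 100].
    Assumption: every component is already expressed as a percentage.
    """
    missing = set(WEIGHTS) - set(scores)
    if missing:
        raise ValueError(f"missing components: {sorted(missing)}")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS) / 100


example = {
    "participation": 100,
    "homeworks": 90,
    "lit_review": 80,
    "milestone": 100,
    "presentation": 100,
    "final_paper": 85,
}
print(course_score(example))
```

Under these assumptions, the final project paper and the homeworks together account for 60% of the total, so they dominate the result.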

Policy on late work

Each student will have a total of 4 free late (calendar) days applicable to any assignment (including the lit review and project milestone) except the final project paper. These can be used at any time, no questions asked. Each 24 hours or part thereof that a homework is late uses up one full late day. Once these late days are exhausted, any homework turned in late will be penalized 20% per late day. Late days are not applicable to final projects. If a group's assignment is late n days, then each group member is charged n late days.
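
One way to read the arithmetic of this policy is sketched below in Python. The function names are illustrative. The sketch assumes that free late days are applied automatically before any penalty, and that the 20% penalty is subtracted from the raw score for each late day not covered by a free day; the policy text does not spell out either detail. It does not model the final project paper, to which late days do not apply.

```python
import math

FREE_LATE_DAYS = 4       # free late days per student, from the policy
PENALTY_PER_DAY = 0.20   # penalty per late day once free days are exhausted


def late_days_used(hours_late):
    """Each 24 hours or part thereof counts as one full late day."""
    return max(0, math.ceil(hours_late / 24))


def apply_late_policy(raw_score, hours_late, free_days_left=FREE_LATE_DAYS):
    """Return (adjusted_score, free_days_remaining) for one assignment.

    Assumptions (not stated in the policy): free days are spent first,
    and the penalty is a fraction of the raw score, floored at zero.
    """
    days = late_days_used(hours_late)
    covered = min(days, free_days_left)
    penalized = days - covered
    factor = max(0.0, 1.0 - PENALTY_PER_DAY * penalized)
    return raw_score * factor, free_days_left - covered


# 25 hours late counts as 2 late days; both are covered by free days.
print(apply_late_policy(100.0, 25, free_days_left=4))
```

For group work, the same number of late days would be charged to each group member, so the remaining-days count would need to be tracked per student.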

Policy on submitting related final projects to multiple classes

On the one hand, we want to encourage you to pursue unified interdisciplinary projects that weave together themes from multiple classes. On the other hand, we need to ensure that final projects for this course are original and involve a substantial new effort.

To try to meet both these demands, we are adopting the following policy on joint submission: if your final project for this course is related to your final project for another course, you are required to submit both projects to us by our final project due date. If we decide that the projects are too similar, your project will receive a failing grade. To avoid this extreme outcome, we strongly encourage you to stay in close communication with us if your project is related to another you are submitting for credit, so that there are no unhappy surprises at the end of the term. Since there is no single objective standard for what counts as "different enough", it is better to play it safe by talking with us.

Fundamentally, we are saying that combining projects is not a shortcut. In a sense, we are in the same position as professional conferences and journals, which also need to watch out for multiple submissions. You might have a look at the current ACL/NAACL policy, which strives to ensure that any two papers submitted to those conferences make substantially different contributions. That is our goal here as well.

Academic honesty

Please familiarize yourself with Stanford's honor code:

http://studentaffairs.stanford.edu/judicialaffairs/policy/honor-code

We will adhere to it and follow through on its penalty guidelines.

Students with documented disabilities

Students who may need an academic accommodation based on the impact of a disability must initiate the request with the Student Disability Resource Center (SDRC) located within the Office of Accessible Education (OAE). SDRC staff will evaluate the request with required documentation, recommend reasonable accommodations, and prepare an Accommodation Letter for faculty dated in the current quarter in which the request is being made. Students should contact the SDRC as soon as possible since timely notice is needed to coordinate accommodations. The OAE is located at 563 Salvatierra Walk (phone: 723-1066).

Schedule

Week	Date	HW due	Who	Topic and Readings
1	Mar 31		Chris & Bill	Outlook for NLU & course goals slides Ng, Hwee Tou and John Zelle. 1997. Corpus-based approaches to semantic interpretation in natural language processing. AI Magazine 18(4): 45-64. Ferrucci, David; Eric Brown; Jennifer Chu-Carroll; James Fan; David Gondek; Aditya A. Kalyanpur; Adam Lally; J. William Murdock; Eric Nyberg; John Prager; Nico Schlaefer; and Chris Welty. 2010. Building Watson: an overview of the DeepQA project. AI Magazine 31(3): 59-79. Mitchell, Tom. 2004. Reading the Web: a breakthrough goal for AI. In Matthew Stone and Haym Hirsh, eds., AI: the next 25 Years. (Help Tom Mitchell win a lobster dinner; the clock is ticking!) Podcast: The challenge and promise of artificial intelligence. 2011. With Eric Horvitz and Peter Norvig. The Computer History Museum. Levesque, Hector J. 2013. On our best behavior. In Proceedings of IJCAI, 1297-1304.
1	Apr 2		Chris	Major concepts and goals of (computational) semantics and pragmatics slides Beaver, David and Joey Frazee. To appear. Semantics. In Ruslan Mitkov, ed., The Oxford Handbook of Computational Linguistics, 2nd edition. Oxford University Press. Potts, Christopher. To appear. Pragmatics. In Ruslan Mitkov, ed., The Oxford Handbook of Computational Linguistics, 2nd edition. Oxford University Press.
2	Apr 7		Chris	Distributional word representations slides [code and data] Turney, Peter D. and Patrick Pantel. 2010. From frequency to meaning: vector space models of semantics. Journal of Artificial Intelligence Research 37: 141-188.
2	Apr 9	HW 1 due	Chris	Distributed word representations and neural nets slides [shallow neural net starter code] Socher, Richard and Christopher D. Manning. 2013. Deep learning for NLP (without magic). Tutorial given at NAACL 2013, Atlanta. Through slide 68; video part 1, especially 0:22:00-1:10:00. Optional Baroni, Marco; Raffaella Bernardi; Ngoc-Quynh Do; and Chung-chieh Shan. 2012. Entailment above the word level in distributional semantics. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, 23-32. ACL. Optional Huang, Eric; Richard Socher; Christopher D. Manning; and Andrew Ng. 2012. Improving word representations via global context and multiple word prototypes. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics. Volume 1: Long Papers, 873-882. ACL. Optional Deep learning tutorial from Andrew Ng's group
3	Apr 14		Bill	Relation extraction 1 slides Jurafsky, Daniel and James H. Martin. 2009. Speech and Language Processing, 2nd edition. Chapter 22, Information Extraction, pp. 725-743. Snow, Rion; Daniel Jurafsky; and Andrew Y. Ng. 2005. Learning syntactic patterns for automatic hypernym discovery. In Proceedings of NIPS, 1297-1304. Mintz, Mike; Steven Bills; Rion Snow; and Dan Jurafsky. 2009. Distant supervision for relation extraction without labeled data. In Proceedings of ACL-IJCNLP, 1003-1011. Suntec, Singapore: ACL.
3	Apr 16	HW 2 due	Bill	Relation extraction 2 Banko, Michele; Michael J. Cafarella; Stephen Soderland; Matt Broadhead; and Oren Etzioni. 2007. Open information extraction from the Web. In Proceedings of IJCAI, 2670-2676. Fader, Anthony; Stephen Soderland; and Oren Etzioni. 2011. Identifying relations for open information extraction. In Proceedings of EMNLP, pp. 1535-1545. Yao, Limin; Sebastian Riedel; and Andrew McCallum. 2012. Unsupervised relation discovery with sense disambiguation. In Proceedings of ACL-2012, 712-720. Jeju Island, Korea: ACL.
4	Apr 21		Chris	Dependency parses for NLU slides de Marneffe, Marie-Catherine; Bill MacCartney; and Christopher D. Manning. 2006. Generating typed dependency parses from phrase structure parses. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), 449-454. Genoa, Italy: ELRA. Optional de Marneffe, Marie-Catherine and Christopher Manning. 2008. The Stanford typed dependencies representation. In Proceedings of the COLING 2008 Workshop on Cross-Framework and Cross-Domain Parser Evaluation, 1-8. ACL. Optional de Marneffe, Marie-Catherine and Christopher D. Manning. 2008. Stanford typed dependencies manual. Optional de Marneffe, Marie-Catherine; Miriam Connor; Natalia Silveira; Samuel R. Bowman; Timothy Dozat; and Christopher D. Manning. 2013. More constructions, more genres: Extending Stanford Dependencies. In Eva Hajicova, Kim Gerdes, and Leo Wanner (eds.), Proceedings of the 2nd International Conference on Dependency Linguistics, 187-196. ACL.
4	Apr 23	HW 3 due	Bill	Workshop 1: Project planning & system evaluation slides Domingos, Pedro. 2012. A few useful things to know about machine learning. Communications of the ACM 55(10): 78-87. Resnik, Philip and Jimmy Lin. 2010. Evaluation of NLP Systems. In The Handbook of Computational Linguistics and Natural Language Processing, 271-295. Oxford: Wiley-Blackwell. Smith, Noah. 2011. Appendix B: Experimentation. In Linguistic Structure Prediction, 181-197. Morgan and Claypool. Optional Hsueh, Pei-Yun; Prem Melville; and Vikas Sindhwani. 2009. Data quality from crowdsourcing: a study of annotation selection criteria. In Proceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing, 27-35. Boulder, Colorado: ACL.
5	Apr 28		Bill	Introduction to semantic parsing and lambda calculus slides Manning, Christopher D. 2005. An introduction to formal computational semantics. Ms., Stanford University. Blackburn, Patrick and Johan Bos. 2003. Computational semantics. Theoria 18(1): 27-45. Optional Potts, Christopher. 2007. Logic for Linguists. Sections 5 and 9.
5	Apr 30	HW 4 due	Bill	From utterances to logical forms slides Zettlemoyer, Luke S. and Michael Collins. 2005. Learning to map sentences to logical form: structured classification with probabilistic categorial grammars. In Proceedings of the Twenty First Conference on Uncertainty in Artificial Intelligence. Matuszek, Cynthia; Evan Herbst; Luke S. Zettlemoyer; and Dieter Fox. 2012. Learning to parse natural language commands to a robot control system. In Proceedings of the 13th International Symposium on Experimental Robotics. Optional Kwiatkowski, Tom; Luke Zettlemoyer; Sharon Goldwater; and Mark Steedman. 2010. Inducing Probabilistic CCG Grammars from Logical Form with Higher-Order Unification. In Proceedings of EMNLP, 1223-1233. Cambridge, MA: ACL.
6	May 5	Lit review due	Chris	From utterances to denotations slides [reference implementations] Liang, Percy; Michael I. Jordan; and Dan Klein. 2013. Learning dependency-based compositional semantics. Computational Linguistics 39(2): 389-446. Optional Artzi, Yoav and Luke Zettlemoyer. 2013. Weakly supervised learning of semantic parsers for mapping instructions to actions. In Transactions of the ACL 1: 49-62.
6	May 7	HW 5 due	Bill	Interpreting queries with structure at Google Cai, Qingqing and Alexander Yates. 2013. Large-scale semantic parsing via schema matching and lexicon extension. In Proceedings of ACL, 423-433. Sofia, Bulgaria: ACL. Kwiatkowski, Tom; Eunsol Choi; Yoav Artzi; and Luke Zettlemoyer. 2013. Scaling semantic parsers with on-the-fly ontology matching. In Proceedings of EMNLP, 1545-1556. Seattle, WA: ACL.
7	May 12		Bill	Natural logic and textual inference slides MacCartney, Bill and Christopher D. Manning. 2009. An extended model of natural logic. In Proceedings of IWCS-8, 140-156. Tilburg, Netherlands: ACL. Optional MacCartney, Bill and Christopher D. Manning. 2008. Modeling semantic containment and exclusion in natural language inference. In Proceedings of COLING, 521-528. Manchester, UK: ACL.
7	May 14	HW 6 due	Sam Bowman	Recursive neural networks for semantic interpretation slides Bowman, Samuel. 2014. Can recursive neural tensor networks learn logical reasoning? arXiv preprint. Socher, Richard; Eric H. Huang; Jeffrey Pennington; Andrew Y. Ng; and Christopher D. Manning. 2011. Dynamic pooling and unfolding recursive autoencoders for paraphrase detection. In Proceedings of NIPS-2011. Optional Socher, Richard and Christopher D. Manning. 2013. Deep learning for NLP (without magic). Tutorial given at NAACL 2013, Atlanta. Slides 98–end; video part 2.
8	May 19	Project milestone due	Chris	Sentiment analysis slides Incredibly useful general resource Pang, Bo and Lillian Lee. 2008. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval 2(1-2):1-135. Domain-sensitive sentiment lexicons Turney, Peter D. and Michael L. Littman. 2003. Measuring praise and criticism: inference of semantic orientation from association. ACM Transactions on Information Systems 21: 315-346. Sentiment and compositional semantics Socher, Richard; Alex Perelygin; Jean Wu; Jason Chuang; Christopher D. Manning; Andrew Y. Ng; and Christopher Potts. 2013. Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, 1631-1642. ACL. Sentiment and context Sudhof, Moritz; Andrés Gómez Emilsson; Andrew L. Maas; and Christopher Potts. 2014. Sentiment expression conditioned by affective transitions and social forces. To appear in Proceedings of 20th Conference on Knowledge Discovery and Data Mining. ACM. Sentiment and social ties Thomas, Matt; Bo Pang; and Lillian Lee. 2006. Get out the vote: determining support or opposition from Congressional floor-debate transcripts. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, 327-335. ACL.
8	May 21	HW 7 due	Chris	Dialogue agents slides Overview Jurafsky, Dan. 2004. Pragmatics and computational linguistics. In Laurence R. Horn and Gregory Ward, eds., Handbook of Pragmatics, 578-604. Oxford: Blackwell. Language and action Vogel, Adam and Dan Jurafsky. 2010. Learning to follow navigational directions. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, 806-814. ACL. Pragmatic agents Vogel, Adam; Max Bodoia; Christopher Potts; and Dan Jurafsky. 2013. Emergence of Gricean maxims from multi-agent decision theory. In Human Language Technologies: The 2013 Annual Conference of the North American Chapter of the Association for Computational Linguistics, 1072-1081. ACL. Approximate rationality via machine learning Vogel, Adam; Andrés Gómez Emilsson; Michael C. Frank; Dan Jurafsky; and Christopher Potts. 2014. Learning to reason pragmatically with cognitive limitations. [data and code] The browser as context Allen, James; Nathanael Chambers; George Ferguson; Lucian Galescu; Hyuckchul Jung; Mary Swift; and William Taysom. 2007. PLOW: a collaborative task learning agent. Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 1514-1519. AAAI Press.
9	May 26			[Memorial Day — no class]
9	May 28		Bill & Chris	Workshop 2: Writing up and presenting your work slides Stuart Shieber on reporting research results David Goss on math style
10	Jun 2			Project presentations
10	Jun 4			Project presentations
	Jun 10, 3:15 pm	Final project due

CS 224U (LING 188/288) Natural Language Understanding Spring 2014