The 10-week quarter is divided into two parts, with the first part dedicated to invited speakers and class discussions and the second part to project meetings.
In the first part, each week consists of a lecture on Tuesday followed on Thursday by a class discussion of the lecture and its assigned readings.
The first week outlines the format and content of the class and introduces the project options; the following weeks develop those options and suggest sample projects.
Project proposals and team assignments are due by midterm, preferably sooner. Teams consult with the staff to define goals and produce two-page proposals.
The second part consists of team meetings with staff and consultants. Deliverables include design documents by the end of week six and final reports by the end of exams.
Here is the schedule of invited speakers and talks for the first half of the quarter:
Week #1: Introduction
Tuesday: Part I: Tom Dean – [March 30 @ 4:30-5:50PM PDT]
[VIDEO]
[SLIDES]
Thursday: Part II: Tom Dean – [April 01 @ 4:30-5:50PM PDT]
[VIDEO]
[SLIDES]
Week #2: Acquiring Language
Tuesday: Speaker: Eve Clark – [April 06 @ 4:30-5:50PM PDT]
[VIDEO]
[SLIDES]
Thursday: Class Discussion: Eve Clark – [April 08 @ 4:30-5:50PM PDT]
[VIDEO]
Week #3: Interpreting Code
Tuesday: Speaker: Rishabh Singh – [April 13 @ 4:30-5:50PM PDT]
[VIDEO]
[SLIDES]
Thursday: Class Discussion: Dan Abolafia – [April 15 @ 4:30-5:50PM PDT]
[VIDEO]
[SLIDES]
Friday: Project Discussion: Yash Savani – [April 16 @ 10:00-11:00AM PDT]
[VIDEO]
[SLIDES]
Week #4: Interactive Agents
Tuesday: Speaker: Greg Wayne – [April 20 @ 10:00AM PDT / 6:00PM BST]
[VIDEO]
[SLIDES]
Thursday: Class Discussion: Yash Savani – [April 22 @ 4:30-5:50PM PDT]
[VIDEO]
[SLIDES]
Friday: Project Discussion: Tom Dean – [April 23 @ 10:00-11:00AM PDT]
[SLIDES]
Week #5: Analogy and Search
Tuesday: Speaker: Felix Hill – [April 27 @ 10:00AM PDT / 6:00PM BST]
[VIDEO]
[SLIDES]
Tuesday: Project Discussion: Tom Dean – [April 27 @ 4:30-5:50PM PDT]
[VIDEO]
[SLIDES]
Thursday: Class Discussion: Francis Gene Lewis – [April 29 @ 4:30-5:50PM PDT]
[VIDEO]
[SLIDES]
Week #6: Class Discussion
Tuesday: Project Discussion: Tom Dean – [May 4 @ 4:30-5:50PM PDT]
[VIDEO]
[SLIDES]
Thursday: Project Discussion: Rishabh Singh – [May 6 @ 3:30-5:50PM PDT]
[VIDEO]
Friday: Project Discussion: Tom Dean – [May 7 @ 3:30-5:50PM PDT]
[VIDEO]
Week #7: Class Discussion
Tuesday: Project Discussion: Rishabh Singh – [May 11 @ 3:30-5:50PM PDT]
[VIDEO]
Tuesday: Project Discussion: Tom Dean – [May 11 @ 2:00-3:00PM PDT]
[VIDEO]
Thursday: Project Discussion: Andrew Lampinen – [May 13 @ 11:00AM-12:00PM PDT]
[VIDEO]
Thursday: Project Discussion: Tom Dean – [May 13 @ 3:30-5:50PM PDT]
[NOTES]
[SLIDES]
Week #8: Class Discussion
Thursday: Project Discussion: Tom Dean – [May 20 @ 3:30-5:50PM PDT]
[VIDEO]
[SLIDES]
We will encourage three basic classes of final projects, each of which connects to both the DeepMind interactive agent model described in Abramson et al [2] and the programmer's apprentice application [9]. The first two classes of projects focus, respectively, on grounded language and neural programming and will emphasize implementation, i.e., designing, building, and evaluating neural network models. The third class of projects focuses on the programmer's apprentice as a whole and will emphasize designing curriculum-training protocols and hybrid automated and scalable crowd-sourced data-collection strategies to facilitate human-machine interaction and exploit the different roles of natural languages and programming languages in applications related to the programmer's apprentice. You can find more detail on all three classes of projects here, and we will be talking a lot more about projects in the class discussions early in the quarter1.
Research references available either open-source or digitally from Stanford library:
Stanford Libraries has Eve Clark's "First Language Acquisition" available with unlimited access for online use or download to your phone or laptop. You can keep it for up to 21 days at which point your current checkout expires, but you can renew it as many times as you require. Here is the link: URL.
Stanford Libraries also has three copies of "Neuroscience: Exploring the Brain (Fifth Edition)" by Mark Bear, Barry Connors, and Michael Paradiso. This is a well-written basic introduction to neuroscience designed for a first course generally spanning two or more quarters. Here is the link: URL.
"Computational Cognitive Neuroscience" by Randall C. O'Reilly, Yuko Munakata, Michael J. Frank, and Thomas E. Hazy. Chapters 6: Perception and Attention, 7: Motor Control and Reinforcement Learning, and 8: Memory are particularly relevant to the topics covered in this course: PDF.
University of Texas Houston McGovern Medical School hosts an online, interactive textbook for the study of neuroscience URL, including a companion interactive electronic laboratory for the study of neuroanatomy URL. Both of these resources are of professional quality and freely available.
Introductory lectures for 2021, related lectures and invited talks from previous years, and public forums:
Material for the March 30 and April 1 class lectures is divided into three categories2: (i) technical opportunities, (ii) biological inspiration, and (iii) implementing agent architectures – OPPORTUNITY, INSPIRATION, and IMPLEMENTATION.
Greg Wayne and Tim Lillicrap talking about imitating interactive intelligent agents and the work of the Interactive Agents Group at DeepMind described in the Abramson et al paper – INTERACTIVE INTELLIGENCE and HUMAN-AGENT INTERACTION.
Eve Clark's examples of how children learn to interpret prepositional phrases without understanding prepositions and initiate self-repair of their speech in conversation – LEARNING PREPOSITIONS and CONVERSATIONAL REPAIR.
Felix Hill's examples of how an agent's environment and perceptual limitations can actually help improve generalization and compositionality – see the 2020 ICLR paper by Hill et al [13] and this presentation PERCEPTION & GENERALIZATION.
Matt Botvinick discusses (CS379C 2018) PREFRONTAL CORTEX & META-LEARNING and Nando de Freitas discusses compositionality, learning-to-learn, and neural programmer-interpreter architectures – NEURAL PROGRAMMER-INTERPRETER.
Randy O'Reilly provides an introduction to the brain regions responsible for reinforcement learning and the biological inspiration for long-short term memory (LSTM) [16] (CS379C 2018) – PROGRAMMING IN THE BRAIN.
Greg Wayne [27] describes the implementation of MERLIN, a goal-directed agent employing an unsupervised predictive model to compensate for operating in a partially-observable environment (CS379C 2018) – INTEGRATED COGNITIVE ARCHITECTURES.
Josh Merel [20] presents work with Matt Botvinick and Greg Wayne on the principles and implementation of hierarchical motor control systems (CS379C 2020) – HIERARCHICAL MOTOR CONTROL IN MAMMALS AND MACHINES.
Rishabh Singh discusses viewing machine learning from the perspective of neural program synthesis in his invited talk at a workshop on Neuro-Symbolic Computation at CVPR in 2020 – TOWARDS HUMAN-LIKE PROGRAM SYNTHESIS.
Peter Battaglia and Jessica Hamrick gave lectures in class on their work related to interaction networks and imagination-based learning (CS379C 2019) – STRUCTURED INTELLIGENCE, MENTAL SIMULATION, IMAGINATION, AND MODEL-BASED LEARNING.
Felix Hill's lecture on deep learning for natural language processing at the University College of London as part of the DeepMind / UCL Lecture series on deep learning (UCL 2020) – DEEP LEARNING FOR NATURAL LANGUAGE.
Silvia Bunge's lecture on the neurocognitive mechanisms supporting analogical reasoning and its development on the Analogy List Online Seminar on March 18, 2021 – NEUROCOGNITIVE MECHANISMS SUPPORTING ANALOGY.
Alexandra Carstensen's lecture on early diversity in abstract thought: how context shapes the developmental trajectory of relational reasoning, on the Analogy List Online Seminar on February 18, 2021 – CONTEXTUAL INFLUENCE DEVELOPMENTAL DIVERSITY.
Assigned lecture / discussion readings – the green highlighted papers are annotated copies:
Learning Analogies and Natural Language – [Clark, 2015], [Clark, 2020], [Hill et al, 2019], [Hill et al, 2021], [Lampinen et al, 2021], [McClelland et al, 2019], [Yu et al, 2018].
Cognitive Architectures and Symbolic Systems – [Abramson et al, 2020], [Hill et al, 2020], [Merel et al, 2019], [O'Reilly et al, 2014], [Santoro et al, 2021], [Wayne et al, 2018].
Interpreting and Synthesizing Programs – [Abolafia et al, 2020], [Allamanis et al, 2018], [Balog et al, 2020], [Pierrot et al, 2019], [Reed and de Freitas, 2015], [Yan et al, 2020].
1 The three basic classes of final projects described above are elaborated in the notes that follow.
The neural programming community has grown substantially in the last few years. If you are interested in working on a coding project in this space, I recommend you begin by watching the video of Rishabh's talk at CVPR 2020 entitled Towards Human-like Program Synthesis and reading the paper [1] by Dan Abolafia, Rishabh, and their colleagues at Google Brain. As background, you might also read the neural programmer-interpreter (NPI) paper by Scott Reed and Nando de Freitas [24] (PDF) and the follow-up paper by Pierrot et al [23] extending NPI with recursive Monte Carlo Tree Search (PDF).
Rishabh and Dan along with Yash Savani will be participating in the class this quarter and are willing and interested in advising students working on neural programming coding projects. Yash recently graduated from Stanford, is currently working for Abacus AI and has been accepted to the graduate computer science programs at both CMU and Cornell. He has also been working with me and a small team of former CS379C students on neural programming and the programmer's apprentice project.
There is a growing interest in grounding language for its usefulness in reducing sample complexity, improving generalization, and providing support for analogical reasoning. Eve Clark has made significant contributions to our understanding of how children acquire their first language. Her research has underscored the critical importance of the interactive and collaborative aspects of language understanding including how infants and young children interact with their parents to establish a common ground for communication. Insights from her work are likely to be applicable to the curriculum learning protocols that the Interactive Agents Group at DeepMind are developing. I recommend that you look at Eve's recent book entitled First Language Acquisition, and her recent lectures and papers listed, respectively, under Resources, Lectures and Readings.
Analogical reasoning has not received a great deal of attention from the deep learning community until recently. I recommend that you begin by watching this video of Felix's lecture entitled Grounded Language Learning and Analogy: Developmental Pathway to Human Intelligence in Machines and reading the paper by Hill et al [14] (PDF) entitled Learning to Make Analogies by Contrasting Abstract Relational Structure that presents a promising approach to using deep learning for analogical reasoning.
Eve and Felix will be participating in the class this quarter and are interested in advising students working on projects related to grounding language, analogical reasoning, and the application of insights from linguistics and developmental cognitive science to training interactive agents such as the programmer's apprentice. As a source of inspiration for possible projects on grounding, you might take a look at some of the following papers:
Fu et al [10] from Sergey Levine's lab at UC Berkeley, Arumugam et al [5] from Stefanie Tellex's lab at Brown University, and Guadarrama et al [12] from Trevor Darrell, Pieter Abbeel, and their colleagues and students, also at Berkeley. Lampinen et al [18] (PDF) provide insight from a cognitive-science perspective on the extent to which people actually use analogy and under what circumstances. The quickest way to decide whether you want to read one of these papers is to skim their BibTeX abstracts here; if a paper interests you, you should be able to find it using the title as a query. If you have any problems, ask me and I will send it to you.
The third class of projects spans the entire programmer's apprentice application by focusing on methods for training interactive intelligent agents, building on the work of Abramson et al [2] to address the problems involved in building agents that engage in symbolic behavior [25] of the sort required of the programmer's apprentice. Unlike the previous two classes, the projects in this class will not require writing code. Instead, the deliverables include one or more of the following: agent network architectures, curriculum-learning protocols, and scalable crowd-sourced training-data acquisition strategies. Teams interested in pursuing projects in this class are encouraged to collaborate with teams working on the first two classes of projects in order to better understand the issues surrounding language grounding and automated programming.
The emphasis of the projects in this class should be on applying the general concepts developed in the IAG paper, and the principles that follow from Eve Clark's observations and analysis of the way children learn their first language, to the design of a programmer's apprentice. Examples of interesting issues to address include whether to include, as part of the curriculum-learning protocol, a variant of self-supervised motor babbling in which the agent generates random sequences of words before graduating to sequences in a simple pidgin language that observe enough structure to make them comprehensible – interpretable with a little effort on the part of the programmer/parent, but not strictly grammatical.
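To make the staged idea concrete, here is a minimal toy sketch of such a two-stage babbling-to-pidgin curriculum. The token sets and the VERB OBJECT [MODIFIER] template are entirely hypothetical illustrations, not part of any proposed protocol:

```python
import random

def babble(vocab, length, rng):
    """Stage 1: self-supervised 'motor babbling' -- random word sequences
    with no structure at all."""
    return [rng.choice(vocab) for _ in range(length)]

def pidgin_utterance(rng):
    """Stage 2: a simple pidgin with just enough structure to be
    interpretable (VERB OBJECT [MODIFIER]) without being fully grammatical."""
    verbs = ["move", "copy", "delete", "show"]
    objects = ["file", "line", "word", "window"]
    modifiers = ["up", "down", "left", "right"]
    utterance = [rng.choice(verbs), rng.choice(objects)]
    if rng.random() < 0.5:          # modifier is optional
        utterance.append(rng.choice(modifiers))
    return utterance

rng = random.Random(0)
vocab = ["move", "copy", "file", "line", "up", "down"]
print(babble(vocab, 4, rng))    # unstructured, generally uninterpretable
print(pidgin_utterance(rng))    # structured enough to interpret with effort
```

A real protocol would of course schedule the transition between stages based on the agent's progress rather than hard-coding it; the point here is only the contrast between unconstrained and minimally structured output.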
The programmer and apprentice might also employ a simple sign language that would enable a form of shared attention, perhaps using an eye tracker to naturally draw attention to a portion of the monitor screen or using the mouse to specify an area of shared interest. Eve's description of self-initiated repair [8] (PDF) could provide the basis for designing a powerful new method of semi-supervised learning to shape the programmer's behavior in terms of providing useful feedback on the apprentice's efforts to communicate in both natural and programming language.
The marked-up version (PDF) of Abramson et al highlights the technologies employed in the curriculum used to train the IAG agent. The paper by Santoro et al [25] (PDF) describes the challenges and opportunities in building agents that engage in symbolic behaviors. If you're interested in this project, you will probably also want to familiarize yourself with recent developments in training reinforcement learning systems including hindsight experience replay [4], reinforcement learning with unsupervised auxiliary tasks [17], and curiosity-driven exploration by self-supervised prediction [22] – this fifteen-minute video provides a quick introduction to all three.
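Of the three techniques just mentioned, hindsight experience replay is perhaps the easiest to sketch: a failed episode is relabeled as a success for the goal the agent actually achieved, turning sparse-reward failures into useful training signal. The transition format, goal representation, and sparse reward below are simplifying assumptions for illustration, not the exact formulation in [4]:

```python
from dataclasses import dataclass

@dataclass
class Transition:
    state: tuple
    action: int
    goal: tuple        # the goal the agent was originally pursuing
    next_state: tuple

def reward(next_state, goal):
    # Sparse reward: 1 only when the goal state is reached.
    return 1.0 if next_state == goal else 0.0

def her_relabel(episode):
    """Relabel every transition with the episode's final achieved state as
    the goal (the 'final' strategy), recomputing the sparse reward."""
    achieved = episode[-1].next_state
    return [(t.state, t.action, achieved, t.next_state,
             reward(t.next_state, achieved))
            for t in episode]

# A failed episode: the agent pursues goal (9, 9) but never reaches it ...
episode = [Transition((0, 0), 1, (9, 9), (1, 0)),
           Transition((1, 0), 1, (9, 9), (2, 0))]
# ... yet after relabeling, (2, 0) is treated as the goal, so the final
# transition now carries a reward of 1.0.
relabeled = her_relabel(episode)
print(relabeled[-1])
```

In the full method these relabeled transitions are added to the replay buffer alongside the originals and consumed by an off-policy learner such as DQN or DDPG.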
If you want to work on a project in the third class of projects, in addition to Eve and Felix, I expect Greg Wayne would be interested in hearing about and providing feedback on your ideas. Greg's research contributions on unsupervised predictive memory [27], robust imitation learning [26], hierarchical motor control [20], and dynamic external memory [11] are well worth your time exploring. You can find a link to a Stanford Libraries digital version of Eve's First Language Acquisition on the Resources page and I suggest that you first take a look at the summary that I put together. Recall that for copyright reasons these notes are access controlled; if you didn't hear or forgot the username and password that I gave out in the first class, just ask a fellow student or send me an email and I'll be glad to send it to you.
2 Originally there were four categories, with the fourth being "biographical". The intent was to provide students with some background on my personal, educational and professional trajectory since in previous classes students have expressed curiosity and it is generally awkward to answer such questions in class. After some thought, I decided to relegate it to a footnote rather than have it intrude on the primary focus of this document. If you are interested in learning about my personal and professional background you can read about it here, and I would be happy to answer any lingering questions you might have outside of class time.