Linguis 159: Language Processing
1 Course Information
Lecture times | Tuesdays & Thursdays 12:30-1:50pm |
Lecture Location | SST 442 |
Syllabus | http://socsci.uci.edu/~rfutrell/teaching/ling159-2018/ |
Canvas site | https://canvas.eee.uci.edu/courses/11658 |
2 Instructor Information
Instructor | Richard Futrell (rfutrell@uci.edu) |
Instructor's office | SSPB 2215 |
Instructor's office hours | Mondays 4-5pm |
3 Course Description
This course is on human language processing: what is the process in the human mind that converts language to meaning and meaning to language. We will cover experimental studies on human language understanding, as well as models and approaches from computational linguistics. Students will learn how to formulate and test precise theories of how language processing works by discussing and evaluating state-of-the-art research papers.
Detailed topics: Bayesian inference and information theory as underlying principles of language processing, speech perception, noisy channel models of sentence understanding, human languages as efficient codes for meaning, (probabilistic) context-free grammars for modeling syntactic structure, working memory effects on sentence processing, language evolution and how constraints on language processing shape languages, distributional vector-space methods for modeling word meanings, modern neural network methods for modeling language and deriving meaning from language.
4 Course Format
Class time will be spent on a mixture of lectures and seminar-style discussions about research papers. Homework will consist of short paper responses. The major assessments will be two review papers in which you review, evaluate, and propose extensions to a research paper of your choice.
Students may bring laptops to class as long as they are closed during lectures and discussions, unless we are using them as part of exercises.
5 Intended audience
This course is intended for advanced undergraduates studying language science, cognitive science, computer science, psychology, languages, and related fields. Some background in linguistics, such as Linguis 3, will make the class easier, but we will be reviewing the necessary concepts from linguistics as we go. We will be developing models using some probability theory: some background in probability will make the class and readings easier, but we will introduce/review the necessary concepts early on in the class. We will not do any math that is advanced beyond high-school algebra.
Here is a survey for students beginning the class.
6 Readings
We will have two kinds of readings: background readings and primary-literature readings. There is no course textbook. You don't need to buy anything for this course. All readings are provided as pdf documents either here or on the Canvas site. Some of the pdf documents are password-protected. You can find the password in the announcements on the Canvas site.
- Background readings. These are readings taken from textbooks which provide context and orientation to a problem we are studying. You will not be directly assessed on your knowledge of these background readings, but you will find that reading them makes lectures and the primary-literature readings dramatically more comprehensible.
- Primary-literature readings. These are research articles taken from scholarly journals, intended for a scientific audience of other researchers. For each primary-literature reading, you will be completing a paper response, as described below. In addition to the assigned readings, I am also providing a large list of further research articles which will form the basis for your Review Papers.
The background readings are drawn from these books:
- J&M — Dan Jurafsky & James Martin (2018). Speech and Language Processing, 3rd edition.
- Sedivy — Julie Sedivy (2018). Language in Mind: An Introduction to Psycholinguistics. Oxford University Press.
- Gleick — James Gleick (2011). The Information: A History, a Theory, a Flood. Pantheon Books.
7 Syllabus (subject to modification)
Day | Topic | Background reading | Primary-literature reading | Deadlines |
---|---|---|---|---|
9/27 | Introduction | |||
10/2 | Probability and inference | Intro to Bayes' Rule | ||
10/4 | Speech perception | Sedivy 4.3 | ||
10/9 | Noisy channel models | Gibson et al. (2013) | ||
10/11 | Efficient Coding I | Gleick Ch. 7 | ||
10/16 | Efficient Coding II | Mahowald et al. (2013) | ||
10/18 | Ambiguity I | Sedivy 8-8.2 | ||
10/23 | Ambiguity II | Tanenhaus et al. (1995) | ||
10/25 | Syntactic structure I | Sedivy 8.3 J&M 10-10.3 | Decide on target article for Review Paper 1 | |
10/30 | Syntactic structure II | |||
11/1 | Syntactic structure III / Prediction I | Sedivy 8.4 | ||
11/6 | Prediction II | Altmann & Kamide (1999) | ||
11/8 | Working memory I | Sedivy 8.5, J&M 13-13.4 | Review Paper 1 due | |
11/13 | Working memory II | Futrell et al. (2015) | ||
11/15 | Working memory III / Local coherence I | |||
11/20 | Local coherence II | Kamide & Kukona (2018) | ||
11/22 | Thanksgiving break | |||
11/27 | Language evolution I | Tamariz & Kirby (2016) | ||
11/29 | Language evolution II | Fedzechkina et al. (2012) | Decide on target article for Review Paper 2 | |
12/4 | Word meaning I | Sedivy 7-7.1, J&M 6-6.3 | ||
12/6 | Word meaning II | Caliskan et al. (2017) | ||
12/14 | (Finals week) | Review Paper 2 due |
8 Requirements & Grading
Grade breakdown
Work Grade percentage Paper responses 35% Review paper 1 25% Review paper 2 30% Participation 10%
- Description of requirements
- Paper responses. For each primary-literature reading, you will be required to produce a paper response with your reactions and thoughts about the article. The paper response consists of answers to three discussion questions that I will provide for each article. Paper responses should be completed 1 hour before class, so that I can review your responses ahead of the classroom discussion. Discussion questions about a paper will be made available 4 days before the class where we discuss that paper.
Review papers. You will be required to write two review papers (6-8 pages) about original research articles. These are like extended paper responses, with a special focus on critically evaluating the paper, developing new predictions, and proposing experiments to test those predictions. I will provide a pool of research papers that you can choose from for this project. If you wish, you may work in groups of 2 for these projects and turn in joint writeups.
More info on review papers, including the list of papers you can choose from.
Assignment late policy
Assignments (other than paper responses) can be turned in up to 7 days late; 10% of your score will be deducted for each 24 hours of lateness (rounded up). For example, if an assignment is worth 80 points, you turn it in 3 days late, and earn a 70 before lateness is taken into account, your score will be (1-0.3)*70=49.
Working together
You may work together on homework, but the final writeups that you turn in must be written by you alone. For the Review Papers, you may work together and turn in a joint writeup.
Mapping of class score to letter grade
I grade the course on a curve, but I guarantee minimum grades based on these thresholds:
Threshold Guaranteed minimum grade >= 90% A >= 80% B >= 70% C >= 60% D So for example a score of 90.0001% guarantees you an A-, but you could end up with a higher grade due to the curve.
9 Academic Integrity
We will be adhering fully to the standards and practices set out in UCI's policy on academic integrity.