Project Overview

Can targeted modifications of training data improve neural networks’ ability to predict human sentence processing difficulty?

Faculty Sponsor

Prasad (gprasad@colgate.edu)

Department(s)

Computer Science
Psychological and Brain Sciences

Abstract

Temporarily ambiguous sentences such as (1) are read more slowly than their unambiguous counterparts such as (2) with the same meaning. Why is this the case? 
  1. The little girl fed the lamb remained relatively calm. 
  2. The little girl who was fed the lamb remained relatively calm. 
One account of sentence processing, Surprisal Theory, argues that the amount of time taken to read a word depends on how predictable the word is in a given sentence: the word “remained” is more predictable in (2) because of the disambiguating words “who was”. To test this, previous work has measured the predictability of words using neural network language models (NLMs) and used this to predict human reading times in sentences like (1) and (2). This work found that the estimates from current NLMs underestimated processing difficulty. 
The goal of this project is to modify the training data of these models in a targeted manner, and test if any of these targeted changes result in estimates that better align with human reading times. Such an investigation can shed light on why humans generate the predictions they do when reading sentences. Students working on this project will:  As part of these tasks, students will learn to use and modify existing code to generate sentences, train neural network models, and generate plots. 

Student Qualifications

Students must have experience using Python (covered in COSC101) and a basic knowledge of data structures (covered in COSC102). They must also be willing and able to work as an RA during Spring 2023. Having curiosity about linguistics and/or human sentence processing is preferable.

Number of Student Researchers

3 students

Project Length

8 weeks




<< Back to List





If you have questions, please contact Karyn Belanger (kgbelanger@colgate.edu).