Reeder

This is an old revision of the document!

The Role of Distributional Information in Linguistic Category Formation

Abstract

A crucial component of language acquisition involves organizing words into grammatical categories and discovering relations between them. Many studies have argued that phonological or semantic cues or multiple correlated cues are required for learning. Here we examine how distributional variables will shift learners from forming a category of lexical items to maintaining lexical specificity. In a series of artificial language learning experiments, we vary a number of distributional variables to category structure and test how adult learners use this information to inform their hypotheses about categorization. Our results show that learners are sensitive to the contexts in which each word occurs, the overlap in contexts across words, the non-overlap of contexts (or systematic gaps), and the size of the data set. These variables taken together determine whether learners fully generalize or preserve lexical specificity.

Introduction

Language acquisition crucially involves finding the grammatical categories of words in the input. The organization of elements into categories, and the generalization of patterns from some seen element combinations to novel ones, account for important aspects of the expansion of linguistic knowledge in early stages of language acquisition. One hypothesis of how learners approach the problem of categorization is that the categories (but not their contents) are innately specified prior to experiencing any linguistic input, with the assignment of tokens to categories accomplished with minimal exposure. A second possibility is that the categories are formed around a semantic definition. A third hypothesis, explored in the present research, is that the distributional information in the environment is sufficient (along with a set of learning biases) to extract the categorical structure of natural language. While it is likely that each of these sources of evidence makes important contributions to language acquisition, this third hypothesis regarding distributional learning has often been thought to be an unlikely contributor, given the information processing limitations of young children and the complexity of the computational processes that would be entailed.