New evidence for chunk-based models in word segmentation.

6533b830fe1ef96bd1297b5e

RESEARCH PRODUCT

New evidence for chunk-based models in word segmentation.

Barbara Tillmann Ronald Peereman Pierre Perruchet Bénédicte Poulin-charronnat

subject

Exploit Computer science First language Experimental and Cognitive Psychology computer.software_genre Language Development 050105 experimental psychology 03 medical and health sciences 0302 clinical medicine Arts and Humanities (miscellaneous)Chunking (psychology)Developmental and Educational Psychology Humans Learning 0501 psychology and cognitive sciences Segmentation Language Communication Parsing Two-alternative forced choice business.industry 05 social sciences Text segmentation General Medicine Models Theoretical Constructed language [ SDV.NEU ] Life Sciences [q-bio]/Neurons and Cognition [q-bio.NC][SDV.NEU]Life Sciences [q-bio]/Neurons and Cognition [q-bio.NC]Artificial intelligence Cues business computer 030217 neurology & neurosurgery Natural language processing

description

International audience; : There is large evidence that infants are able to exploit statistical cues to discover the words of their language. However, how they proceed to do so is the object of enduring debates. The prevalent position is that words are extracted from the prior computation of statistics, in particular the transitional probabilities between syllables. As an alternative, chunk-based models posit that the sensitivity to statistics results from other processes, whereby many potential chunks are considered as candidate words, then selected as a function of their relevance. These two classes of models have proven to be difficult to dissociate. We propose here a procedure, which leads to contrasted predictions regarding the influence of a first language, L1, on the segmentation of a second language, L2. Simulations run with PARSER (Perruchet & Vinter, 1998), a chunk-based model, predict that when the words of L1 become word-external transitions of L2, learning of L2 should be depleted until reaching below chance level, at least before extensive exposure to L2 reverses the effect. In the same condition, a transitional-probability based model predicts above-chance performance whatever the duration of exposure to L2. PARSER's predictions were confirmed by experimental data: Performance on a two-alternative forced choice test between words and part-words from L2 was significantly below chance even though part-words were less cohesive in terms of transitional probabilities than words.

year	journal	country	edition	language
2014-03-11

10.1016/j.actpsy.2014.01.015 https://hal.archives-ouvertes.fr/hal-00964876