RESEARCH PRODUCT

New evidence for chunk-based models in word segmentation.

Barbara Tillmann, Ronald Peereman, Pierre Perruchet, Bénédicte Poulin-Charronnat

subject

Exploit; Computer science; First language; Experimental and Cognitive Psychology; computer.software_genre; Language Development; 050105 experimental psychology; 03 medical and health sciences; 0302 clinical medicine; Arts and Humanities (miscellaneous); Chunking (psychology); Developmental and Educational Psychology; Humans; Learning; 0501 psychology and cognitive sciences; Segmentation; Language; Communication; Parsing; Two-alternative forced choice; business.industry; 05 social sciences; Text segmentation; General Medicine; Models, Theoretical; Constructed language; [SDV.NEU] Life Sciences [q-bio]/Neurons and Cognition [q-bio.NC]; Artificial intelligence; Cues; business; computer; 030217 neurology & neurosurgery; Natural language processing

description

There is ample evidence that infants are able to exploit statistical cues to discover the words of their language. However, how they do so is the subject of enduring debate. The prevalent position is that words are extracted from the prior computation of statistics, in particular the transitional probabilities between syllables. As an alternative, chunk-based models posit that sensitivity to statistics results from other processes, whereby many potential chunks are considered as candidate words and then selected as a function of their relevance. These two classes of models have proven difficult to dissociate. We propose a procedure that yields contrasting predictions regarding the influence of a first language, L1, on the segmentation of a second language, L2. Simulations run with PARSER (Perruchet & Vinter, 1998), a chunk-based model, predict that when the words of L1 become word-external transitions of L2, learning of L2 should be impaired to the point of falling below chance level, at least until extensive exposure to L2 reverses the effect. In the same condition, a transitional-probability-based model predicts above-chance performance regardless of the duration of exposure to L2. PARSER's predictions were confirmed by experimental data: performance on a two-alternative forced-choice test between words and part-words from L2 was significantly below chance, even though part-words were less cohesive than words in terms of transitional probabilities.
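As a minimal illustration of the statistic the abstract refers to, the forward transitional probability from syllable a to syllable b can be estimated as count(ab) / count(a). The sketch below is not the authors' procedure and is not PARSER; the toy words, the word order, and the function name `transitional_probabilities` are hypothetical and chosen only to show that within-word transitions are more cohesive than transitions spanning a word boundary.

```python
from collections import Counter


def transitional_probabilities(syllables):
    """Estimate forward TP(a -> b) = count(a followed by b) / count(a with a successor)."""
    pair_counts = Counter(zip(syllables, syllables[1:]))
    # Only syllables that have a successor can start a transition.
    first_counts = Counter(syllables[:-1])
    return {(a, b): n / first_counts[a] for (a, b), n in pair_counts.items()}


# Hypothetical toy language with three two-syllable words (assumed for illustration).
words = {"A": ["ba", "du"], "B": ["ki", "to"], "C": ["me", "lo"]}
order = "ABCACBABCBCA"  # arbitrary word sequence, no immediate repetitions
stream = [syllable for w in order for syllable in words[w]]

tps = transitional_probabilities(stream)

within_word = tps[("ba", "du")]      # transition inside word A
across_boundary = tps[("du", "ki")]  # transition from the end of A to the start of B
print(within_word, across_boundary)
```

In this toy stream the within-word transition has a higher estimated probability than the boundary-spanning one, which is the contrast a transitional-probability-based account relies on when it favours words over part-words.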

10.1016/j.actpsy.2014.01.015
https://hal.archives-ouvertes.fr/hal-00964876