Connecting Cues: Overlapping Regularities Support Cue Discovery in Infancy

SAHNI, Sarah D ; SEIDENBERG, Mark S ; et al.

In: Child development, Jg. 81 (2010), Heft 3, S. 727-736

Online academicJournal - print; 10; 1 p.1/2

Zugriff:

Volltext (PDF)

The present work examined the discovery of linguistic cues during a word segmentation task. Whereas previous studies have focused on sensitivity to individual cues, this study addresses how individual cues may be used to discover additional, correlated cues. Twenty-four 9-month-old infants were familiarized with a speech stream in which syllable-level transitional probabilities and an overlapping novel cue served as cues to word boundaries. Infants' behavior at test indicated that they were able to discover the novel cue. Additional experiments showed that infants did not have a preexisting preference for specific test items and that transitional probability information was necessary to acquire the novel cue. Results suggest one way learners can discover relevant linguistic structure amid the multiple overlapping properties of natural language.

Connecting Cues: Overlapping Regularities Support Cue Discovery in Infancy.

The present work examined the discovery of linguistic cues during a word segmentation task. Whereas previous studies have focused on sensitivity to individual cues, this study addresses how individual cues may be used to discover additional, correlated cues. Twenty‐four 9‐month‐old infants were familiarized with a speech stream in which syllable‐level transitional probabilities and an overlapping novel cue served as cues to word boundaries. Infants' behavior at test indicated that they were able to discover the novel cue. Additional experiments showed that infants did not have a preexisting preference for specific test items and that transitional probability information was necessary to acquire the novel cue. Results suggest one way learners can discover relevant linguistic structure amid the multiple overlapping properties of natural language.

Natural languages exhibit structure at multiple levels in parallel (e.g., phonological, lexical, morphological, syntactic, and discourse). For the adult listener, this complexity creates temporary ambiguities that must be resolved for speech to be understood. Individual bits of information are imprecise, such as the meaning of words like bow, colon, saw, and wave. Such ambiguities are resolved via a constraint satisfaction process that exploits correlations among different types of information ([19]). While individual cues are often unreliable, combinations of cues are not. The principal characteristic of the constraint satisfaction process is that it allows learners to utilize the correlations of cues. For example, the word saw has several meanings (related to seeing, cutting, a tool for cutting, etc.) and is thus highly ambiguous in isolation. Embedded within an utterance such as "I saw you,""I" restricts its interpretation to verbs. The object "you" further restricts the interpretation to the "seeing" meaning, since a person is more likely to be seen than sawed, though the result might differ if the context were a magic show. Similarly, hearing saw in a hardware store suggests the noun interpretation of saw as a tool. Rapid online comprehension is possible because of our ability to exploit constraints between different types of information ([34]).

While studies of adult language have investigated how constraints are combined to resolve ambiguities, studies of language acquisition have examined how children use statistical cues to learn their native language. These are complementary issues, the "constraints" that are relevant to adult listeners are the "cues" by which the child acquires language ([33]). Seminal work by [31] showed that 8‐month‐olds are sensitive to the transitional probability (TP) between two syllables (the frequency of the two syllables divided by the frequency of the first syllable) when listening to a fluent stream of speech. This study, and the large body of work that has followed it, suggests that infants are sensitive to statistical regularities that exist in natural languages and can use them to learn aspects of their native language.

Experiments investigating language acquisition via statistical learning have typically focused on infants' abilities to use one statistical cue at a time (e.g., [4]; [31]). However, natural speech is complex, containing overlapping regularities at multiple levels. For the language learner, this presents a difficult problem: How are these cues discovered? There are many ways speech can be analyzed; how does the child determine which aspects of the input are relevant? Moreover, the fact that any given bit of information may contribute to multiple levels of analysis (e.g., /b/ is the first sound in the word baby, the beginning of the first syllable, receives primary stress making it louder and longer, the transition point between the words the baby, etc.) creates a difficult learning problem. The complexity of this learning problem is sometimes thought to limit severely the explanatory role played by statistical learning in language acquisition ([39]).

Alternatively, the constraint satisfaction approach suggests that the complexity of natural language provides a rich system for learning mechanisms to exploit. Linguistic regularities reinforce each other across levels, allowing statistical learning mechanisms to capitalize on multiple cues and redundancies. For example, lexical stress patterns are found in numerous languages of the world. These patterns consist of a specific ordering of strong and weak syllables that occur frequently and can help identify word boundaries or classify groups of words. In English, many words have a trochaic, or strong–weak, stress pattern, as in the words BAby and MOmmy ([5]). In other languages, it is more common for words to have an iambic (weak–strong) stress pattern, as in the word guiTAR. And in some languages stress cannot be used to group syllables or identify word boundaries at all. Because these cues vary from language to language, they must be learned. How then does the infant discover that stress patterns are informative? Strong regularities like lexical stress overlap with other regularities at multiple levels, highlighting and reinforcing their utility. The acoustic regularities (i.e., higher pitch, longer duration, and increased volume) of stressed syllables can draw attention to the beginning of trochaic words. Distributional cues, such as the overrepresentation of trochaic items in speech to English‐learning children, ensure that young language learners have plenty of exposure to the pattern. Together these regularities can enhance the accessibility of the lexical stress pattern.

Psycholinguistic studies support the hypothesis that infants are sensitive to the conjunction of multiple probabilistic cues ([8]; [36]). Additionally, studies of infant categorization and conceptual development demonstrate that the natural environment provides infants with a multitude of correlated cues that they are able to exploit (e.g., [1]; [20]; [28]; [40]; [41]). Finally, connectionist models have shown that simple learning mechanisms that capitalize on structure within a complex system can exploit multiple correlated cues that exist in the infants' world. Computational models have demonstrated that problems such as finding word boundaries ([3]), generating properly inflected forms ([13]; [25]), and grouping common objects into categories ([29]) can be solved using multiple cues.

Despite this progress, it remains to be determined how language learners isolate and combine cues given the complexity of human language. In considering this problem, it is helpful to distinguish between two classes of potentially useful cues: language‐general cues and language‐specific cues. Measures of co‐occurrence or predictability between syllables are language‐general cues, in that they operate in similar ways across natural languages. For example, transitional probabilities are not specific to any given language (though the units over which these computations are performed are).

Other cues may or may not be useful in any given language and are thus language specific, and must be learned. For example, languages have different lexical stress patterns (iambic vs. trochaic), and in some languages stress patterns do not mark boundaries or help individual units cohere. By 9 months of age, infants typically show sensitivity to a range of language‐specific cues (for a recent review, see [32]).

In the domain of word segmentation, previous work suggests that younger infants tend to use language‐general cues and later shift to language‐specific cues ([38]). In a segmentation study using a nonsense language, TPs, a language‐general cue operating over novel syllable combinations was placed in conflict with the language‐specific stress pattern of English. Six‐and‐a‐half month‐olds segmented the fluent speech using the language‐general strategy of relying on TPs. In contrast, infants who were 2 months older used language‐specific lexical stress patterns (also see [14]). This shift suggests that over time, infants become more sensitive to idiosyncratic cues, learning which regularities are relevant (and, presumably, which are not) for their native language. However, little is known about how this process unfolds.

How might infants discover these language‐specific cues? One potential explanation is that language‐general cues provide a basis for discovering overlapping or co‐occurring language‐specific cues. For example, in word segmentation, infants may use their sensitivity to TP cues, which is present early in life ([18]; [37]) to discover language‐specific cues that are correlated with TPs.

The present work tested the hypothesis that infants can discover novel cues by exploiting redundancies between language‐general and language‐specific cues. Nine‐month‐old infants were exposed to a fluent speech stream that contained two overlapping cues to word boundaries: a language‐general cue (TPs) and a language‐specific cue (/t/‐onsets). TPs are known to be salient to 9‐month‐old infants. The second cue was specific to the artificial language and therefore novel: Each word in the speech stream began with /t/. Experiment 1 was thus designed to test the hypothesis that infants can use the language‐general TP cue to discover the overlapping language‐specific /t/‐onset cue. The /t/‐initial syllables are only informative as a cue to word boundaries due to their overlap with the TP cue; the TP cue positions the /t/‐initial syllable at the onset of each word. Consequently, the only way infants can extract this pattern is to use its overlap with the TP cue. We tested infants using items that were all novel relative to the exposure corpus but that varied in their use of the /t/‐onset cue. The question of interest was whether infants would be sensitive to the presence of /t/‐onsets in the test items. If so, this would provide evidence that infants can isolate individual cues by using redundancies in the speech stream.

Experiment 1

To examine whether infants can use a language‐general segmentation cue to discover an overlapping novel language‐specific cue, infants heard a fluent speech stream that contained two overlapping cues to word boundaries: (a) dips in TPs at word boundaries and (b) /t/‐initial syllables at word onsets. To determine whether infants acquired the novel /t/‐onset pattern, test items either adhered to the pattern (began with a /t/‐syllable) or violated it (contained a medial /t/). Crucially, these items were previously unheard combinations of syllables from the speech stream (i.e., TP = 0). Therefore, TP information would not allow infants to distinguish between the two types of test items. Instead, successful discrimination hinged on discovery of the /t/‐initial pattern present in the speech stream played during familiarization.

Method

Participants. Twenty‐four 9.5‐month‐old monolingual English‐learning infants (mean age = 9.5 months, range = 9.0–10.0) participated in this experiment. All infants were born full‐term and had fewer than four prior ear infections and no history of hearing or vision impairments. Data from an additional 8 infants were excluded due to fussiness (4) and parents stopping the experiment (4).

Stimuli. A fluent stream of speech was created from recordings of a female native English speaker who was blind to the structure of the artificial language. The language contained six bisyllabic words: tohsigh, teemay, tiepu, tukee, tayla, and tafo. A pseudosynthesis technique was used to create the speech stream, which allowed for use of naturally produced syllables while permitting control over coarticulation, duration, pitch, and volume of all syllables in the language. All three‐syllable sequences that occurred in the language, both within and between word boundaries (e.g., tohsightee, sighteemay), were recorded in a monotone, isochronous register. Medial syllables were spliced out of the three‐syllable sequences and concatenated together with no silence between syllables. By using these medial syllables, coarticulation within each syllable and between every pair of syllables in the language was maintained. Syllables were edited prior to concatenation to have the same duration, pitch, and volume. The stream contained 40 repetitions (2 min 17 s) of each word in a pseudorandom order with no word appearing twice in succession (see the Appendix for a transcript of the complete familiarization language). Each within‐word syllable pair had a TP of 1.0; between‐word syllable pairs had a TP between.1 and.25 (M = .20). The speech stream thus contained two overlapping and completely redundant cues to word boundaries: dips in TPs and /t/‐onsets.

Four novel test items were constructed from syllables in the artificial language. Two of these items began with /t/ (tiemay, tohla), and two contained a medial /t/ (fota, keetu). Test items were created from recordings of each bisyllabic item spoken in isolation. Duration, pitch, and volume were edited so that all test items were essentially equivalent.

Procedure. During familiarization, infants listened to the speech stream at a comfortable volume, played over speakers mounted on each sidewall while viewing an unrelated Baby Einstein video. An experimenter then entered the booth, covered the monitor that displayed the video, and placed headphones playing masking music on the caregiver. The test phase began with two practice trials (a recording of piano tones), designed to help the infants learn the contingency between their head‐turns and the lights and sounds. The practice trials were followed by 12 test trials, three blocks of each of the four test items (tiemay, tohla, fota, and keetu). Infants' ability to discriminate the test items was assessed using the Headturn Preference Procedure ([16]). The experimenter was seated outside the booth, observing the infants' head‐turns on a closed circuit TV, and controlling the experiment via custom software. Lights were mounted on the center wall (directly facing the infant) and sidewalls. Each trial began with the center light blinking. Once the infant fixated on the light, it was extinguished and one of the sidelights began to blink. When the infant fixated on the blinking sidelight a sound was played from the speaker below the light. On each test trial, an item was repeated until the infant looked away for at least 2 s, or until the item had repeated 15 times. If the infant failed to fixate on the side light for at least 1 s during a test trial, the trial was excluded and an additional trial of that test item was automatically added after the third test block.

Results

We tested infants' ability to discriminate /t/‐initial from /t/‐medial test items over the three blocks of testing with a 2 (test item type: /t/‐initial vs. /t/‐medial) × 3 (test block: 1, 2, 3) repeated‐measures analysis of variance (ANOVA; means shown in Table 1). The main effect of test item type (/t/‐initial vs. /t/‐medial) was not significant, F(1, 23) = 0.975, p = .33. The assumption of sphericity was violated for the Type × Block interaction term (Mauchly's W = .687) and so multivariate analyses were used to evaluate the significance of the interaction. With a large violation of sphericity (i.e., when Mauchly's W < .7), the statistical power of multivariate techniques tends to be greater than univariate techniques ([17]; [24]). There was a significant interaction between test item type and block, F(2, 22) = 8.15, p = .002. As shown in Figure 1, the significant interaction reflects a reversal in the direction of preference over the course of testing. The familiarity preference present in the first two test blocks shifts to a novelty preference in the third block. Block interactions and shifts in direction of preference have been previously observed elsewhere in the literature ([8]) but are not often discussed.

1  Mean and Standard Errors for Looking Times

	Experiment 1	Experiment 2	Experiment 3
/t/‐initial	/t/‐medial	/t/‐initial	/t/‐medial	/t/‐initial	/t/‐medial
Block 1	8.69 (0.80)	7.54 (0.70)	8.44 (0.88)	8.75 (0.70)	8.30 (0.96)	7.75 (0.66)
Block 2	7.45 (0.71)	6.02 (0.75)	7.00 (0.57)	7.56 (0.70)	6.77 (0.56)	6.95 (0.68)
Block 3	4.58 (0.39)	5.94 (0.56)	6.79 (0.68)	6.32 (0.79)	7.18 (0.85)	6.77 (0.73)
Blocks 1 and 2 (averaged)	8.07 (0.58)	6.78 (0.59)	7.57 (0.60)	8.18 (0.53)	7.53 (0.58)	7.35 (0.55)
All trials (averaged)	6.92 (0.46)	6.53 (0.46)	7.40 (0.51)	7.52 (0.48)	7.31 (0.39)	7.09 (0.50)

Graph: 1 Experiment 1: Mean looking times to /t/‐initial and /t/‐medial test items for each test block.

Subsequent analyses focused on the first two blocks (eight test trials), as these looking times are more proximal to the familiarization phase and thus most likely to reflect learning from the fluent speech. A one‐way (test item type: /t/‐initial vs. /t/‐medial) repeated‐measures ANOVA revealed a significant difference in looking times to the two types of items, F(1, 23) = 6.30, p = .02 (see Figure 2). Infants looked longer to the /t/‐initial test items, which adhered to the pattern presented during familiarization. Recall that the TPs between syllables in the test items were all zero. Thus, infants could not have discriminated /t/‐initial from /t/‐medial test items based on TP cues. These results suggest that infants learned the /t/‐initial pattern and generalized it to include the novel test items.

Graph: 2 The average looking time for /t/‐initial and /t/‐medial items across test Blocks 1 and 2 for all three experiments.

Discussion

The results of Experiment 1 suggest that infants were able to exploit the /t/‐initial pattern, successfully discriminating novel test items that followed the pattern from those that did not. This pattern was not immediately obvious in the input; the speech stream consisted of syllables beginning with /t/ alternating with syllables that began with other sounds. In order to discover that the /t/ segment signaled word onsets, infants presumably capitalized on the TP cues in the speech stream, which also provided cues to word boundaries. On this view, infants discovered the language‐specific /t/‐onset cue by capitalizing on the language‐general TP cue.

One interesting feature of these data is that infants' looking behavior changed over the course of testing. Familiarity to novelty preference switches is not uncommon in infant behavioral studies, though the factors responsible for the shift may vary ([7]; [12]). The test items used in this study consisted of novel combinations of syllables from the familiarization language. This introduction of novel items at test forces infants to generalize beyond the training corpus, making it more likely that participants will show a familiarity preference at the outset of testing (e.g., [38]). However, during the course of testing, infants received differential amounts of exposure to the test items. Figure 3 depicts the difference in looking times to /t/‐initial and /t/‐medial test items across the three test blocks. Initially, infants looked longer to /t/‐initial items, thus receiving more exposure to them than to the /t/‐medial items. This pattern of listening during testing may have led the infants to become bored with these items, moving them toward a novelty preference. To test this hypothesis, we examined individual participants' looking preferences across the testing session. Fifteen of 24 participants showed an initial familiarity preference that transitioned into a novelty preference, 5 showed an initial familiarity preference that remained a familiarity preference, and 4 showed an initial novelty preference that remained a novelty preference. A majority of participants showed the predominant pattern of increased exposure to the /t/‐initial items in the first two blocks with a novelty preference in the third block. A chi‐square test confirmed that this pattern of behavior would not be expected by chance (χ2 = 20.3, df = 3, p = .0001). This pattern of results suggests that while infants' initial test responses were linked to learning during the familiarization phase, the novelty preference in Block 3 may have reflected infants' experiences during testing.

Graph: 3 The difference between looking times to /t/‐initial and /t/‐medial test items, for each block for all three experiments. Note. Positive values indicate a familiarity preference; negative values indicate a novelty preference.

The findings from Experiment 1 support the hypothesis that infants used low TPs at word boundaries to acquire the overlapping but novel /t/‐onset pattern. However, there is an alternate explanation for these results: It is possible that infants' behavior reflected preexisting preferences for individual items. A counterbalanced language composed of items that all contain a medial /t/ would clarify this issue. However, previous work ([6]) has shown that it may be easier to generalize from patterns that occur at the edges of sequences, as opposed to those occurring medially. Consequently, the two counterbalanced languages might not be equally learnable. A second experiment tested this possibility with a new group of infants, who participated only in the test phase of the experiment. If infants in Experiment 1 had an a priori preference for the /t/‐initial test items, infants in Experiment 2 should show a similar pattern of behavior. If infants in Experiment 2 do not show a similar pattern, this would suggest that exposure to familiarization materials that contained the two overlapping cues was necessary to elicit the preference for /t/‐initial test items.

Experiment 2

This experiment was designed to test the hypothesis that infants in Experiment 1 listened longer to the /t/‐initial test items due to an a priori preference for these particular items. Infants in Experiment 2 were not exposed to the familiarization speech stream, participating only in the testing procedure used in Experiment 1.

Method

Participants. Twenty‐four 9.5‐month‐old monolingual English‐learning infants (mean age = 9.5 months, range = 9.0–9.9) participated. Data from an additional 5 infants were excluded from the analyses because of experimenter error (1), fussiness (2), failure to contribute at least two trials for each item (1), and mean looking time to one or both sides less than 3 s (1).

Stimuli. The test items were the same as those used in Experiment 1.

Procedure. There was no exposure phase. The testing procedure was identical to Experiment 1, with 2 practice trials followed by 12 test trials.

Results and Discussion

As in Experiment 1, a 2 (test item type: /t/‐initial vs. /t/‐medial) × 3 (test block: 1, 2, 3) repeated‐measures ANOVA was conducted. There was no significant effect of test item type (/t/‐initial vs. /t/‐medial), F(1, 23) = 0.081, p = .78, nor was the interaction between block and test item type significant, F(2, 46) = 0.35, p = .70 (see Table 1 and Figure 3). These results indicate that infants in Experiment 2 did not discriminate between /t/‐initial and /t/‐medial test items.

We next conducted a 2 (test item type: /t/‐initial vs. /t/‐medial) × 2 (group: Experiment 1, Experiment 2) repeated‐measures ANOVA over the data from the first two test blocks from Experiments 1 and 2. This analysis was intended to determine whether the behavior of infants differed reliably across the two experiments. The interaction between test item type and group was significant, F(1, 46) = 6.19, p = .017, suggesting that infants who heard the familiarization materials showed a different pattern of behavior at test than those who did not (see Figure 2).

This between‐group analysis, coupled with the within‐subject analysis showing no effect of test item type, indicates that infants in Experiment 2 did not have an a priori preference for the /t/‐initial items relative to the /t/‐medial items. We can therefore attribute infants' behavior in Experiment 1 to familiarization with the fluent speech stream. Nevertheless, it is still unclear which aspects of the familiarization stimuli elicited infants' successful discrimination between /t/‐initial and /t/‐medial test items. It is possible that, as originally hypothesized, low TPs at word boundaries anchor the alternating /t/ syllables, allowing infants to extract the /t/‐onset cue and generalize it to the novel test items. Another possibility is that infants are extracting the extremely regular alternating /t/ syllable pattern (created because each of the bisyllabic words begins with a /t/) and uniformly mapping the /t/ syllable to word onsets. According to this alternative hypothesis, infants could capitalize on a systematic pattern (/t/‐onsets) without the aid of another cue. On this account, they detect the regular alternation and map it onto onsets, potentially because onsets are privileged perceptually and/or lexically (e.g., [2]; [15]; [21]; [22]).

To explore this hypothesis, we designed a new speech stream to determine whether infants could extract the novel /t/‐onset pattern without the aid of another cue for bootstrapping. The resulting speech stream did not have low TPs at word boundaries but still contained the /t/‐onset pattern. If infants can extract the /t/‐onset pattern without an overlapping cue, infants in Experiment 3 should show a significant difference in looking time to /t/‐initial items compared to /t/‐medial items. However, if TPs played a critical role in the discovery of the novel pattern via bootstrapping, infants should not show a significant difference in looking time to /t/‐initial items compared to /t/‐medial items in the absence of TP cues.

Experiment 3

This experiment was designed to determine whether the overlapping cue from Experiment 1, low TPs at word boundaries, was necessary for infants to acquire the novel /t/‐onset pattern. Infants were exposed to a new fluent speech stream that did not have low TPs at word boundaries but still contained the novel /t/‐onset pattern. The procedure and test items were identical to Experiment 1.

Method

Participants. Twenty‐four 9.5‐month‐old monolingual English‐learning infants (mean age = 9.5 months, range = 9.1–10.0) participated in this experiment. Data from an additional 17 infants were excluded from the analyses because of parental interference (5), sleepiness (1), external noise (1), fussiness (7), and failure to contribute at least one trial for each item type in every block (3).

Stimuli. A fluent speech stream was created using the procedure and words (tohsigh, teemay, tiepu, tukee, tayla, and tafo) from Experiment 1. Again, all syllables in the language were measured and edited such that duration, pitch, and volume were equivalent for all syllables. Unlike Experiment 1, in which words were repeated in a random order, the words in this fluent speech stream were repeated in exactly the same order 40 times (teemaytieputukeetafotaylatohsighteemaytieputukeetafotaylatohsigh. . .; see [4] for another example of this method). This method generated a 2 min 7 s stream in which every pair of syllables had a TP of 1.0 and every other syllable in the language began with a /t/. The test items were identical to those in Experiments 1 and 2 (tiemay, tohla, fota, and keetu).

Procedure. The experimental procedure was the same as Experiment 1.

Results and Discussion

As in the previous experiments, we first ran a 2 (test item type: /t/‐initial vs. /t/‐medial) × 3 (test block: 1, 2, 3) repeated‐measures ANOVA. There was no significant effect of test item type, F(1, 23) = 0.07, p = .80, and the interaction with block was not significant, F(1, 23) = 0.27, p = .61 (see Table 1 and Figure 3). This pattern of results indicates that infants did not differentiate between /t/‐initial and /t/‐medial test items in the absence of the TP cue. We next ran a 2 (test item type: /t/‐initial vs. /t/‐medial) × 2 (group: Experiment 1, Experiment 3) repeated‐measures ANOVA contrasting the data from the first two test blocks of Experiments 1 and 3. This Test Item Type × Group interaction did not reach significance, F(1, 46) = 2.87, p = .20 (see Figure 2).

Though the between‐group analysis is inconclusive, infants' failure to discriminate the test items in Experiment 3 is consistent with the hypothesis that infants were influenced by the presence of TP cues in Experiment 1. Without the low TPs at word boundaries to anchor the alternating /t/ syllables to segment onsets, infants seem unable to extract the novel pattern.

General Discussion

The results of these experiments indicate that infants are able to use a language‐general regularity (dips in TPs at word boundaries) to discover a second, language‐specific regularity (/t/‐onsets). Moreover, infants generalized this newly learned cue to novel items, as demonstrated by their test performance. These findings suggest one possible class of solutions to the learning problem described earlier: How do infants discover relevant linguistic cues when there are many way to analyze speech and a single bit of information can be informative at multiple levels? Just as adults use correlations among cues to resolve ambiguities when using language, infants are able to use such correlations to acquire language, as suggested by the constraint satisfaction approach. Thus, infants' learning capacities, such as the ability to encode correlations across different types of information ([30]), seem well matched to properties of natural language. What seems initially to be an insurmountable barrier to learning—the fact that elements of language contribute to multiple levels of structure simultaneously—actually helps solve the language acquisition problem (see [11], for a similar example in word learning).

Research over the past decade has shown that infants are sensitive to many different patterns that can be informative for language learning. It remains unclear, however, how infants find individual cues and combine different types of information to understand the complex structure of their language. Behavioral research on multiple cue usage in this domain has typically taken the form of cue‐conflict studies, examining relative reliance on different types of information over time (e.g., [14]; [23]; [38]). Although studies using this approach have been quite revealing, they cannot address how infants may capitalize on the redundancies in natural speech or how cues are discovered. More recent research has focused on the use of multiple probabilistic patterns to categorize lexical items ([8]; [10]; [36]). Results from the present work demonstrate that the complexity of natural speech does not necessarily hinder language acquisition but in fact may facilitate learning. Paradoxically, complexity may help learning—as long as the complexity is consistent with the structure to be acquired ([26], [27]).

The process observed in the present work is reminiscent of bootstrapping: Partial information about one element of language provides evidence about another element, which in turn provides further evidence for the first element ([9]). On this view, the infant begins to pick up on TPs, facilitating discovery of the /t/‐onset cue, which in turn may further the consolidation of the TP cue. Both the TPs and the /t/‐onset cue provided evidence about the boundaries between words in the fluent speech stream. The /t/ cue is different, insofar as its discovery depended on some prior learning about TPs. We are not claiming that TPs or any specific regularity is necessary for this process to operate. Rather, any cues that infants can use to extract linguistic structure, and that overlap with other discoverable patterns, should be available for use in this fashion. This is an area where computational models of bootstrapping mechanisms would be informative. Existing models have typically built in different types of regularities, focusing on how combinations of given regularities can yield better learning outcomes than individual regularities ([3]; [35]). Models of how the cues themselves are identified, and the dependencies between them in learning, would be very timely.

This work shows that infants can use the overlapping nature of speech to isolate cues. Language‐general regularities, such as TPs, may support the discovery of language‐specific cues. Critically, the present research also expands the scope of statistical learning mechanisms. Not only can infants use such mechanisms to exploit the structure of their language, but they can also use statistical learning to discover the structure of their native language.

Appendix

Transcript of Speech Stream From Experiment 1

teemaytohsighteemaytukeetafotieputaylatohsightayla tukeeteemaytieputaylateemaytukeeteemaytohsightu keetafoteemaytukeetohsighteemaytohsighteemaytoh sightafoteemaytafotohsightukeetieputukeetieputafo teemaytohsighteemaytukeetafotieputaylatohsightayla tukeeteemaytieputaylateemaytukeeteemaytohsi tee maytohsighteemaytukeetafotieputaylatohsightaylatu keeteemaytieputaylateemaytukeeteemaytohsightukee tafoteemaytukeetohsighteemaytohsighteemaytohsight afoteemaytafotohsightukeetieputukeetieputafoteemay tohsighteemaytukeetafotieputaylatohsightaylatukee teemaytieputaylateemaytukeeteemaytohsi teemaytoh sighteemaytukeetafotieputaylatohsightaylatukeetee maytieputaylateemaytukeeteemaytohsightukeetafotee maytukeetohsighteemaytohsighteemaytohsightafotee maytafotohsightukeetieputukeetieputafoteemaytoh sighteemaytukeetafotieputaylatohsightaylatukeetee maytieputaylateemaytukeeteemaytohsi teemaytoh sighteemaytukeetafotieputaylatohsightaylatukeetee maytieputaylateemaytukeeteemaytohsightukeetafotee maytukeetohsighteemaytohsighteemaytohsightafotee maytafotohsightukeetieputukeetieputafoteemaytoh sighteemaytukeetafotieputaylatohsightaylatukeetee maytieputaylateemaytukeeteemaytohsi teemaytoh sighteemaytukeetafotieputaylatohsightaylatukeetee maytieputaylateemaytukeeteemaytohsightukeetafotee maytukeetohsi.

Footnotes 1 We thank Allison Dahlke, Catherine Moore, and Diana Dorovany for help with stimulus construction and subject running; Katherine Kortenkamp for consultation on statistical analyses; an anonymous referee for the suggestion of the chi‐square analysis in Experiment 1; and Jessica Hay, Stephen Laniel, Rachel Sussman, Erik Thiessen, and four anonymous referees for helpful comments on an earlier version of this manuscript. This work was supported by NIH Grant F31DC008737 to S.D.S., NIH Grant RO1HD37466 to J.R.S., and NICHD Core Grant P30HD03352 to the Waisman Center. References Bhatt, R. S., Wilk, A., Hill, D., & Rovee‐Collier, C. (2004). Correlated attributes and categorization in the first half‐year of life. Developmental Psychobiology, 44, 103 – 115. 2 Brent, M. R., & Cartwright, T. A. (1996). Distributional regularity and phonotactic constraints are useful for segmentation. Cognition, 6, 93 – 125. 3 Christiansen, M. H., Allen, J., & Seidenberg, M. S. (1998). Learning to segment speech using multiple cues: A connectionist model. Language and Cognitive Processes, 13, 221 – 268. 4 Curtin, S., Mintz, T. H., & Christiansen, M. H. (2005). Stress changes the representational landscape: Evidence from word segmentation. Cognition, 96, 233 – 262. 5 Cutler, A., & Carter, D. M. (1987). The predominance of strong initial syllables in the English vocabulary. Computer Speech and Language, 2, 133 – 142. 6 Endress, A. D., Scholl, B. J., & Mehler, J. (2005). The role of salience in the extraction of algebraic rules. Journal of Experimental Psychology: General, 134, 406 – 419. 7 Fantz, R. L. (1964). Visual experience in infants: Decreased attention to familiar patterns relative to novel ones. Science, 146, 668 – 670. 8 Gerken, L., Wilson, R., & Lewis, W. (2005). Infants can use distributional cues to form syntactic categories. Journal of Child Language, 32, 249 – 268. 9 Gleitman, L. R., & Wanner, E. (Eds.). (1982). Language acquisition: The state of the art. New York: Cambridge University Press. Gómez, R. L., & Lakusta, L. (2004). A first step in form‐based category abstraction by 12‐month‐old infants. Developmental Science, 7, 567 – 580. Hirsh‐Pasek, K., Golinkoff, R. M., Hennon, E., & Maguire, M. J. (2004). Hybrid theories at the frontier of developmental psychology: The emergentist coalition model of learning as a case in point. In D. G. Hall & S. R. Waxman (Eds.), Weaving a lexicon (pp. 173 – 204). Cambridge, MA: MIT Press. Hunter, M. A., Ames, E. W., & Koopman, R. (1983). Effects of stimulus complexity and familiarization time on infant preferences for novel and familiar stimuli. Developmental Psychology, 19, 338 – 352. Joanisse, M. F., & Seidenberg, M. S. (1999). Impairments in verb morphology following brain injury: A connectionist model. Proceedings of the National Academy of Sciences of the United States of America, 96, 7592 – 7597. Johnson, E. K., & Jusczyk, P. W. (2001). Word segmentation by 8‐month‐olds: When speech cues count more than statistics. Journal of Memory and Language, 44, 548 – 567. Jusczyk, P. W., Jusczyk, A. M., Kennedy, L. J., Schomberg, T., & Koenig, N. (1995). Young infants' retention of information about bisyllabic utterances. Journal of Experimental Psychology: Human Perception and Performance, 21, 822 – 836. Kemler Nelson, D. G., Jusczyk, P. W., Mandel, D. R., Myers, J., Turk, A., & Gerken, L. (1995). The head‐turn preference procedure for testing auditory perception. Infant Behavior & Development, 18, 111 – 116. Keppel, G. (1991). Design and analysis: A researcher's handbook (3rd ed.). Englewood Cliffs, NJ: Prentice Hall. Kirkham, N. Z., Slemmer, J. A., & Johnson, S. P. (2002). Visual statistical learning in infancy: Evidence for a domain general learning mechanism. Cognition, 83, B35 – B42. MacDonald, M. C., Pearlmutter, N. J., & Seidenberg, M. S. (1994). The lexical nature of syntactic ambiguity resolution. Psychological Review, 101, 676 – 703. Madole, K. L., Oakes, L. M., & Cohen, L. B. (1993). Developmental changes in infants' attention to function and form‐function correlations. Cognitive Development, 8, 189 – 209. Magnuson, J. S., Dixon, J. A., Tanenhaus, M. K., & Aslin, R. N. (2007). The dynamics of lexical competition during spoken word recognition. Cognitive Science, 31, 133 – 156. Marslen‐Wilson, W., & Zwitserlood, P. (1989). Accessing spoken words: The importance of word onsets. Journal of Experimental Psychology: Human Perception and Performance, 15, 576 – 585. Mattys, S. L., Jusczyk, P. W., Luce, P. A., & Morgan, J. L. (1999). Phonotactic and prosodic effects on word segmentation in infants. Cognitive Psychology, 38, 465 – 494. Mendoza, J. L., Toothaker, L. E., & Nicewander, W. A. (1974). A Monte Carlo comparison of the univariate and multivariate methods for the two‐way repeated measure design. Multivariate Behavioral Research, 9, 165 – 178. Mirkovic, J., MacDonald, M. C., & Seidenberg, M. S. (2005). Where does gender come from? Evidence from a complex inflectional system. Language and Cognitive Processes, 20, 139 – 167. Morgan, J. L., Meier, R. P., & Newport, E. L. (1987). Structural packaging in the input to language learning: Contributions of prosodic and morphological marking of phrases to the acquisition of language. Cognitive Psychology, 19, 498 – 550. Morgan, J. L., Meier, R. P., & Newport, E. L. (1989). Facilitating the acquisition of syntax with cross‐sentential cues to phrase structure. Journal of Memory and Language, 28, 360 – 374. Rakison, D. H. (2004). Infants' sensitivity to correlations between static and dynamic features in a category context. Journal of Experimental Child Psychology, 89, 1 – 30. Rogers, T. T., & McClelland, J. L. (2004). Semantic cognition: A parallel distributed processing approach. Cambridge, MA: MIT Press. Rose, S. A., & Ruff, H. A. (1987). Cross‐modal abilities in human infants. In J. D. Osofsky (Ed.), Handbook of infant development (2nd ed., pp. 318 – 362). Oxford, UK: Wiley. Saffran, J. R., Aslin, R. N., & Newport, E. L. (1996). Statistical learning by 8‐month‐old infants. Science, 274, 1926 – 1928. Saffran, J. R., & Sahni, S. D. (in press). Learning the sounds of language. In M. Joanisse, M. Spivey, & K. McCrae (Eds.), Cambridge handbook of psycholinguistics. Cambridge, UK: Cambridge University Press. Seidenberg, M. S. (1997). Language acquisition and use: Learning and applying probabilistic constraints. Science, 275, 1599 – 1603. Seidenberg, M. S., & MacDonald, M. C. (1999). A probabilistic constraints approach to language acquisition and processing. Cognitive Science, 23, 569 – 588. Shi, R., Morgan, J. L., & Allopenna, P. (1998). Phonological and acoustic bases for earliest grammatical category assignment: A cross‐linguistic perspective. Journal of Child Language, 25, 169 – 201. Shi, R., Werker, J. F., & Morgan, J. L. (1999). Newborn infants' sensitivity to perceptual cues to lexical and grammatical words. Cognition, 72, B11 – B21. Teinonen, T., Fellman, V., Naatanen, R., Aklu, P., & Huotilainen, M. (2009). Statistical language learning in neonates revealed by event‐related brain potentials. BMC Neuroscience, 10, 21. Thiessen, E. D., & Saffran, J. R. (2003). When cues collide: Use of stress and statistical cues to word boundaries by 7‐ to 9‐month‐old infants. Developmental Psychology, 39, 706 – 716. Yang, C. D. (2004). Universal grammar, statistics or both? Trends in Cognitive Sciences, 8, 451 – 456. Younger, B. A. (1992). Developmental change in infant categorization: The perception of correlations among facial features. Child Development, 63, 1526 – 1535. Younger, B. A., & Cohen, L. B. (1986). Developmental change in infants' perception of correlations among attributes. Child Development, 57, 803 – 815.

By Sarah D. Sahni; Mark S. Seidenberg and Jenny R. Saffran

Reported by Author; Author; Author

Titel:	Connecting Cues: Overlapping Regularities Support Cue Discovery in Infancy
Autor/in / Beteiligte Person:	SAHNI, Sarah D ; SEIDENBERG, Mark S ; SAFFRAN, Jenny R
Link:	Volltext (PDF) View record from FRANCIS Archive
Zeitschrift:	Child development, Jg. 81 (2010), Heft 3, S. 727-736
Veröffentlichung:	Malden, MA: Wiley-Blackwell, 2010
Medientyp:	academicJournal
Umfang:	print; 10; 1 p.1/2
ISSN:	0009-3920 (print)
Schlagwort:	Cognition Cognición Homme Human Hombre Langage Language Lenguaje Développement cognitif Cognitive development Desarrolo cognitivo Développement verbal Language development Desarrollo verbal Etude expérimentale Experimental study Estudio experimental Nourrisson Infant Lactante Parole Speech Habla Perception verbale Verbal perception Percepción verbal Segmentation Segmentación Sciences biologiques et medicales Biological and medical sciences Sciences biologiques fondamentales et appliquees. Psychologie Fundamental and applied biological sciences. Psychology Psychologie. Psychophysiologie Psychology. Psychophysiology Psychologie du développement Developmental psychology Développement de l'enfant Child development Nouveau-né. Nourrisson Newborn. Infant Psychologie. Psychanalyse. Psychiatrie Psychology. Psychoanalysis. Psychiatry Pediatrics Pédiatrie Psychology, psychopathology, psychiatry Psychologie, psychopathologie, psychiatrie
Sonstiges:	Nachgewiesen in: FRANCIS Archive Sprachen: English Original Material: INIST-CNRS Document Type: Article File Description: text Language: English Author Affiliations: University of Wisconsin-Madison, United States Rights: Copyright 2015 INIST-CNRS ; CC BY 4.0 ; Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS

Klicken Sie ein Format an und speichern Sie dann die Daten oder geben Sie eine Empfänger-Adresse ein und lassen Sie sich per Email zusenden.

BibTeX Citavi, JabRef, u.a.
(Literaturverwaltung)

PDF kein Volltext!
(Merkzettel, Notizen)

RIS Endnote, Citavi u.a.
(Literaturverwaltung)

MODS
(XML zur Weiterverarbeitung)

oder

Wählen Sie das für Sie passende Zitationsformat und kopieren Sie es dann in die Zwischenablage, lassen es sich per Mail zusenden oder speichern es als PDF-Datei.

Gewünschter Zitations-Stil:

oder

Bitte prüfen Sie, ob die Zitation formal korrekt ist, bevor Sie sie in einer Arbeit verwenden. Benutzen Sie gegebenenfalls den "Exportieren"-Dialog, wenn Sie ein Literaturverwaltungsprogramm verwenden und die Zitat-Angaben selbst formatieren wollen.