Usha Goswami | University of Cambridge (original) (raw)

Papers by Usha Goswami

Research paper thumbnail of Novel word learning deficits in infants at family risk for dyslexia

Research paper thumbnail of The Temporal Modulation Structure of Infant-Directed Speech

Research paper thumbnail of Delayed development of phonological constancy in toddlers at family risk for dyslexia

Infant Behavior & Development, Nov 1, 2019

Research paper thumbnail of Infant‐directed speech to infants at risk for dyslexia: A novel cross‐dyad design

Research paper thumbnail of The Role of Paired Associate Learning in Acquiring Letter-Sound Correspondences: A Longitudinal Study of Children at Family Risk for Dyslexia

Scientific Studies of Reading, Dec 8, 2020

Research paper thumbnail of Atypical beta-band effects in children with dyslexia in response to rhythmic audio-visual speech

Children with dyslexia are known to show impairments in perceiving speech rhythm, which impact th... more Children with dyslexia are known to show impairments in perceiving speech rhythm, which impact their phonological development. Neural rhythmic speech studies have reported atypical delta phase in children with dyslexia, but beta band effects have not yet been studied. It is known that delta phase modulates the amplitude of the beta band response during rhythmic tasks via delta-beta phase-amplitude coupling (PAC). Accordingly, the atypical delta band effects reported for children with dyslexia may imply related atypical beta band effects. Here we analyse EEG data collected during a rhythmic speech paradigm from 51 children (21 typically-developing; 30 with dyslexia) who attended to a talking head repeating "ba" at 2Hz. Phase entrainment in the beta band, angular velocity in the beta band, power responses in the beta band and delta-beta PAC were assessed for each child and each group. Phase entrainment in the beta band was only significant for children without dyslexia. Chil...

Research paper thumbnail of Decoding speech information from EEG data with 4-, 7- and 11-month-old infants: Using convolutional neural network, mutual information-based and backward linear models

Background: Computational models that successfully decode neural activity into speech are multipl... more Background: Computational models that successfully decode neural activity into speech are multiplying in the adult literature, with convolutional neural networks (CNNs), backward linear models, and mutual information (MI) models all being applied to neural data in relation to speech input. This is not the case in the infant literature. New Method: Three different computational models, two novel for infants, were applied to decode low-frequency speech envelope information. Previously-employed backward linear models were compared to novel CNN and MI-based models. Fifty infants provided EEG recordings when aged 4, 7, and 11 months, while listening passively to natural speech (sung or chanted nursery rhymes) presented by video with a female singer. Results: Each model computed speech information for these nursery rhymes in two different low-frequency bands, delta (1 – 4 Hz) and theta (4 – 8 Hz), thought to provide different types of linguistic information. All three models demonstrated ...

Research paper thumbnail of Through a glass darkly: Research biases that result from wearing ‘literate glasses’

Annee Psychologique, 2018

Research paper thumbnail of Cortical tracking of visual rhythmic speech by 5- and 8-month-old infants: Individual differences in phase angle relate to language outcomes up to 2 years

It is known that the rhythms of speech are visible on the face, accurately mirroring changes in t... more It is known that the rhythms of speech are visible on the face, accurately mirroring changes in the vocal tract. These low-frequency visual temporal movements are tightly correlated with speech output, and both visual speech (for example, mouth motion) and the acoustic speech amplitude envelope entrain neural oscillations. Low-frequency visual temporal information (‘visual prosody’) is known from behavioural studies to be perceived by infants, but oscillatory studies are currently lacking. Here we measure cortical tracking of low-frequency visual temporal information by five- and eight-month-old infants using a rhythmic speech paradigm (repetition of the syllable “ta” at 2 Hz). Eye-tracking data was collected simultaneously with EEG, enabling computation of cortical tracking and phase angle during visual-only speech presentation. Significantly higher power at the stimulus frequency indicated that cortical tracking occurred across both ages. Further, individual differences in preferr...

Research paper thumbnail of Infant low-frequency EEG cortical power, cortical tracking and phase-amplitude coupling predicts language a year later

Cortical signals have been shown to track acoustic and linguistic properties of continual speech.... more Cortical signals have been shown to track acoustic and linguistic properties of continual speech. This phenomenon has been measured across the lifespan, reflecting speech understanding as well as cognitive functions such as attention and prediction. Furthermore, atypical low-frequency cortical tracking of speech is found in children with phonological difficulties (developmental dyslexia). Accordingly, low-frequency cortical signals, especially in the delta and theta ranges, may play a critical role in language acquisition. A recent investigation Attaheri et al., 2022 (1) probed cortical tracking mechanisms in infants aged 4, 7 and 11 months as they listened to sung speech. Results from temporal response functions (TRF), phase-amplitude coupling (PAC) and dynamic theta-delta power (PSD) analyses indicated speech envelope tracking and stimulus related power (PSD) via the delta & theta neural signals. Furthermore, delta and theta driven PAC was found at all ages with gamma amplitudes d...

Research paper thumbnail of Atypical speech production of multisyllabic words and phrases by children with developmental dyslexia

The prevalent ‘core phonological deficit’ model of dyslexia proposes that the reading and spellin... more The prevalent ‘core phonological deficit’ model of dyslexia proposes that the reading and spelling difficulties characterizing affected children stem from prior developmental difficulties in processing speech sound structure, for example perceiving and identifying syllable stress patterns, syllables, rhymes and phonemes. Yet spoken word production appears normal. This suggests an unexpected disconnect between speech input and speech output processes. Here we investigated the output side of this disconnect from a speech rhythm perspective by measuring the speech amplitude envelope (AE) of multisyllabic spoken phrases. The speech AE contains crucial information regarding stress patterns, speech rate, tonal contrasts and intonational information. We created a novel computerized speech copying task in which participants copied aloud familiar spoken targets like “Aladdin”. Seventy-five children with and without dyslexia were tested, some of whom were also receiving an oral intervention d...

Research paper thumbnail of Atypical cortical encoding of speech identifies children with Dyslexia versus Developmental Language Disorder

Slow cortical oscillations play a crucial role in processing the speech envelope, which is percei... more Slow cortical oscillations play a crucial role in processing the speech envelope, which is perceived atypically by children with Developmental Language Disorder (DLD) and developmental dyslexia. Here we use electroencephalography (EEG) and natural speech listening paradigms to identify neural processing patterns that characterize dyslexic versus DLD children. Using a story listening paradigm, we show that atypical power dynamics and phase-amplitude coupling between delta and theta oscillations characterize dyslexic and DLD children groups, respectively. We further identify EEG common spatial patterns (CSP) during speech listening across delta, theta and beta oscillations describing dyslexic versus DLD children. A linear classifier using four deltaband CSP variables predicted dyslexia status (0.77 AUC). Crucially, these spatial patterns also identified children with dyslexia in a rhythmic syllable task EEG, suggesting a core developmental deficit in neural processing of speech rhythm...

Research paper thumbnail of Emergence of the cortical encoding of phonetic features in the first year of life

Even prior to producing their first words, infants are developing a sophisticated speech processi... more Even prior to producing their first words, infants are developing a sophisticated speech processing system, with robust word recognition present by 4-6 months of age. These emergent linguistic skills, observed with behavioural investigations, are likely to rely on increasingly sophisticated neural underpinnings. The infant brain is known to robustly track the speech envelope, however to date no cortical tracking study could investigate the emergence of phonetic feature encoding. Here we utilise temporal response functions computed from electrophysiological responses to nursery rhymes to investigate the cortical encoding of phonetic features in a longitudinal cohort of infants when aged 4, 7 and 11 months, as well as adults. The analyses reveal an increasingly detailed and acoustically-invariant phonetic encoding over the first year of life, providing the first direct evidence that the pre-verbal human cortex learns phonetic categories. By 11 months of age, however, infants still did...

Research paper thumbnail of Decoding of Speech Information using EEG in Children with Dyslexia: Less Accurate Low-Frequency Representations of Speech, Not “Noisy” Representations

The amplitude envelope of speech carries crucial low-frequency acoustic information that assists ... more The amplitude envelope of speech carries crucial low-frequency acoustic information that assists linguistic decoding. The sensory-neural Temporal Sampling (TS) theory of developmental dyslexia proposes atypical encoding of speech envelope information <10 Hz, leading to atypical phonological representations. Here a backward linear TRF model and story listening were employed to estimate the speech information encoded in the electroencephalogram in the canonical delta, theta and alpha bands by 9-year-old children with and without dyslexia. TRF decoding accuracy provided an estimate of how faithfully the children’s brains encoded low-frequency envelope information. Between-group analyses showed that the children with dyslexia exhibited impaired reconstruction of speech information in the delta band. However, when the quality of speech encoding for each child was estimated using child-by-child decoding models, then the dyslexic children did not differ from controls. This suggests that...

Research paper thumbnail of Neural phase angle from two months when tracking speech and non-speech rhythm linked to language performance from 12 to 24 months

Atypical phase alignment of low-frequency neural oscillations to speech rhythm has been implicate... more Atypical phase alignment of low-frequency neural oscillations to speech rhythm has been implicated in phonological deficits in developmental dyslexia. Atypical phase alignment to rhythm could thus also characterize infants at risk for later language difficulties. Here, we investigate phase-language mechanisms in a neurotypical infant sample. 122 two-, six- and nine-month-old infants were played speech and non-speech rhythms while EEG was recorded in a longitudinal design. The phase of infants’ neural oscillations aligned consistently to the stimuli, with group-level convergence towards a common phase. Individual low-frequency phase alignment related to subsequent measures of language acquisition up to 24 months of age. Accordingly, individual differences in language acquisition are related to the phase alignment of cortical tracking of auditory and audiovisual rhythms in infancy, an automatic neural mechanism. Automatic rhythmic phase-language mechanisms could eventually serve as bi...

Research paper thumbnail of Hierarchical amplitude modulation structures and rhythm patterns: Comparing Western musical genres, song, and nature sounds to Babytalk

PLOS ONE

Statistical learning of physical stimulus characteristics is important for the development of cog... more Statistical learning of physical stimulus characteristics is important for the development of cognitive systems like language and music. Rhythm patterns are a core component of both systems, and rhythm is key to language acquisition by infants. Accordingly, the physical stimulus characteristics that yield speech rhythm in “Babytalk” may also describe the hierarchical rhythmic relationships that characterize human music and song. Computational modelling of the amplitude envelope of “Babytalk” (infant-directed speech, IDS) using a demodulation approach (Spectral-Amplitude Modulation Phase Hierarchy model, S-AMPH) can describe these characteristics. S-AMPH modelling of Babytalk has shown previously that bands of amplitude modulations (AMs) at different temporal rates and their phase relations help to create its structured inherent rhythms. Additionally, S-AMPH modelling of children’s nursery rhymes shows that different rhythm patterns (trochaic, iambic, dactylic) depend on the phase re...

Research paper thumbnail of Local Temporal Regularities in Child-Directed Speech in Spanish

Journal of Speech Language and Hearing Research, Oct 17, 2022

Purpose: To characterize the local (utterance-level) temporal regularities of child-directed spee... more Purpose: To characterize the local (utterance-level) temporal regularities of child-directed speech (CDS) that might facilitate phonological development in Spanish, classically termed a syllable-timed language. Method: 18 female adults addressed their 4-year-old children versus other adults spontaneously and also read aloud (CDS versus ADS). We compared CDS and ADS speech productions using a spectrotemporal model (S-AMPH, Leong & Goswami, 2015), obtaining three temporal metrics: 1) distribution of modulation energy, 2) temporal regularity of stressed syllables, and 3) syllable rate. Results: CDS was characterized by 1) significantly greater modulation energy in the lower frequencies (0.5-4 Hz), 2) more regular rhythmic occurrence of stressed syllables, and 3) a slower syllable rate than ADS, across both spontaneous and read conditions. Discussion: CDS is characterized by a robust local temporal organization (i.e., within utterances) with amplitude modulation bands aligning with delta and theta electrophysiological frequency bands respectively showing greater phase synchronization than in ADS, facilitating parsing of stress units and syllables. These temporal regularities, together with the slower rate of production of CDS, might support 1 the automatic extraction of phonological units in speech and hence support the phonological development of children.

Research paper thumbnail of Cortical Oscillations in Pre-verbal Infants Track Rhythmic Speech and Non-speech Stimuli

The foundations for language acquisition are laid in infancy. A key feature of infant-directed sp... more The foundations for language acquisition are laid in infancy. A key feature of infant-directed speech (IDS) is that the slowest modulations of its amplitude envelope (~2 Hz) contain more energy than in adult-directed speech. These slow modulations may provide a cross-language rhythmic scaffold for the neural tracking of speech in infancy. To investigate relations between early neural processing of speech and language acquisition in English, the BabyRhythm project followed 113 infants during infancy and toddlerhood. The neural predictor of language development reported here was the cortical tracking of slow, rhythmic audiovisual stimuli, processing of which is known to differ in older children with dyslexia. To find out how such stimuli are tracked early in development, infants were presented with videos of a woman repeating the syllable “Ta” twice per second, and a ball bouncing on a drum to create a 2Hz beat. At the ages of six and nine months, infants exhibited a significant peak ...

Research paper thumbnail of Phonological Skills and Learning to Read. Psychology Press & Routledge Classic Editions

Research paper thumbnail of Difficulties in auditory organization as a cause of reading backwardness? An auditory neuroscience perspective

Developmental Science, 2016

Over 30 years ago, it was suggested that difficulties in the 'auditory organization' of word form... more Over 30 years ago, it was suggested that difficulties in the 'auditory organization' of word forms in the mental lexicon might cause reading difficulties. It was proposed that children used parameters such as rhyme and alliteration to organize word forms in the mental lexicon by acoustic similarity, and that such organization was impaired in developmental dyslexia. This literature was based on an 'oddity' measure of children's sensitivity to rhyme (e.g. wood, book, good) and alliteration (e.g. sun, sock, rag). The 'oddity' task revealed that children with dyslexia were significantly poorer at identifying the 'odd word out' than younger children without reading difficulties. Here we apply a novel modelling approach drawn from auditory neuroscience to study the possible sensory basis of the auditory organization of rhyming and non-rhyming words by children. We utilize a novel Spectral-Amplitude Modulation Phase Hierarchy (S-AMPH) approach to analysing the spectro-temporal structure of rhyming and non-rhyming words, aiming to illuminate the potential acoustic cues used by children as a basis for phonological organization. The S-AMPH model assumes that speech encoding depends on neuronal oscillatory entrainment to the amplitude modulation (AM) hierarchy in speech. Our results suggest that phonological similarity between rhyming words in the oddity task depends crucially on slow (delta band) modulations in the speech envelope. Contrary to linguistic assumptions, therefore, auditory organization by children may not depend on phonemic information for this task. Linguistically, it is assumed that 'book' does not rhyme with 'wood' and 'good' because the final phoneme differs. However, our auditory analysis suggests that the acoustic cues to this phonological dissimilarity depend primarily on the slower amplitude modulations in the speech envelope, thought to carry prosodic information. Therefore, the oddity task may help in detecting reading difficulties because phonological similarity judgements about rhyme reflect sensitivity to slow amplitude modulation patterns. Slower amplitude modulations are known to be detected less efficiently by children with dyslexia. Research highlights • We apply a novel model of speech encoding based on the neuronal oscillatory hierarchy to the rhyme oddity task. • We show that children's rhyme judgements depend primarily on auditory sensitivity to relatively slow amplitude envelope information.

Research paper thumbnail of Novel word learning deficits in infants at family risk for dyslexia

Research paper thumbnail of The Temporal Modulation Structure of Infant-Directed Speech

Research paper thumbnail of Delayed development of phonological constancy in toddlers at family risk for dyslexia

Infant Behavior & Development, Nov 1, 2019

Research paper thumbnail of Infant‐directed speech to infants at risk for dyslexia: A novel cross‐dyad design

Research paper thumbnail of The Role of Paired Associate Learning in Acquiring Letter-Sound Correspondences: A Longitudinal Study of Children at Family Risk for Dyslexia

Scientific Studies of Reading, Dec 8, 2020

Research paper thumbnail of Atypical beta-band effects in children with dyslexia in response to rhythmic audio-visual speech

Children with dyslexia are known to show impairments in perceiving speech rhythm, which impact th... more Children with dyslexia are known to show impairments in perceiving speech rhythm, which impact their phonological development. Neural rhythmic speech studies have reported atypical delta phase in children with dyslexia, but beta band effects have not yet been studied. It is known that delta phase modulates the amplitude of the beta band response during rhythmic tasks via delta-beta phase-amplitude coupling (PAC). Accordingly, the atypical delta band effects reported for children with dyslexia may imply related atypical beta band effects. Here we analyse EEG data collected during a rhythmic speech paradigm from 51 children (21 typically-developing; 30 with dyslexia) who attended to a talking head repeating "ba" at 2Hz. Phase entrainment in the beta band, angular velocity in the beta band, power responses in the beta band and delta-beta PAC were assessed for each child and each group. Phase entrainment in the beta band was only significant for children without dyslexia. Chil...

Research paper thumbnail of Decoding speech information from EEG data with 4-, 7- and 11-month-old infants: Using convolutional neural network, mutual information-based and backward linear models

Background: Computational models that successfully decode neural activity into speech are multipl... more Background: Computational models that successfully decode neural activity into speech are multiplying in the adult literature, with convolutional neural networks (CNNs), backward linear models, and mutual information (MI) models all being applied to neural data in relation to speech input. This is not the case in the infant literature. New Method: Three different computational models, two novel for infants, were applied to decode low-frequency speech envelope information. Previously-employed backward linear models were compared to novel CNN and MI-based models. Fifty infants provided EEG recordings when aged 4, 7, and 11 months, while listening passively to natural speech (sung or chanted nursery rhymes) presented by video with a female singer. Results: Each model computed speech information for these nursery rhymes in two different low-frequency bands, delta (1 – 4 Hz) and theta (4 – 8 Hz), thought to provide different types of linguistic information. All three models demonstrated ...

Research paper thumbnail of Through a glass darkly: Research biases that result from wearing ‘literate glasses’

Annee Psychologique, 2018

Research paper thumbnail of Cortical tracking of visual rhythmic speech by 5- and 8-month-old infants: Individual differences in phase angle relate to language outcomes up to 2 years

It is known that the rhythms of speech are visible on the face, accurately mirroring changes in t... more It is known that the rhythms of speech are visible on the face, accurately mirroring changes in the vocal tract. These low-frequency visual temporal movements are tightly correlated with speech output, and both visual speech (for example, mouth motion) and the acoustic speech amplitude envelope entrain neural oscillations. Low-frequency visual temporal information (‘visual prosody’) is known from behavioural studies to be perceived by infants, but oscillatory studies are currently lacking. Here we measure cortical tracking of low-frequency visual temporal information by five- and eight-month-old infants using a rhythmic speech paradigm (repetition of the syllable “ta” at 2 Hz). Eye-tracking data was collected simultaneously with EEG, enabling computation of cortical tracking and phase angle during visual-only speech presentation. Significantly higher power at the stimulus frequency indicated that cortical tracking occurred across both ages. Further, individual differences in preferr...

Research paper thumbnail of Infant low-frequency EEG cortical power, cortical tracking and phase-amplitude coupling predicts language a year later

Cortical signals have been shown to track acoustic and linguistic properties of continual speech.... more Cortical signals have been shown to track acoustic and linguistic properties of continual speech. This phenomenon has been measured across the lifespan, reflecting speech understanding as well as cognitive functions such as attention and prediction. Furthermore, atypical low-frequency cortical tracking of speech is found in children with phonological difficulties (developmental dyslexia). Accordingly, low-frequency cortical signals, especially in the delta and theta ranges, may play a critical role in language acquisition. A recent investigation Attaheri et al., 2022 (1) probed cortical tracking mechanisms in infants aged 4, 7 and 11 months as they listened to sung speech. Results from temporal response functions (TRF), phase-amplitude coupling (PAC) and dynamic theta-delta power (PSD) analyses indicated speech envelope tracking and stimulus related power (PSD) via the delta & theta neural signals. Furthermore, delta and theta driven PAC was found at all ages with gamma amplitudes d...

Research paper thumbnail of Atypical speech production of multisyllabic words and phrases by children with developmental dyslexia

The prevalent ‘core phonological deficit’ model of dyslexia proposes that the reading and spellin... more The prevalent ‘core phonological deficit’ model of dyslexia proposes that the reading and spelling difficulties characterizing affected children stem from prior developmental difficulties in processing speech sound structure, for example perceiving and identifying syllable stress patterns, syllables, rhymes and phonemes. Yet spoken word production appears normal. This suggests an unexpected disconnect between speech input and speech output processes. Here we investigated the output side of this disconnect from a speech rhythm perspective by measuring the speech amplitude envelope (AE) of multisyllabic spoken phrases. The speech AE contains crucial information regarding stress patterns, speech rate, tonal contrasts and intonational information. We created a novel computerized speech copying task in which participants copied aloud familiar spoken targets like “Aladdin”. Seventy-five children with and without dyslexia were tested, some of whom were also receiving an oral intervention d...

Research paper thumbnail of Atypical cortical encoding of speech identifies children with Dyslexia versus Developmental Language Disorder

Slow cortical oscillations play a crucial role in processing the speech envelope, which is percei... more Slow cortical oscillations play a crucial role in processing the speech envelope, which is perceived atypically by children with Developmental Language Disorder (DLD) and developmental dyslexia. Here we use electroencephalography (EEG) and natural speech listening paradigms to identify neural processing patterns that characterize dyslexic versus DLD children. Using a story listening paradigm, we show that atypical power dynamics and phase-amplitude coupling between delta and theta oscillations characterize dyslexic and DLD children groups, respectively. We further identify EEG common spatial patterns (CSP) during speech listening across delta, theta and beta oscillations describing dyslexic versus DLD children. A linear classifier using four deltaband CSP variables predicted dyslexia status (0.77 AUC). Crucially, these spatial patterns also identified children with dyslexia in a rhythmic syllable task EEG, suggesting a core developmental deficit in neural processing of speech rhythm...

Research paper thumbnail of Emergence of the cortical encoding of phonetic features in the first year of life

Even prior to producing their first words, infants are developing a sophisticated speech processi... more Even prior to producing their first words, infants are developing a sophisticated speech processing system, with robust word recognition present by 4-6 months of age. These emergent linguistic skills, observed with behavioural investigations, are likely to rely on increasingly sophisticated neural underpinnings. The infant brain is known to robustly track the speech envelope, however to date no cortical tracking study could investigate the emergence of phonetic feature encoding. Here we utilise temporal response functions computed from electrophysiological responses to nursery rhymes to investigate the cortical encoding of phonetic features in a longitudinal cohort of infants when aged 4, 7 and 11 months, as well as adults. The analyses reveal an increasingly detailed and acoustically-invariant phonetic encoding over the first year of life, providing the first direct evidence that the pre-verbal human cortex learns phonetic categories. By 11 months of age, however, infants still did...

Research paper thumbnail of Decoding of Speech Information using EEG in Children with Dyslexia: Less Accurate Low-Frequency Representations of Speech, Not “Noisy” Representations

The amplitude envelope of speech carries crucial low-frequency acoustic information that assists ... more The amplitude envelope of speech carries crucial low-frequency acoustic information that assists linguistic decoding. The sensory-neural Temporal Sampling (TS) theory of developmental dyslexia proposes atypical encoding of speech envelope information <10 Hz, leading to atypical phonological representations. Here a backward linear TRF model and story listening were employed to estimate the speech information encoded in the electroencephalogram in the canonical delta, theta and alpha bands by 9-year-old children with and without dyslexia. TRF decoding accuracy provided an estimate of how faithfully the children’s brains encoded low-frequency envelope information. Between-group analyses showed that the children with dyslexia exhibited impaired reconstruction of speech information in the delta band. However, when the quality of speech encoding for each child was estimated using child-by-child decoding models, then the dyslexic children did not differ from controls. This suggests that...

Research paper thumbnail of Neural phase angle from two months when tracking speech and non-speech rhythm linked to language performance from 12 to 24 months

Atypical phase alignment of low-frequency neural oscillations to speech rhythm has been implicate... more Atypical phase alignment of low-frequency neural oscillations to speech rhythm has been implicated in phonological deficits in developmental dyslexia. Atypical phase alignment to rhythm could thus also characterize infants at risk for later language difficulties. Here, we investigate phase-language mechanisms in a neurotypical infant sample. 122 two-, six- and nine-month-old infants were played speech and non-speech rhythms while EEG was recorded in a longitudinal design. The phase of infants’ neural oscillations aligned consistently to the stimuli, with group-level convergence towards a common phase. Individual low-frequency phase alignment related to subsequent measures of language acquisition up to 24 months of age. Accordingly, individual differences in language acquisition are related to the phase alignment of cortical tracking of auditory and audiovisual rhythms in infancy, an automatic neural mechanism. Automatic rhythmic phase-language mechanisms could eventually serve as bi...

Research paper thumbnail of Hierarchical amplitude modulation structures and rhythm patterns: Comparing Western musical genres, song, and nature sounds to Babytalk

PLOS ONE

Statistical learning of physical stimulus characteristics is important for the development of cog... more Statistical learning of physical stimulus characteristics is important for the development of cognitive systems like language and music. Rhythm patterns are a core component of both systems, and rhythm is key to language acquisition by infants. Accordingly, the physical stimulus characteristics that yield speech rhythm in “Babytalk” may also describe the hierarchical rhythmic relationships that characterize human music and song. Computational modelling of the amplitude envelope of “Babytalk” (infant-directed speech, IDS) using a demodulation approach (Spectral-Amplitude Modulation Phase Hierarchy model, S-AMPH) can describe these characteristics. S-AMPH modelling of Babytalk has shown previously that bands of amplitude modulations (AMs) at different temporal rates and their phase relations help to create its structured inherent rhythms. Additionally, S-AMPH modelling of children’s nursery rhymes shows that different rhythm patterns (trochaic, iambic, dactylic) depend on the phase re...

Research paper thumbnail of Local Temporal Regularities in Child-Directed Speech in Spanish

Journal of Speech Language and Hearing Research, Oct 17, 2022

Purpose: To characterize the local (utterance-level) temporal regularities of child-directed spee... more Purpose: To characterize the local (utterance-level) temporal regularities of child-directed speech (CDS) that might facilitate phonological development in Spanish, classically termed a syllable-timed language. Method: 18 female adults addressed their 4-year-old children versus other adults spontaneously and also read aloud (CDS versus ADS). We compared CDS and ADS speech productions using a spectrotemporal model (S-AMPH, Leong & Goswami, 2015), obtaining three temporal metrics: 1) distribution of modulation energy, 2) temporal regularity of stressed syllables, and 3) syllable rate. Results: CDS was characterized by 1) significantly greater modulation energy in the lower frequencies (0.5-4 Hz), 2) more regular rhythmic occurrence of stressed syllables, and 3) a slower syllable rate than ADS, across both spontaneous and read conditions. Discussion: CDS is characterized by a robust local temporal organization (i.e., within utterances) with amplitude modulation bands aligning with delta and theta electrophysiological frequency bands respectively showing greater phase synchronization than in ADS, facilitating parsing of stress units and syllables. These temporal regularities, together with the slower rate of production of CDS, might support 1 the automatic extraction of phonological units in speech and hence support the phonological development of children.

Research paper thumbnail of Cortical Oscillations in Pre-verbal Infants Track Rhythmic Speech and Non-speech Stimuli

The foundations for language acquisition are laid in infancy. A key feature of infant-directed sp... more The foundations for language acquisition are laid in infancy. A key feature of infant-directed speech (IDS) is that the slowest modulations of its amplitude envelope (~2 Hz) contain more energy than in adult-directed speech. These slow modulations may provide a cross-language rhythmic scaffold for the neural tracking of speech in infancy. To investigate relations between early neural processing of speech and language acquisition in English, the BabyRhythm project followed 113 infants during infancy and toddlerhood. The neural predictor of language development reported here was the cortical tracking of slow, rhythmic audiovisual stimuli, processing of which is known to differ in older children with dyslexia. To find out how such stimuli are tracked early in development, infants were presented with videos of a woman repeating the syllable “Ta” twice per second, and a ball bouncing on a drum to create a 2Hz beat. At the ages of six and nine months, infants exhibited a significant peak ...

Research paper thumbnail of Phonological Skills and Learning to Read. Psychology Press & Routledge Classic Editions

Research paper thumbnail of Difficulties in auditory organization as a cause of reading backwardness? An auditory neuroscience perspective

Developmental Science, 2016

Over 30 years ago, it was suggested that difficulties in the 'auditory organization' of word form... more Over 30 years ago, it was suggested that difficulties in the 'auditory organization' of word forms in the mental lexicon might cause reading difficulties. It was proposed that children used parameters such as rhyme and alliteration to organize word forms in the mental lexicon by acoustic similarity, and that such organization was impaired in developmental dyslexia. This literature was based on an 'oddity' measure of children's sensitivity to rhyme (e.g. wood, book, good) and alliteration (e.g. sun, sock, rag). The 'oddity' task revealed that children with dyslexia were significantly poorer at identifying the 'odd word out' than younger children without reading difficulties. Here we apply a novel modelling approach drawn from auditory neuroscience to study the possible sensory basis of the auditory organization of rhyming and non-rhyming words by children. We utilize a novel Spectral-Amplitude Modulation Phase Hierarchy (S-AMPH) approach to analysing the spectro-temporal structure of rhyming and non-rhyming words, aiming to illuminate the potential acoustic cues used by children as a basis for phonological organization. The S-AMPH model assumes that speech encoding depends on neuronal oscillatory entrainment to the amplitude modulation (AM) hierarchy in speech. Our results suggest that phonological similarity between rhyming words in the oddity task depends crucially on slow (delta band) modulations in the speech envelope. Contrary to linguistic assumptions, therefore, auditory organization by children may not depend on phonemic information for this task. Linguistically, it is assumed that 'book' does not rhyme with 'wood' and 'good' because the final phoneme differs. However, our auditory analysis suggests that the acoustic cues to this phonological dissimilarity depend primarily on the slower amplitude modulations in the speech envelope, thought to carry prosodic information. Therefore, the oddity task may help in detecting reading difficulties because phonological similarity judgements about rhyme reflect sensitivity to slow amplitude modulation patterns. Slower amplitude modulations are known to be detected less efficiently by children with dyslexia. Research highlights • We apply a novel model of speech encoding based on the neuronal oscillatory hierarchy to the rhyme oddity task. • We show that children's rhyme judgements depend primarily on auditory sensitivity to relatively slow amplitude envelope information.