Search in:

Clinical Linguistics & Phonetics Volume 32, 2018 - Issue 10

Submit an article Journal homepage

Open access

1,165

Views

CrossRef citations to date

Altmetric

Listen

Note

A software program to assist coding of prelinguistic vocalizations in real time

Elisabeth WilladsenDepartment of Nordic Studies and Linguistics, University of Copenhagen, DenmarkCorrespondence[email protected]
View further author information

Christina PerssonSahlgrenska Academy, University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, SwedenView further author information

Duncan AppelbeClinical Trials Research Centre, University of Liverpool, United KingdomView further author information

Pages 972-978 | Received 13 Feb 2018, Accepted 06 Apr 2018, Published online: 18 Jun 2018

Cite this article
https://doi.org/10.1080/02699206.2018.1463396
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF

ABSTRACT

Numerous studies have confirmed that prelinguistic utterances are precursors to speech, and there is ample evidence that, for example, frequency of canonical syllables and syllable inventory size correlate with speech and language measures at older ages.

Traditionally, prelinguistic utterances have been assessed by phonetic transcription which is difficult and time-consuming in infants. Recently, a more time-efficient methodology to assess prelinguistic utterances in real time, called naturalistic listening, was developed (Ramsdell et al., 2012). In a large international NIDCR-funded randomized controlled trial, Timing of Primary Surgery for with Cleft Palate (TOPS), including many coders, a software program (TimeStamper) was developed to assist in coding of prelinguistic vocalizations in real time, to ensure consistency of the coding procedures. Coders upload a video (or audio) file and watch and listen to the recording in real time without any possibility of pausing or taking notes. In real time, the coder registers each speech-like syllable as canonical or non-canonical. TimeStamper automatically calculates the percentage of canonical syllables of all syllables registered (canonical babbling ratio). At the end of a recording, TimeStamper assists in assessing presence/absence of canonical babbling and syllable inventory size. The software is presented and instructions for free access are provided.

KEYWORDS:

Prelinguistic vocalizations
canonical babbling
syllable inventory size
real time listening
TimeStamper

It is well established that prelinguistic vocalizations are precursors to speech. The appearance of adult-like well-formed syllables, canonical syllables, is an important developmental milestone in the first year of life. In typically developing children, it appears no later than at 10 months of age (e.g. Oller, Citation1980; Stark, Citation1980). Onset of canonical babbling is a robust milestone and is achieved even in populations regarded as at risk for speech and language disorders due to premature birth, low socioeconomic status and low educational levels (Eilers et al., Citation1993; Oller, Eilers, Neal, & Cobo-Lewis, Citation1998; Oller, Eilers, Neal, & Schwartz, Citation1999). From onset of canonical babbling, the amount of canonical syllables grows and infants’ babbling becomes increasingly complex in structure and resemblance with meaningful speech, as it takes on intonation and stress patterns of mature speech during the first year of life. Thereby, it is possible to assess speech development even before a child says her/his first word.

Amount and diversity of canonical syllables at age 1 have been found to correlate with different speech and language measures at older ages in typically developing children. For instance, 12-month olds with larger syllable inventories produced 50 different words earlier than children with smaller inventories (Menyuk, Liebergott, & Schultz, Citation1986), and correlations have been reported between larger consonant inventories at age 1 and better articulation skills up to 3 years of age (Menyuk et al., Citation1986; Vihman & Greenlee, Citation1987).

Pronounced delays in onset of canonical babbling have consistently been reported in severely hearing impaired infants (e.g. Eilers and Oller, Citation1994; Schauwers, Gillis, Daemers, De Beukelaer, & Govaerts, Citation2004; Stoel-Gammon & Otomo, Citation1986) and in children with William’s syndrome (Masataka, Citation2001). Patten et al. (Citation2014) studied children later diagnosed with autism spectrum disorder and reported lower canonical babbling ratios (CBR) in the study group as compared to a control group. Also, studies of children with an unrepaired cleft palate reported a risk for delay in canonical babbling (Chapman, Hardin-Jones, Schulte, & Halter, Citation2001; Hardin-Jones, Chapman, & Schulte, Citation2003).

Studies of prelinguistic utterances have almost exclusively used phonetic transcription as a tool. However, this is very time and resource consuming and therefore difficult to use in studies with many participants. Phonetic transcription of prelinguistic vocalizations has also consistently been reported to cause difficulty with agreement between transcribers, especially for non-canonical utterances (Ramsdell, Kimbrough Oller, & Ethington, Citation2007). Moreover, Ramsdell, Oller, Buder, Ethington, and Chorna (Citation2012) showed that phonetic transcription grossly overestimates parents’ report of their infants’ syllable inventory. Since parents are typically the ones to negotiate word meaning with their child as his/her speech and language develops, overestimation of a child’s productive phonetic inventory may mean any predictions taken from phonetic transcription could be misleading or inaccurate. However, underestimation similarly misrepresents the child’s abilities with attendant implications for analyses and clinical decisions and there are many circumstances where more information may be needed. Thus, users should carefully consider the purpose of their investigation when selecting the assessment method.

We adjusted and tested naturalistic listening in real time from Ramsdell et al. (Citation2012) in a clinical setting using student listeners without experience in infant coding. In naturalistic listening in real time, the coder listens to a sample of recorded speech and assesses a child’s syllable productions continuously without pausing the recording and without taking notes. While listening, canonical and non-canonical syllables are identified. At the end of a recording, the coder lists the syllables she/he found the child produced with control from memory and makes a judgement on presence or absence of canonical babbling.

The students learned the method easily and showed very good inter-rater reliability for syllable inventory in typically developing children and in children with cleft palate (Willadsen et al., Citation2017). This methodology was then adopted in a large international RCT, Timing of Primary Surgery for with Cleft Palate (TOPS, https://clinicaltrials.gov/ct2/show/NCT00993551), that assesses speech development of 558 infants with cleft palate from 1–5 years of age. We developed a software program, TimeStamper, for the coding procedure of 12-month-olds to equalize assessment conditions.

TimeStamper assists the coder in assessment of canonical babbling in two ways. Firstly, at the end of a recording, the coder makes a judgement of whether or not the child observed was in the canonical babbling stage. Secondly, based on the coder’s assessment of syllable productions, the CBR is calculated automatically by TimeStamper (more detail is provided below). TimeStamper also assists in creating an overview of the child’s syllable inventory regarding size and content.

TimeStamper is written in Java (v1.7) utilizing the video libraries from VLC (http://www.videolan.org/vlc/download-windows.en-GB.html). As such it will run on any operating system that supports Java and that there are VLC libraries compiled for (e.g. Windows and MacOS). The application has no other requirements to run, and the speed of processor and the amount of free memory will determine the quality of the playback experienced by the user. Any computer meeting the minimum requirements for Windows 10 should be able to run the application with no difficulties.

The TimeStamper application can play video (mp4/ogv) or audio (mp3) files and has been designed so as to make the process of assessing sound or video recordings of canonical babbling as straightforward as is possible ():

Figure 1. Process flow in TimeStamper.

The end user can configure the application (via a text-based configuration file) to use different keypresses to specify different types of sound (e.g. canonical and non-canonical), with the time point that a key was pressed being stored within the system.

The user interface has been designed to facilitate the analysis of multiple audio/video files by an international assessing team (specifically for the TOPS trial) and the attribution of the results for ease of tracking. When first started, the application will appear as shown in . The application will not start in full screen mode by default; however, this is the recommended mode of operation so as to ensure that any keypresses are not missed.

Figure 2. The home screen for TimeStamper.

Once launched, the user has only one option, to load a new audio/video file, via the ‘Load’ button (). If the user attempts to move forward without completing all of the required, then a warning message is displayed to the user to prompt for completion ().

Figure 3. The initial load screen.

Figure 4. Example for data missing warning.

Once all fields have been completed, the media is loaded and the application is ready for use. The user then commences their assessment. Prior to the assessment commencing, a warning is displayed to ensure that the assessor is aware that sufficient time should be allotted as pausing is not an option. During the assessment, the listener hits a key – default S or L – for each canonical syllable and non-canonical syllable, respectively. The program automatically stamps a time marker for each key strike and also calculates the CBR. A coder can see the count of syllables on the screen so that she/he knows the annotation was registered. However, a coder is only able to see the last annotation to avoid any influence on the final decision of whether or not she/he thinks the child was babbling canonically. At the end of the recording, a window pops up asking the coder to select canonical yes/no, and then to list the syllables she/he found the child produced with control (these syllables are stored in UTF-8 format so as to allow for the use of special characters, such as those used in the International Phonetic Database). When the coder clicks done, the answers are saved automatically in two different locations but not visible to the coder.

The output from the application is split between the canonical and non-canonical syllables () and calculates the percentage split between the different behaviour types, that is, the CBR (see line 3, ).

Figure 5. Example for canonical syllable ratio output.

shows a screen shot of pop-up window for information on ‘Canonical yes/no’ and listing of syllables produced with control, for ease of future analysis.

Figure 6. Example for syllable record screen.

Utilization of this method of storing the results allows for ease of data analysis and attribution to a specific recording and coder. When the recording’s data have been saved, the process can be repeated – the user is reminded at this point that loading a new video will result in all of the previous data being lost.

This tool can be used to assess prelinguistic vocalizations of typically developing infants as well as infants from clinical subgroups with risk of speech and language delay such as children with autism spectrum disorder, hearing impairment and cleft palate. It may be useful in research with many participants and/or many coders, and it may prove valid for audit or quality registries as it is performed in real time. The assessment in real time reduces the time and resource demand markedly (work smarter not harder). With the commonality of output files, it is relatively straightforward to combine the extracts from multiple coders into a single file for a more detailed analysis.

The TimeStamper software can be obtained by contacting [email protected] and will be made available for non-commercial use under the terms of the Apache 2.0 License (https://choosealicense.com/license/apache-2.0/), and referencing this paper in any publication arising from the use of this software.

Declaration of interest

Its contents are solely the responsibility of the authors and do not necessarily represent the official views of the NIDCR.

Acknowledgements

The authors thank Professor David K. Oller for his generous help in sharing his knowledge about naturalistic listening in real time. The authors thank Victor Hansen who developed the first version of the TimeStamper software.

Additional information

Funding

The development of TimeStamper was made possible by Grant Numbers [U01DE018664] and [U01DE018837] from the National Institute of Dental and Craniofacial Research (NIDCR).

Related Research Data

Vocal Patterns in Infants with Autism Spectrum Disorder: Canonical Babbling Status and Vocalization Frequency

Source: The University of North Carolina at Chapel Hill University Libraries

Predicting phonetic transcription agreement: Insights from research in infant vocalizations

Source: Informa UK Limited

The Role of Prematurity and Socioeconomic Status in the Onset of Canonical Babbling in Infants

Source: Elsevier BV

Assessment of prelinguistic vocalizations in real time: a comparison with phonetic transcription and assessment of inter-coder-reliability.

Source: Taylor & Francis

Infant vocalizations and the early diagnosis of severe hearing impairment

Source: Elsevier BV

Late Onset Canonical Babbling: A Possible Early Marker of Abnormal Development

Source: American Association on Intellectual and Developmental Disabilities (AAIDD)

Predicting Phonological Development

Source: Palgrave Macmillan UK

Babbling development of hearing-impaired and normally hearing subjects.

Source: American Speech Language Hearing Association

STAGES OF SPEECH DEVELOPMENT IN THE FIRST YEAR OF LIFE

Source: Elsevier

THE EMERGENCE OF THE SOUNDS OF SPEECH IN INFANCY

Source: Elsevier

The Impact of Cleft Type on Early Vocal Development in Babies With Cleft Palate

Source: SAGE Publications

Precursors to speech in infancy: the prediction of speech and language disorders.

Source: Elsevier BV

Identification of Prelinguistic Phonological Categories

Source: American Speech Language Hearing Association

Why early linguistic milestones are delayed in children with Williams syndrome: late onset of hand banging as a possible rate–limiting constraint on the emergence of canonical babbling

Source: Wiley

Linking provided by

References

Chapman, K. L., Hardin-Jones, M., Schulte, J., & Halter, K. A. (2001). Vocal development of 9-month-old babies with cleft palate. Journal of Speech, Language, and Hearing Research, 44(6), 1268–1283. doi:10.1044/1092-4388(2001/099)
PubMed Web of Science ®Google Scholar
Eilers, R. E., & Oller, D. K. (1994). Infant vocalizations and the early diagnosis of severe hearing impairment. The Journal of Pediatrics, 124(2), 199–203. doi:10.1016/S0022-3476(94)70303-5
PubMed Web of Science ®Google Scholar
Eilers, R. E., Oller, D. K., Levine, S., Basinger, D., Lynch, M. P., & Urbano, R. (1993). The role of prematurity and socioeconomic status in the onset of canonical babbling in infants. Infant Behavior and Development, 16, 297–315. doi:10.1016/0163-6383(93)80037-9
Web of Science ®Google Scholar
Hardin-Jones, M., Chapman, K. L., & Schulte, J. (2003). The impact of cleft type on early vocal development in babies with cleft palate. The Cleft Palate-Craniofacial Journal, 40(5), 453–459. doi:10.1597/1545-1569(2003)040<0453:TIOCTO>2.0.CO;2
PubMed Web of Science ®Google Scholar
Masataka, N. (2001). Why early linguistic milestones are delayed in children with Williams syndrome: Late onset of hand banging as a possible rate-limiting constraint on the emergence of CB. Developmental Science, 4, 158–164. doi:10.1111/1467-7687.00161
Web of Science ®Google Scholar
Menyuk, P., Liebergott, J., & Schultz, M. (1986). Predicting phonological development. In B. Lindblom & R. Zetterstrom (Eds.), Precursors of early speech (vol. 44, pp. 79–93). Houndmills: Stockton Press.
Google Scholar
Oller, D. K., Eilers, R. E., Neal, A. R., & Cobo-Lewis, A. B. (1998). Late onset canonical babbling: A possible early marker of abnormal development. American Journal on Mental Retardation, 103, 249–265. doi:10.1352/0895-8017(1998)103<0249:LOCBAP>2.0.CO;2
PubMedGoogle Scholar
Oller, D. K., Eilers, R. E., Neal, A. R., & Schwartz, H. K. (1999). Precursors to speech in infancy: The prediction of speech and language disorders. Journal of Communication Disorders, 32, 223–246. doi:10.1016/S0021-9924(99)00013-1
PubMed Web of Science ®Google Scholar
Oller, D. K. (1980). The emergence of the sounds of speech in infancy. In G. Yeni-Komshian, J. Kavanagh, & C. Ferguson (Eds.), Child phonology (Vol. 1, pp. 93–112). Production New York, NY: Academic Press.
Google Scholar
Patten, E., Belardi, K., Baranek, G. T., Watson, L. R., Labban, J. D., & Oller, D. K. (2014). Vocal patterns in infants with autism spectrum disorder: Canonical babbling status and vocalization frequency. Journal of Autism and Developmental Disorders, 44(10), 2413–2428. doi:10.1007/s10803-014-2047-4
PubMed Web of Science ®Google Scholar
Ramsdell, H. L., Kimbrough Oller, D., & Ethington, C. A. (2007). Predicting phonetic transcription agreement: Insights from research in infant vocalizations. Clinical Linguistics & Phonetics, 21(10), 793–831. doi:10.1080/02699200701547869
PubMed Web of Science ®Google Scholar
Ramsdell, H. L., Oller, D. K., Buder, E. H., Ethington, C. A., & Chorna, L. (2012). Identification of prelinguistic phonological categories. Journal of Speech, Language, and Hearing Research, 55(6), 1626–1639. doi:10.1044/1092-4388(2012/11-0250)
PubMed Web of Science ®Google Scholar
Schauwers, K., Gillis, S., Daemers, K., De Beukelaer, C., & Govaerts, P. J. (2004). Cochlear implantation between 5 and 20 months of age: The onset of babbling and the audiologic outcome. Otology & Neurotology, 25(3), 263–270. doi:10.1097/00129492-200405000-00011
PubMed Web of Science ®Google Scholar
Stark, R. E. (1980). Stages of speech development in the first year of life. In G. Yeni-Komshian, J. Kavanagh, & C. Ferguson (Eds.), Child phonology (Vol. 1, pp. 73–90). New York, NY: Academic Press.
Google Scholar
Stoel-Gammon, C., & Otomo, K. (1986). Babbling development of hearing impaired and normally hearing subjects. Journal of Speech and Hearing Disorders, 51, 33–41. doi:10.1044/jshd.5101.33
PubMedGoogle Scholar
Vihman, M. M., & Greenlee, M. (1987). Individual differences in phonological development ages one and three years. Journal of Speech, Language, and Hearing Research, 30(4), 503–521. doi:10.1044/jshr.3004.503
Web of Science ®Google Scholar
Willadsen, E., Persson, M. C., Lohmander, A., Patrick, K., Shaw, W. C., & Oller, D. K., (2017, February). Naturalistic assessment of prelinguistic vocalizations in infants with cleft palate: a methodological study. Paper presented at the 13th International Congress of Cleft Lip and Palate and Related Craniofacial Anomalies, Chennai.
Google Scholar

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

A software program to assist coding of prelinguistic vocalizations in real time

ABSTRACT

Declaration of interest

Acknowledgements

Related Research Data

References

Information for

Open access

Opportunities

Help and information

A software program to assist coding of prelinguistic vocalizations in real time

ABSTRACT

Declaration of interest

Acknowledgements

Additional information

Funding

Related Research Data

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date