Accommodations Toolkit

Text-to-speech (Computer Generated Voice): Research

This fact sheet on text-to-speech is part of the Accommodations Toolkit published by the National Center on Educational Outcomes (NCEO). It summarizes information and research findings on the use of text-to-speech with computer-generated speech as an assessment accessibility feature or accommodation for students with disabilities.[1] The toolkit also contains a summary of states’ accessibility policies for text-to-speech.

What is text-to-speech with computer-generated speech? Text-to-speech with computer-generated speech refers to technology that reads aloud written directions and test items employing computer software to generate a synthesized voice. This version of audio delivery is different from others that use the human voice, either in-person by a test administrator at the time of the assessment or a previously recorded human voice. It is important to note that students can be provided with text-to-speech in several ways (e.g., read aloud test directions only; read aloud items and directions; read aloud items, passages, and directions) (Lee et al., 2021).

What are the research findings on who should use this accommodation? Not all students benefit from using text-to-speech and care should be used to ensure that policies target the small number of students who may benefit from using this accommodation. Some students who have print-related disabilities may find it useful, including those with reading-related learning disabilities (Young, 2017) and visual impairments (Hansen et al., 2016). According to Flowers et al. (2011), many states’ accessibility policies for summative assessments used for accountability allow text-to-speech for students in grades 3 through 11 with health impairments, attention deficit disorders, intellectual disabilities, or specific learning disabilities.

What are the research findings on implementation of text-to-speech with computer-generated speech? Ten studies provided evidence about the impact of text-to-speech using a synthesized voice on assessment performance. Some of the studies showed positive or negative effects, while other studies found little or no impact.

Two studies indicated that the text-to-speech accommodation using a synthesized voice had no significant effect on performance for students with disabilities when compared to the use of no accommodations. Meyer & Bouck (2014) found that students with reading-related learning disabilities in middle school scored no differently on reading comprehension using text-to-speech software than with no accommodations. Another study found that students with learning disabilities in grade 5 scored the same when completing test-like writing tasks using a text-to-speech accommodation with synthesized voice as they did without any accommodations (Siliό & Barbetta, 2010).
Similarly, two studies that compared human read-aloud and text-to-speech delivery found that students did not benefit more from speech with synthesized voice than from human read-aloud delivery or no accommodation (Flowers et al, 2011; Meyer & Bouck, 2017).
Two very small studies indicated that the text-to-speech accommodation using a synthesized voice had some benefit for some students. Young (2017) found that three of four high school students with learning and additional disabilities improved their performance on an assessment measuring vocabulary, literary analysis, and comprehension when provided with the computer-generated text-to-speech. Hansen et al. (2016) studied the use of multiple accommodations to provide three students (grades 8 and 9) with visual disabilities access to science simulation items. The study found that text-to-speech via a screen reader allowed for easy navigation of the test item, but was not enough to provide students with a full understanding of the test items.
One study indicated that students with disabilities experienced negative effects when using synthesized text-to-speech programs. An analysis of three states’ reading comprehension scores indicated that grade 4 students with various disabilities performed better without computer-based text-to-speech software than with it (Ricci, 2015).
There were mixed findings across two studies that explored whether the use of reading pens with synthesized voices to read aloud difficult words improved student performance. Schmitt et al. (2011) found that when a reading pen was used to convert text to synthesized speech, two of three high school students demonstrated comprehension improvements. In a study of middle school students, including students with and without disabilities, Thurlow et al. (2010) found that just 11 of the 76 of the participants performed better in reading comprehension; 65 of the 76 students experienced no benefit.

What perceptions do students and teachers have about text-to-speech with computer-generated speech? Six research studies examined student and teacher perceptions of the use of text-to-speech with a synthesized voice. In all the studies, students had favorable perceptions of text-to-speech software. They preferred synthesized audio to in-person oral delivery (Flowers et al., 2011), were comfortable using the software (Higgins & Katz, 2013; Lee et al., 2021), reported stronger information recall, and liked managing the speed of speech (Meyer & Bouck, 2014; Young, 2017).

Students found the reading pen format helpful to scan and decode difficult words (Schmitt et al., 2011; Thurlow et al., 2010). Flowers et al. (2011) found that teachers had favorable perceptions of the use of text-to-speech using a synthesized voice.

What have we learned overall? Text-to-speech using computer-generated speech has been broadly provided to allow students with disabilities to more easily access academic assessments even though the research on synthesized text-to-speech accommodations has had mixed findings. Across studies, students with disabilities have experienced positive effects, no effects, and negative effects from using this accommodation. Students who do not know the content will not do well on an assessment with or without the use of text-to-speech. However, students, as well as their teachers, had positive perceptions of synthesized text-to-speech, even in situations where there were no performance benefits from using the accommodation.

References

Flowers, C., Kim, D. H., Lewis, P., & Davis, V. C. (2011). A comparison of computer-based testing and pencil-and-paper testing for students with a read-aloud accommodation. Journal of Special Education Technology, 26(1), 1–12. https://doi.org/10.1177/016264341102600102
Hansen, E. G., Liu, L., Rogat, A., & Hakkinen, M. T. (2016). Designing innovative science assessments that are accessible for students who are blind. Journal of Blindness Innovation and Research, 6(1). https://doi.org/10.5241/6-91
Higgins, J., & Katz, M. (2013). A comparison of audio representations of mathematics content. Journal of Special Education Technology, 28(3), 59–66. https://doi.org/10.1177/016264341302800305
Lee, D., Buzick, H., Sireci, S. G., Lee, M., & Laitusis, C. (2021). Embedded accommodation and accessibility support usage on a computer-based statewide achievement test. Practical Assessment, Research, and Evaluation, 26(25). https://doi.org/https://doi.org/10.7275/21926674
Meyer, N. K., & Bouck, E. C. (2014). The impact of text-to-speech on expository reading for adolescents with LD. Journal of Special Education Technology, 29(1), 21–34. https://doi.org/10.1177/016264341402900102
Meyer, N. K., & Bouck, E. C. (2017). Read-aloud accommodations, expository text, and adolescents with learning disabilities. Learning Disabilities: A Multidisciplinary Journal, 22(1), 34–47. https://doi.org/10.18666/ldmj-2017-v22-i1-7932
Ricci, N. N. (2015). The effect of the read-aloud testing accommodation on the 2011 fourth-grade National Assessment of Educational Progress in Reading, for students with disabilities in the New York State metropolitan, tri-state area. Dissertation Abstracts International: Section A. Humanities and Social Sciences, 77(01E). http://search.proquest.com/docview/1720330696/abstract.
Schmitt, A. J., McCallum, E., Rubinic, D., & Hawkins, R. (2011). Reading pen decoding and vocabulary accommodations: Impact on student comprehension accuracy and rate. Journal of Evidence-Based Practices for Schools, 12(2), 223–240.
Siliό, M. C., & Barbetta, P. M. (2010). The effects of word prediction and text-to-speech technologies on the narrative writing skills of Hispanic students with specific learning disabilities. Journal of Special Education Technology, 25(4), 17–32. https://doi.org/10.1177/016264341002500402.
Thurlow, M. L., Moen, R. E., Lekwa, A. J., & Scullin, S. B. (2010). Examination of a reading pen as a partial auditory accommodation for reading assessment. Retrieved from University of Minnesota, Partnership for Accessible Reading Assessment website: http://www.readingassessment.info/resources/publications/ReadingPen/ReadingPenReport.pdf
Young, M. C. (2017). The effects of text-to-speech on reading comprehension of students with learning disabilities. Dissertation Abstracts International: Section A. Humanities and Social Sciences, 78(11E). http://search.proquest.com/docview/1927469340/abstract.

Attribution

Ressa, V., Rogers, C., Lazarus, S. S., Hinkle, A. R., & Goldstone, L. (2024). Text-to-speech (computer generated voice): Research (NCEO Accommodations Toolkit #4a.1). National Center on Educational Outcomes.

The Center is supported through a Cooperative Agreement (#H326G210002) with the Research to Practice Division, Office of Special Education Programs, U.S. Department of Education. The Center is affiliated with the Institute on Community Integration at the College of Education and Human Development, University of Minnesota. Consistent with EDGAR §75.62, the contents of this report were developed under the Cooperative Agreement from the U.S. Department of Education, but do not necessarily represent the policy or opinions of the U.S. Department of Education or Offices within it. Readers should not assume endorsement by the federal government. Project Officer: David Egnor