Scheduled at 10:15 am in Mattie Silks on Friday, November 21.
#41350Speaker(s)
- Daniel Goerz, CEO, Habitat Learn Inc
Session Details
- Length of Session: 1-hr
- Format: Lecture
- Expertise Level: Beginner
- Type of session: General Conference
Summary
Human-sounding text-to-speech (HTTS) can reduce learning barriers in Higher Ed, (ex. reading fatigue, auditory processing issues, and voice bias). Modern HTTS uses real voice actors and deep learning to create natural, customizable voices with emotional depth improving comprehension, comfort, and inclusion—especially for First-Generation and English Language Learnering students. Research shows relatable voices enhance learning and equity. Used ethically, HTTS supports personalized, private, and more accessible education experiences
Abstract
Examine the growing role of human-sounding text-to-speech (HTTS) technology in overcoming learning barriers in higher ed. Traditional HTTS voices are often robotic, monotone, and British-accented, making them difficult for many students to connect with—especially those experiencing reading fatigue, auditory processing disorders, or language challenges. These systems have also lacked representation, often failing to reflect the diverse identities of first-generation, multilingual, and underrepresented students.
Recent innovations in HTTS, driven by deep learning and real voice actor recordings, have produced more natural, expressive, and customizable voices. These modern HTTS systems include variations in tone, pitch, emotion, and regional accents. Hearing course content in a familiar or culturally relevant voice helps students feel more comfortable, boosts comprehension, and strengthens their sense of belonging. Research highlights that relatable voices enhance processing speed, reduce cognitive load, and support academic equity. Human-sounding HTTS also aids students who struggle to understand fast-paced or accented lectures. By transforming transcripts into clear, customizable audio, it provides accessible review options while protecting instructor intellectual property.
While human-like HTTS is a valuable tool, the lecture underscores that it does not replace human support. Limitations remain—particularly with structured or technical content like complex equations—and accuracy still requires human review. Ethical considerations are also critical, including transparency around AI usage, prevention of misuse (e.g., deepfakes), and ensuring voice actors give informed consent. HTTS can significantly improve accessibility, comprehension, and inclusion when used responsibly. It empowers students to not only access course material but also connect with it on a personal and cultural level, helping close equity gaps and support diverse learning needs in Higher education.
Keypoints
- HTTS enhances comprehension/retention through hearing voices that reflect cultural and linguistic identities
- Customizable TTS voices reduce cognitive barriers for 1st gen, ELL and neurodiverse learners
- Use of ethical HTTS delivers accessible, personalized learning without replacing the need for human support
Disability Areas
Cognitive/Learning
Topic Areas
Artificial Intelligence, Assistive Technology, Captioning/Transcription, Uncategorized
Speaker Bio(s)
Daniel Goerz
Note taker, Learning Strategist, Exam Proctor Alternate Format Specialist (Kurzweil), Assistive Technologist Founded Note Taking Express(renamed Habitat Learn Inc) in 2014 to better serve diverse learners Leads Messenger Pigeon, a platform built with input from students and accessibility teams Advocates for inclusive, human-centred technology Believes technology is a tool, not a replacement, for thoughtful, community- driven support

