Create an Audio Description for a Kaltura Media Entry
Overview
To be fully accessible and compliant with WCAG 2.1 AA standards, educational multimedia must include audio descriptions. While you are welcome to pay a vendor to produce audio descriptions, this article describes how to create one from scratch.
Critical Concepts
Definitions
Audio descriptions are spoken narrations that describe key visual elements of a video such as actions, settings, or on-screen text, making the content accessible to blind or visually impaired audiences. There are two types: standard audio descriptions and extended audio descriptions. Nowadays, both standard and extended audio descriptions are narrated using text-to-speech engines within our web browsers (rather than having actors read scripts).
Standard audio descriptions are narrations timed to fit within natural pauses in the spoken dialogue; they also appear as text onscreen. Because they must fit between lines of dialogue without interrupting the flow of the video, producing them is time-consuming and requires specialized skills to ensure clarity, conciseness, and relevance.
Extended audio descriptions are required when the pauses between pieces of spoken dialogue aren't long enough to narrate the relevant visual content. They pause the video at designated points so that the narration has enough time to play.
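As a rough illustration of the distinction above, the following Python sketch estimates whether a description would fit inside a pause in the dialogue. The speaking rate of 150 words per minute and the function names are assumptions made for this example only; they are not Kaltura settings, and actual text-to-speech timing will vary.

```python
# Hypothetical speaking rate used only for this estimate (not a Kaltura setting).
ASSUMED_WORDS_PER_MINUTE = 150


def estimated_narration_seconds(description, words_per_minute=ASSUMED_WORDS_PER_MINUTE):
    """Estimate how long a description takes to narrate, based on its word count."""
    word_count = len(description.split())
    return word_count * 60 / words_per_minute


def needs_extended_description(description, pause_seconds,
                               words_per_minute=ASSUMED_WORDS_PER_MINUTE):
    """Return True when the estimated narration is longer than the available pause."""
    return estimated_narration_seconds(description, words_per_minute) > pause_seconds


if __name__ == "__main__":
    text = "The instructor draws a triangle and labels each of its sides."
    print(estimated_narration_seconds(text))
    print(needs_extended_description(text, pause_seconds=2.0))
```

If the estimate exceeds the pause, the description would interrupt the dialogue, which is the situation extended audio descriptions are meant to handle.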
Things to Keep in Mind
On the front end, you can upload only extended audio descriptions. At present (February 2026), Kaltura doesn't allow users to upload standard audio descriptions. That said, the vast majority of educational content requires extended audio descriptions anyway, since the dialogue rarely leaves enough pauses to fully describe the visual content.
Steps to Take
There are three main tasks involved in generating an audio description:
Creating the initial file
Uploading it to the entry
Editing it in the Kaltura caption editor
Step 1: Create the Initial File
On your computer, open a basic text editor.
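Note: Both major operating systems include a free text editor: Notepad on Windows and TextEdit on a Mac. In TextEdit, choose Format > Make Plain Text before saving so that the file is saved as plain text rather than rich text.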
In your document, add the following text at the top (be sure to include the blank line after WEBVTT):
WEBVTT

1
00:00:01.000 --> 00:00:02.000
Start of description.
Click File > Save As.
Name the file whatever you like, but make sure the file name ends with .vtt (e.g. audio_description.vtt).
You now have a file that Kaltura will recognize as an extended audio description. "Start of description" is just a placeholder.
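If you would rather generate the placeholder file with a script than type it by hand, a minimal Python sketch might look like the following. The function name write_placeholder_vtt is made up for this example, and the file name is the one suggested above; the cue text and timecodes are copied from the steps in this section.

```python
from pathlib import Path

# Placeholder content from the steps above, including the blank line after "WEBVTT".
PLACEHOLDER_VTT = (
    "WEBVTT\n"
    "\n"
    "1\n"
    "00:00:01.000 --> 00:00:02.000\n"
    "Start of description.\n"
)


def write_placeholder_vtt(path):
    """Write the placeholder file as UTF-8 plain text and return its path."""
    target = Path(path)
    if target.suffix.lower() != ".vtt":
        raise ValueError("The file name must end with .vtt")
    # Writing bytes keeps the line endings exactly as written above.
    target.write_bytes(PLACEHOLDER_VTT.encode("utf-8"))
    return target


if __name__ == "__main__":
    print(write_placeholder_vtt("audio_description.vtt"))
```

The resulting file can be uploaded in Step 2 in the same way as one created in a text editor.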
Step 2: Upload the Extended Audio Description to the Entry
First, get to the edit page for the Kaltura entry in either Canvas or MediaSpace:
Click the profile icon at the top right of the page and select Login.
Enter your Active Directory credentials.
Click the profile icon again and select My Media.
Locate the video in question, click the kebab menu (three dots) on its row, and select Edit.
Below the media player, click the Captions tab.
Click Add extended audio description.
In the "Upload an EAD file" window that appears:
Click Browse to locate and select the .vtt file you created on your computer.
Click the Select Language pull-down menu and choose the language of the audio description.
In the Accuracy pull-down menu, select the percentage that you believe represents the description's accuracy. (This will most likely be 100% since you're creating it yourself.)
In the Label field, provide a name for the extended audio description. We recommend making it clear that it's an audio description, e.g. "EAD - English" or "English Extended Audio Description."
Click Save.
Step 3: Edit the Extended Audio Description
Click Edit Captions.
In the pull-down menu near the top of the page, select the label for your extended audio description if it's not already selected.
Click the play button in the media player to play the video.
When you come to a moment where visuals need to be described, press the pause button.
Click on the first timecode (the "start" timecode) and enter the timecode from the paused video.
Note: the timecode format for captions and audio descriptions in Kaltura is hours:minutes:seconds,milliseconds.
Click on the second timecode (the "end" timecode) and enter a value two seconds later than the start timecode.
Note: the end timecode is largely irrelevant. However, it must be later than the start timecode.
Enter your description in the adjacent field.
If you have additional audio descriptions to add, hover your mouse over one of the rows and click the + Add audio description button that appears.
Repeat steps a-h as needed.
When you're done creating your extended audio description, be sure to click Save.
You'll be asked to update the accuracy. Select whatever value you wish and click Save again.
Click Back to return to the edit page.
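The timecode arithmetic in the steps above (a start value in hours:minutes:seconds,milliseconds and an end value two seconds later) can be sketched in Python. This is only a convenience for working out the values to type into the caption editor; the function names are made up for this example, and the zero-padded widths are an assumption that mirrors the placeholder cue from Step 1.

```python
def kaltura_timecode(total_seconds):
    """Format a time in seconds as hours:minutes:seconds,milliseconds."""
    if total_seconds < 0:
        raise ValueError("The time must not be negative")
    total_ms = round(total_seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    seconds, milliseconds = divmod(rest, 1000)
    # Zero-padded widths are an assumption based on the placeholder cue.
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{milliseconds:03d}"


def cue_timecodes(paused_at_seconds, gap_seconds=2.0):
    """Return the start timecode and an end timecode gap_seconds later."""
    start = kaltura_timecode(paused_at_seconds)
    end = kaltura_timecode(paused_at_seconds + gap_seconds)
    return start, end


if __name__ == "__main__":
    # Video paused at 1 minute, 23.5 seconds.
    print(cue_timecodes(83.5))  # ('00:01:23,500', '00:01:25,500')
```

Because the end timecode only needs to be later than the start timecode, the two-second gap is kept as a default that you can change.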
If you still have questions or need additional assistance, feel free to contact us at kaltura@ucsd.edu.