Web Captioning Overview
Article Contents
Translations
Translations of this article are available in:
- French - External Link - courtesy of Ideose - External Link
- German - External Link - courtesy of SEO Matrix - External Link
- German - External Link - courtesy of MediaUp - External Link
- Russian - External Link - courtesy of Slava Mikheev - External Link
Closed vs. Open Captions
Captions as typically seen on television
Most people are familiar with closed captioning, a technique of displaying the captioned text only when it is desired. All television sets with screen sizes of 13 inches and larger must contain the hardware to display captions. Closed captioning of most pre-recorded television programs is now a legal requirement in the United States. Television closed captioning is used by millions of individuals who are deaf or hard of hearing; millions more use it in the classroom or in noisy environments—like bars, restaurants, and airports. As the average age of the population increases, so does the number of people with hearing impairments. According to US government figures, one person in five has some functional hearing limitation. Because of the growing need for access to captions, many live broadcasts (such as news and sports events) and DVD and VHS programs now include closed captioning.
Closed captions for television are very limited in their formatting, because the caption look, feel, and location are determined by the caption decoder built into the television set. You can get more information about television captioning at Captioning FAQ - external link.
Captions as seen on DVD
Open captions include the same text as closed captions, but the captions are a permanent part of the picture, and cannot typically be turned off. This is much like watching subtitling of foreign language films. DVD utilizes a form of subtitling to display captions. Open captions are not decoded by the television set, but are a part of the video information. This technique allows for more control over caption location, size, color, font, and timing.
Captions as seen in Windows Media Player
For web video, captions can be open, closed, or both. A common technique for open captioning is adding the caption text directly to the video itself. This requires a video editing or encoding program that allows you to overlay titles onto the video. The captions are visible to anybody viewing the video clip and cannot be turned off. This gives you total control over the way the captions appear, but can be very time consuming and expensive to produce. The more common way of captioning audio and video on the web is to use functionality within the multimedia players to display the captions along with or on top of the video or audio.
Audio Descriptions
Audio descriptions provide additional information about what is visible on the screen. This allows video content to be accessible to those with visual disabilities. Though not commonly utilized in television and movies, it is gaining in popularity. Audio descriptions are helpful on the web if visual content in web video provides important content not available through the audio alone. An example of audio descriptions for something you have probably seen is found below. Can you visualize what is being described?
Listen to Audio Descriptions in MP3 Format (152KB)
If web video is produced with accessibility in mind, then audio descriptions are often unnecessary, as long as visual elements within the video are described in the audio.
Example of Audio Descriptions
In producing an instructional video for the web, the need for audio descriptions can often be avoided. If the video were displaying a list of five important items, the narrator might say, "As you can see, there are five important points." In this case, audio descriptions would be necessary to provide the visual content to those with visual disabilities who cannot 'see' what the important points are. However, if the narrator says, "There are five important points. They are..." and then reads or describes each of the points, then the visual content is being conveyed through audio and there is no additional need for audio descriptions.
Transcripts
Transcripts also provide an important part of making web multimedia content accessible. Transcripts allow anyone that cannot access content from web audio or video to read a text transcript instead. Transcripts do not have to be verbatim accounts of the spoken word in a video. They can contain additional descriptions, explanations, or comments that may be beneficial. Transcripts allow deaf/blind users to get content through the use of refreshable Braille and other devices. For most web video, both captions and a text transcript should be provided. For content that is audio only, then a transcript will usually suffice.
Transcripts provide a textual version of the content that can be accessed by anyone. They also allow the content of your multimedia to be searchable. Screen reader users may also prefer the transcript over listening to the audio of the web multimedia. Most proficient screen reader users set their assistive technology to read at a rate much faster than most humans speak. This allows the screen reader user to access the transcript of the video and get the same content in less time than listening to the actual audio content.
Captioning Terms and Technologies
On the web, the primary multimedia technologies are Microsoft's Windows Media Player, Apple's Quicktime, RealNetwork's RealPlayer, and Macromedia Flash. Unfortunately, there is no fully standardized mechanism for captioning across these technologies. Each media player handles captions differently. Below are some common technologies and terms that apply to captioning within the various media players.
- SMIL (Synchronized Multimedia Integration Language)
- A standards-based language used by Quicktime and RealPlayer to control the layout and presentation of visual and audible items. SMIL is used to control the display, positioning, and timing of captions and audio/video multimedia. The captions themselves are stored in a Text Track file if you’re using Quicktime or a RealText file if you’re using RealPlayer.
- SAMI (Synchronized Accessible Media Interchange)
- Microsoft’s technique for adding captions for Windows Media Player. A SAMI file contains the text to be displayed within the captions and information that synchronizes individual caption displays to the multimedia presentation.
- Text Track
- Quicktime uses a Text Track file to store the caption text and synchronization (timing) information.
- RealText
- RealPlayer uses a RealText file to store the captions text and synchronization (timing) information.
- MAGpie
- Developed by the CPB/WGBH National Center for Accessible Media (NCAM), MAGpie is a free tool for creating caption files that can be utilized by media players.
- Hi-Caption
- HiSoftware's Hi-Caption allows creation of captions for media players.