Bynder can automatically generate transcripts for audio and video assets in your Bynder DAM via Speech-to-Text. This feature converts spoken content in multiple languages into text (transcripts), making these assets easily searchable. Users can locate keywords or phrases used within videos and audio files without having to manually add individual tags. Clicking on a word in the generated transcript will play the media from that specific location. In addition, you can improve the accessibility of your content by adding closed captions to your videos.
How to Enable the Speech-to-Text Search
Please contact your Customer Success Manager to learn more about enabling this feature and any associated costs.
Download Transcripts for Video and Audio Assets
Note
The subtitles will display within Bynder only. Subtitles will not appear for assets embedded outside of Bynder (i.e., via embed code) or in any other Bynder modules.
- Navigate to Assets.
- Open your video.
- Click Transcript.
- A new window will pop up where you can view the transcript.
Note
If you click on a word in the transcript, the video will jump to the exact location where that word is spoken.
- View the date generated, length, language, word count, and confidence score.
- Click any of the three file formats (SRT, VTT, TXT) to download the transcript and enable closed captions for several supported languages.
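Once a transcript has been downloaded, it can also be searched outside of Bynder. The following is a minimal sketch, assuming the downloaded file follows the standard SRT layout (a cue number, a timing line, then the caption text, with cues separated by blank lines). The function name `find_in_srt` and the sample captions are hypothetical and are not part of Bynder.

```python
import re

# Standard SRT timing line, e.g. "00:00:04,500 --> 00:00:08,000"
TIMING = re.compile(
    r"(\d{2}:\d{2}:\d{2},\d{3})\s*-->\s*(\d{2}:\d{2}:\d{2},\d{3})"
)


def find_in_srt(srt_text, keyword):
    """Return (start_time, caption_text) for every cue containing keyword.

    Hypothetical helper: assumes a standard SRT file. Matching is
    case-insensitive.
    """
    matches = []
    normalized = srt_text.replace("\r\n", "\n").strip()
    # Cues are separated by one or more blank lines.
    for cue in re.split(r"\n\s*\n", normalized):
        lines = cue.splitlines()
        for i, line in enumerate(lines):
            timing = TIMING.search(line)
            if timing:
                # Everything after the timing line is caption text.
                text = " ".join(part.strip() for part in lines[i + 1:])
                if keyword.lower() in text.lower():
                    matches.append((timing.group(1), text))
                break
    return matches


# Made-up sample content, for illustration only.
SAMPLE = """1
00:00:01,000 --> 00:00:04,000
Welcome to the brand guidelines overview.

2
00:00:04,500 --> 00:00:08,000
Our logo should always have clear space.
"""

print(find_in_srt(SAMPLE, "logo"))
# [('00:00:04,500', 'Our logo should always have clear space.')]
```

The returned start time is the point in the media where the matching caption begins, which mirrors what clicking a word in the Bynder transcript window does.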
File Restrictions
The following files cannot be transcribed:
- Files larger than 2GB
- Files longer than 4 hours
- Files shorter than 3 seconds
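If you want to check files against these restrictions before uploading them, the limits can be expressed as a small pre-check. This is only a sketch: the function name `transcription_blockers` is hypothetical, the file size and duration are assumed to be known already, and 2GB is assumed to mean 2 × 1024³ bytes, which the article does not specify.

```python
from __future__ import annotations

# Limits taken from the list above.
MAX_SIZE_BYTES = 2 * 1024**3       # assumption: "2GB" read as 2 GiB
MAX_DURATION_SECONDS = 4 * 60 * 60  # 4 hours
MIN_DURATION_SECONDS = 3


def transcription_blockers(size_bytes: int, duration_seconds: float) -> list[str]:
    """Return the reasons a file cannot be transcribed (empty if none).

    Hypothetical helper, not part of Bynder.
    """
    reasons = []
    if size_bytes > MAX_SIZE_BYTES:
        reasons.append("file is larger than 2GB")
    if duration_seconds > MAX_DURATION_SECONDS:
        reasons.append("file is longer than 4 hours")
    if duration_seconds < MIN_DURATION_SECONDS:
        reasons.append("file is shorter than 3 seconds")
    return reasons


# A 10 MB, two-minute clip passes all three checks.
print(transcription_blockers(10_000_000, 120))  # []
# A two-second clip is too short.
print(transcription_blockers(10_000_000, 2))    # ['file is shorter than 3 seconds']
```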
Confidence Score
A confidence score indicates the accuracy of a transcript. See below for some of the factors that can affect the confidence score:
- Audio Quality: The quality of the audio input can significantly affect the confidence score. Clear, noise-free audio produces a higher confidence score; poor-quality audio or loud background noise results in lower scores.
- Speaker Variability: If multiple speakers are in the audio, this can produce a lower confidence score, as distinguishing between different voices can be challenging.
- Currently, only single-language identification is supported. If two languages are spoken in the media, the predominant language will be transcribed.
- Language Complexity: Complex vocabulary, accents, and dialects can impact the confidence score. Uncommon or technical terms may lead to lower scores.
Related Articles
Use the Search Bar to Find Assets
AI Search Experience (formerly EMRAYS Integration)
Search for Text in Images (OCR)