Transcribe Video to Text

In today's fast-paced digital age, the ability to transcribe audio and video to text offers immense value in education, business, and research. Converting lectures and meetings into written format enhances accessibility, aids learning, and streamlines data processing. By offering searchable and easily understandable content, this feature speeds up the process of finding and analyzing information. This helps both individuals and organizations to make quicker, well-informed decisions.

What is Transcribe Video?

The model that processes the data uses a special technology called Whisper, developed by OpenAI, to turn spoken words into written text. You can learn more about Whisper here: Whisper - OpenAI.

For this release, Whisper can understand and transcribe speech in English only. It can distinguish between the speakers if there are multiple people in the video.

Currently, we support MP4, MKV, and MOV video formats for Transcribe Video.

Tip: We also have an option to transcribe audio to text.

Note: The transcribe Video to Text feature is included in the Documents Plus subscription at $9.99 monthly.

How to use Transcribe Video

There are some key points that have to be taken into consideration before using the feature:

an Internet connection is needed for the feature to work
the feature won’t work in the background
only locally stored files can be transcribed for now
the model is downloaded to the device and operates locally, ensuring user data remains secure without being sent elsewhere
enabling this feature may increase battery usage
while the feature works well with spoken content like lectures and podcasts, it may not be suitable for music and noises, potentially leading to inaccurate results
this feature is not supported on certain devices: iPhone 11 and older iPhone models; iPad (9th, 8th generation) iPad mini (5th generation), iPad Pro 12.9-inch (4th, 3rd generation), iPad Air (3rd generation), iPad Pro 11-inch (2nd, 1st generation)

Since this is an experimental feature, we encourage users to share their feedback. Your input is invaluable as we continue to refine and improve this technology. You can drop us a line at rdsupport@readdle.com.

Currently, we have 4 entry points for this feature:

Actions menu — can be entered via the three dots button.
Extension — can be accessed from other apps via Share.
Smart Actions – depending on the file type and the action performed, the banner automatically suggests your further actions.
Video Actions - can be accessed through the video player.

How Transcribe Video works

Choose any of the entry points and enable the feature.
The Whisper neurotechnology model will be downloaded to your device.
The audio will be extracted from the video.
After this, the transcription process begins.
The transcribed file will be saved to the exact location of the original video.

Note: The transcribed file will be saved in TXT format.

If you’d like to get individual help from our Customer Support team, follow the steps on the Contact Us page.

Thank you! Tell us more about your experience with Documents Help Center:

The information in the article is confusing or wrong.
I don’t like the described feature or policy.
There isn’t information I was looking for.

It’s optional, but it will help us improve further

Description

Have some specific request?

Get help faster following a few simple steps and we’ll get back to you as soon as possible.