Product Update: Automated Video Transcription

Written by Philippe Dame
Posted on August 31, 2017

As of today, all Recollective sites have received a free and significant new feature: Fully automated video transcription.

No longer will your participants' most vivid and candid insights be trapped inside a video everyone must watch. Every single word uttered in a video will now be right there on the page for you to read, search, excerpt and summarize.

With support for over 110 languages and variants, our ever-improving AI-powered speech-to-text conversion will accurately transcribe all the spoken audio content in participant-submitted videos. This includes webcam recordings, mobile device submissions and traditional video uploads.

Transcript Appearance

Once your site has the new software update, the new transcription feature will be active. A text transcript will appear directly below or alongside participant-submitted videos so you can listen and read simultaneously.

We are already working on an interactive transcript experience. Recollective will soon highlight words as they're spoken and allow word selection to control video playback.

UPDATE (Oct 3, 2017): Recollective now has fully interactive video transcripts!
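
For readers curious about how a transcript can follow along with playback, here is a minimal Python sketch. It is not Recollective's implementation, which this post does not describe. It assumes each transcribed word carries a start and end time in seconds; the TimedWord type, the function names and the sample timings are all hypothetical.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class TimedWord:
    """One transcribed word with hypothetical timing data (in seconds)."""
    text: str
    start: float
    end: float


def word_index_at(words, position):
    """Return the index of the word being spoken at `position`, or None.

    `words` must be sorted by start time. A binary search finds the last
    word that starts at or before `position`; it only counts as "spoken"
    if `position` falls before that word's end time.
    """
    starts = [w.start for w in words]
    i = bisect_right(starts, position) - 1
    if i >= 0 and position < words[i].end:
        return i
    return None


def seek_time_for(words, index):
    """Return the playback position to jump to when a word is selected."""
    return words[index].start


# Made-up timings, for illustration only.
words = [
    TimedWord("i", 0.0, 0.25),
    TimedWord("love", 0.25, 0.75),
    TimedWord("this", 1.0, 1.5),
    TimedWord("product", 1.5, 2.0),
]
```

With timings like these, the player could call word_index_at on each playback tick to decide which word to highlight, and seek_time_for when a word is clicked. Pauses between words (such as 0.8 seconds in the sample) return None, so nothing is highlighted.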

Improving Transcripts

Machine learning and artificial intelligence get better with time, but accuracy can still vary with sound quality and accent. Recollective gives you full control to improve the generated transcripts so that your word clouds and text excerpts are accurate.

First, it's important to know that such transcripts are a raw extraction of text and won't include some of the things you might expect, like punctuation.

If you ever receive a transcript that is of very poor quality, we recommend you check the audio quality of the video during its playback. If it's good, ensure the language shown in the transcript header matches the actual language spoken in the video.

Recollective offers an option to instantly re-transcribe a video with another language or language variant to improve accuracy. For example, we support 13 English and 20 Spanish accents, so selecting the closest match will greatly increase transcription accuracy.

Once you're satisfied with the automated transcript, you're still welcome to make minor edits directly to the transcript. To do so, select "Edit" from the menu above the transcript.

UPDATE (Oct 3, 2017): You can now just double-click any word to instantly edit or remove it. This update is part of the new interactive video transcript. There is no longer a dedicated edit mode.

Processing Older Videos

If you have a study already underway, you might be wondering whether Recollective will transcribe the videos you've already collected. The short answer is yes: please contact us if you are conducting an active study with older videos that need transcription.

Recollective can separate the audio from all past submitted videos so the transcription engine can convert the spoken words into text.

Saving Excerpts

Valuable insights found in your video transcripts can now be saved and easily reported. Just highlight one or more words to reveal the "Save Excerpt" menu option. This follows the same process available for saving verbatim excerpts on all other text submitted by participants.

Once a snippet of text is saved, you can apply one or more codes to the excerpt. This means all video content can finally be included in your Excerpt Reports.

In forthcoming releases of Recollective, excerpts from video transcripts will automatically generate a matching video clip! You'll be able to quickly assemble different "highlight reels" without bulky video exports or complex post-processing!
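
As a rough illustration of how a saved excerpt could map to a video clip, the Python sketch below computes a clip's start and end from the first and last words of an excerpt. This is an assumption about one possible approach, not a description of the forthcoming feature. The clip_bounds function, its padding parameter and the sample word timings are hypothetical.

```python
def clip_bounds(word_times, first, last, duration, padding=0.5):
    """Return (start, end) in seconds for a clip covering an excerpt.

    word_times: list of (start, end) times, one pair per transcript word.
    first, last: indexes of the first and last words in the excerpt.
    duration:    total length of the video in seconds.
    padding:     extra time added on each side so the clip does not
                 cut off abruptly (a hypothetical default).
    """
    if not 0 <= first <= last < len(word_times):
        raise ValueError("excerpt must cover a valid range of words")
    # Clamp to the video so padding never runs past either end.
    start = max(0.0, word_times[first][0] - padding)
    end = min(duration, word_times[last][1] + padding)
    return start, end


# Made-up timings, for illustration only.
word_times = [(0.0, 0.25), (0.25, 0.75), (1.0, 1.5), (1.5, 2.0)]
```

Several excerpts processed this way would yield a list of time ranges that could be played back to back as a highlight reel.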

Disabling Transcription

If you are not content with the transcripts generated by this new feature, be aware that the transcription feature can be disabled for your site. Simply contact us via our standard support channels to share with us any concerns you have. We're always looking to improve.

Supported Languages and Variants

Recollective's interface is fully translated into 18 unique languages and, in this new update, we can now recognize over 110 spoken languages and variants.

UPDATE (June 2020): We now support 24 languages for the interface and 127 spoken languages and variants. View our resource article on Supported Languages.

Below is the complete list of the spoken languages supported for video transcription. In our next release, we'll allow participants to select their spoken language variant prior to video submission to maximize transcription accuracy.

  • Afrikaans (South Africa)
  • Amharic (Ethiopia)
  • Armenian (Armenia)
  • Azerbaijani (Azerbaijan)
  • Indonesian (Indonesia)
  • Malay (Malaysia)
  • Bengali (Bangladesh)
  • Bengali (India)
  • Catalan (Spain)
  • Czech (Czech Republic)
  • Danish (Denmark)
  • German (Germany)
  • English (Australia)
  • English (Canada)
  • English (Ghana)
  • English (United Kingdom)
  • English (India)
  • English (Ireland)
  • English (Kenya)
  • English (New Zealand)
  • English (Nigeria)
  • English (Philippines)
  • English (South Africa)
  • English (Tanzania)
  • English (United States)
  • Spanish (Argentina)
  • Spanish (Bolivia)
  • Spanish (Chile)
  • Spanish (Colombia)
  • Spanish (Costa Rica)
  • Spanish (Ecuador)
  • Spanish (El Salvador)
  • Spanish (Spain)
  • Spanish (United States)
  • Spanish (Guatemala)
  • Spanish (Honduras)
  • Spanish (Mexico)
  • Spanish (Nicaragua)
  • Spanish (Panama)
  • Spanish (Paraguay)
  • Spanish (Peru)
  • Spanish (Puerto Rico)
  • Spanish (Dominican Republic)
  • Spanish (Uruguay)
  • Spanish (Venezuela)
  • Basque (Spain)
  • Filipino (Philippines)
  • French (Canada)
  • French (France)
  • Galician (Spain)
  • Georgian (Georgia)
  • Gujarati (India)
  • Croatian (Croatia)
  • Zulu (South Africa)
  • Icelandic (Iceland)
  • Italian (Italy)
  • Javanese (Indonesia)
  • Kannada (India)
  • Khmer (Cambodia)
  • Lao (Laos)
  • Latvian (Latvia)
  • Lithuanian (Lithuania)
  • Hungarian (Hungary)
  • Malayalam (India)
  • Marathi (India)
  • Dutch (Netherlands)
  • Nepali (Nepal)
  • Norwegian Bokmål (Norway)
  • Polish (Poland)
  • Portuguese (Brazil)
  • Portuguese (Portugal)
  • Romanian (Romania)
  • Sinhala (Sri Lanka)
  • Slovak (Slovakia)
  • Slovenian (Slovenia)
  • Sundanese (Indonesia)
  • Swahili (Tanzania)
  • Swahili (Kenya)
  • Finnish (Finland)
  • Swedish (Sweden)
  • Tamil (India)
  • Tamil (Singapore)
  • Tamil (Sri Lanka)
  • Tamil (Malaysia)
  • Telugu (India)
  • Vietnamese (Vietnam)
  • Turkish (Turkey)
  • Urdu (Pakistan)
  • Urdu (India)
  • Greek (Greece)
  • Bulgarian (Bulgaria)
  • Russian (Russia)
  • Serbian (Serbia)
  • Ukrainian (Ukraine)
  • Hebrew (Israel)
  • Arabic (Israel)
  • Arabic (Jordan)
  • Arabic (United Arab Emirates)
  • Arabic (Bahrain)
  • Arabic (Algeria)
  • Arabic (Saudi Arabia)
  • Arabic (Iraq)
  • Arabic (Kuwait)
  • Arabic (Morocco)
  • Arabic (Tunisia)
  • Arabic (Oman)
  • Arabic (State of Palestine)
  • Arabic (Qatar)
  • Arabic (Lebanon)
  • Arabic (Egypt)
  • Persian (Iran)
  • Hindi (India)
  • Thai (Thailand)
  • Korean (South Korea)
  • Chinese, Mandarin (Traditional, Taiwan)
  • Chinese, Cantonese (Traditional, Hong Kong)
  • Chinese, Mandarin (Simplified, Hong Kong)
  • Chinese, Mandarin (Simplified, China)
  • Japanese (Japan)

We updated this list in April 2020 to also include:

  • Dutch (Belgium) - nl-BE - Nederlands (België)
  • Estonian (Estonia) - et-EE - Eesti keel (Eesti)
  • Macedonian (North Macedonia) - mk-MK - Македонски (Северна Македонија)
  • Mongolian (Mongolia) - mn-MN - Монгол (Монгол)
  • Burmese (Myanmar) - my-MM - ဗမာ (မြန်မာ)
  • Punjabi (Gurmukhi, India) - pa-Guru-IN - ਪੰਜਾਬੀ (ਗੁਰਮੁਖੀ, ਭਾਰਤ)
  • Albanian (Albania) - sq-AL - Shqip (Shqiperi)
  • Uzbek (Uzbekistan) - uz-UZ - O’zbek (O’zbekiston)
  • and more

We look forward to your feedback!

Philippe Dame
Co-Founder & CPO