Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Implementation:Speechbrain Speechbrain Prepare CommonVoice SSL

From Leeroopedia


Knowledge Sources
Domains Self-Supervised Learning, Data Preparation
Last Updated 2026-02-09 00:00 GMT

Overview

Variant of the Common Voice data preparation script for the self-supervised learning (wav2vec2) recipe.

Description

This file (recipes/CommonVoice/self-supervised-learning/wav2vec2/common_voice_prepare.py) is a copy of the canonical Common Voice data preparation script. It provides the same prepare_common_voice function with identical parameters and behavior, placed within the self-supervised-learning/wav2vec2 recipe subdirectory for convenience. The canonical implementation is documented on the Implementation:Speechbrain_Speechbrain_Prepare_CommonVoice_Seq2Seq page.

Usage

Use this when preparing the Mozilla Common Voice dataset specifically for self-supervised learning (wav2vec2) training. See the canonical page for full details.

Code Reference

Source Location

Canonical Implementation

This is a duplicate of the seq2seq Common Voice preparation script. For full documentation including signature, I/O contract, and usage examples, see:

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment