

A large-scale multilingual speech-to-text translation corpus.
CoVoST is a large-scale, multilingual speech-to-text translation corpus developed by Facebook Research to address the shortage of parallel data for training end-to-end speech translation (ST) models. Built on the Common Voice dataset, it covers translations from English into 15 languages and from 21 languages into English, comprising approximately 2,880 hours of speech from 78,000 speakers. The corpus is diversified and openly licensed, with the aim of fostering ST research. End-to-end ST models trained on it offer a simpler system, lower inference latency, and less error compounding than cascaded ST systems, which chain separate speech recognition and machine translation components. Data splitting scripts and Fairseq S2T examples are provided to support model training.
With roughly 2,880 hours of speech across multiple languages and translation directions, CoVoST provides enough data to train robust, generalizable ST models.
Supports direct training of speech-to-text translation models, eliminating the need for intermediate ASR and MT components.
Provides scripts to generate train, development, and test splits from the corpus, ensuring consistent evaluation methodologies.
Leverages the Common Voice dataset, providing a large, diverse, and publicly available source of speech data.
Includes an out-of-domain evaluation set from Tatoeba, so model performance can be assessed on data outside the Common Voice domain.
Download Common Voice audio clips and transcripts.
Download CoVoST translations.
Generate data splits using the provided script (get_covost_splits.py).
Specify the version, source language, target language, root path, and Common Voice TSV path.
Obtain train, development, and test TSV files.
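The split-generation step above can be sketched as a small Python helper that assembles the command line for get_covost_splits.py. This is a minimal sketch, not the script's documented interface: the flag names (--version, --src-lang, --tgt-lang, --root, --cv-tsv) are assumptions inferred from the five inputs listed above, and the paths in the usage example are hypothetical. Check the script's own help output before relying on them.

```python
import shlex


def build_split_command(version, src_lang, tgt_lang, root, cv_tsv,
                        script="get_covost_splits.py"):
    """Assemble an argument list for the CoVoST split script.

    The flag names below are ASSUMED from the inputs the setup steps
    mention (version, source language, target language, root path,
    Common Voice TSV path); verify them against the script itself.
    """
    return [
        "python", script,
        "--version", str(version),   # corpus version (assumed flag)
        "--src-lang", src_lang,      # source language code (assumed flag)
        "--tgt-lang", tgt_lang,      # target language code (assumed flag)
        "--root", str(root),         # folder holding the translation TSVs
        "--cv-tsv", str(cv_tsv),     # Common Voice TSV with transcripts
    ]


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    cmd = build_split_command(
        version=2,
        src_lang="fr",
        tgt_lang="en",
        root="data/covost/fr",
        cv_tsv="data/common_voice/fr/validated.tsv",
    )
    # Print the command for inspection rather than running it.
    print(shlex.join(cmd))
```

Building the arguments as a list keeps paths containing spaces intact if the command is later handed to subprocess.run, and printing it first makes it easy to confirm the flags before anything is executed.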
Verified feedback from other users.
“CoVoST is highly regarded for its comprehensive multilingual coverage and free availability, enhancing speech-to-text translation research.”
