端到端语音识别ESPnet2 实例 egs2列表

egs2 实例概述

一个简单的表格是这么创建的:

Directory nameCorpus nameTaskLanguageURL
aishellAISHELL-ASR0009-OS1 Open Source Mandarin Speech CorpusASRZHhttp://www.aishelltech.com/kysjcp
amiThe AMI Meeting CorpusASRENhttp://groups.inf.ed.ac.uk/ami/corpus/
an4CMU AN4 databaseASR/TTSENhttp://www.speech.cs.cmu.edu/databases/an4/
babelIARPA Babel corupsASR~20 Languageshttps://www.iarpa.gov/index.php/research-programs/babel
chime4The 4th CHiME Speech Separation and Recognition ChallengeASR/Multichannel ASRENhttp://spandh.dcs.shef.ac.uk/chime_challenge/chime2016/
commonvoiceThe Mozilla Common VoiceASR13 Languageshttps://voice.mozilla.org/datasets
csjCorpus of Spontaneous JapaneseASRJPhttps://pj.ninjal.ac.jp/corpus_center/csj/en/
csmscChinese Standard Mandarin Speech CopusTTSZHhttps://www.data-baker.com/open_source.html
dirha_wsjDistant-speech Interaction for Robust Home ApplicationsMulti-Array ASRENhttps://dirha.fbk.eu/, https://github.com/SHINE-FBK/DIRHA_English_wsj
how2How2: A Large-scale Dataset for Multimodal Language UnderstandingASR/Machine Translation/Speech TranslationEN->PThttps://github.com/srvk/how2-dataset
jsssJSSS: Japanese speech corpus for summarization and simplificationTTSJPhttps://sites.google.com/site/shinnosuketakamichi/research-topics/jsss_corpus
jsutJapanese speech corpus of Saruwatari-lab., University of TokyoASR/TTSJPhttps://sites.google.com/site/shinnosuketakamichi/publication/jsut
jvsJVS (Japanese versatile speech) corpusTTSJPhttps://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus
laborotvLaboroTVSpeech (A large-scale Japanese speech corpus on TV recordings)ASRJPhttps://laboro.ai/column/eg-laboro-tv-corpus-jp
librispeechLibriSpeech ASR corpusASRENhttp://www.openslr.org/12
ljspeechThe LJ Speech DatasetTTSENhttps://keithito.com/LJ-Speech-Dataset/
mini_an4Mini version of CMU AN4 database for the integration testASR/TTSENhttp://www.speech.cs.cmu.edu/databases/an4/
nscNational Speech CorpusASREN-SGhttps://www.imda.gov.sg/programme-listing/digital-services-lab/national-speech-corpus
mlsMLS (A large multilingual corpus derived from LibriVox audiobooks)ASR8 languageshttp://www.openslr.org/94/
open_li52Corpus combination with 52 languages(Commonvocie + voxforge)Multilingual ASR52 languages
ru_open_sttRussian Open Speech To Text (STT/ASR) DatasetASRRUhttps://github.com/snakers4/open_stt
reverbREVERB (REverberant Voice Enhancement and Recognition Benchmark) challengeASRENhttps://reverb2014.dereverberation.com/
spgispeechSPGISpeech 5k corpusASRENhttps://datasets.kensho.com/datasets/scribe
timitTIMIT Acoustic-Phonetic Continuous Speech CorpusASRENhttps://catalog.ldc.upenn.edu/LDC93S1
vctkEnglish Multi-speaker Corpus for CSTR Voice Cloning ToolkitTTSENhttp://www.udialogue.org/download/cstr-vctk-corpus.html
vivosVIVOS (Vietnamese corpus for ASR)ASRVIhttps://ailab.hcmus.edu.vn/vivos/
voxforgeVoxForgeASR7 languageshttp://www.voxforge.org/
wsjCSR-I (WSJ0) Complete, CSR-II (WSJ1) CompleteASRENhttps://catalog.ldc.upenn.edu/LDC93S6A,https://catalog.ldc.upenn.edu/LDC94S13A
wsj0_2mixMERL WSJ0-mix multi-speaker datasetASR/SEENhttp://www.merl.com/demos/deep-clustering
wsj0_2mix_spatializedMERL WSJ0-mix multi-speaker dataset (Spatialized version)ASR/Multichannel ASR/SEENhttp://www.merl.com/demos/deep-clustering
yesnoThe “yesno” corpusASRHEhttp://www.openslr.org/1
zeroth_koreanZeroth-KoreanASRKRhttp://www.openslr.org/40

使用方法

See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2

Logo

CSDN联合极客时间,共同打造面向开发者的精品内容学习社区,助力成长!

更多推荐