Corpora

By default, each corpus contains text and audio. Corpora that contain additional information are listed under a dedicated section (e.g. Audiovisual for audiovisual speech).

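Multilingual

Idlak/Living-Audio-Dataset

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

The Simple4All Tundra Corpus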

Language specific

English

Alba speech corpus

https://doi.org/10.7488/ds/2506

Parallel Audiobook Corpus

https://doi.org/10.7488/ds/2468

VCTK

https://doi.org/10.7488/ds/2645

Hurricane natural speech corpus - higher quality version

https://doi.org/10.7488/ds/2482

Repeated Harvard Sentence Prompts corpus version 0.5

https://doi.org/10.7488/ds/39

The Voice Conversion Challenge 2018 database

https://doi.org/10.7488/ds/2337

The Voice Conversion Challenge 2016 database

https://doi.org/10.7488/ds/1575

LibriTTS corpus

http://www.openslr.org/60/

CSTR NAM TIMIT Plus corpus

http://homepages.inf.ed.ac.uk/jyamagis/page3/page57/page57.html

The LJ Speech Dataset

https://keithito.com/LJ-Speech-Dataset/

DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices

https://archive.org/details/daps_dataset

French

The SIWIS French Speech Synthesis Database

https://doi.org/10.7488/ds/1705

Japanese

JVS corpus

https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus

JSUT corpus

https://sites.google.com/site/shinnosuketakamichi/publication/jsut

Norwegian

NB Tale - a basic acoustic phonetic speech database for Norwegian

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-31&lang=en

Tuva Speech Database

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-44&lang=en

Romanian

Romanian Speech Synthesis (RSS) Database

http://romaniantts.com/rssdb/

The SWARA Corpus

https://speech.utcluj.ro/swarasc/

Articulatory data

mngu0

http://www.mngu0.org

Audiovisual

TCD-TIMIT (audio visual multi-speaker database)

https://sigmedia.tcd.ie/TCDTIMIT/node/1

The GRID audiovisual sentence corpus

http://spandh.dcs.shef.ac.uk/gridcorpus/

Unclassified

PTDB-TUG: Pitch Tracking Database from Graz University of Technology

https://www.spsc.tugraz.at/databases-and-tools/ptdb-tug-pitch-tracking-database-from-graz-university-of-technology.html
