Corpora
By default, each corpus is text/audio. Some corpora contain more information. In this case, they are classified under a dedicated section (i.e. Audiovisual for audiovisual speech)
Multilingual
Idlak/Living-Audio-Dataset
https://github.com/Idlak/Living-Audio-Dataset
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
https://github.com/Kyubyong/CSS10
The Simple4All Tundra Corpus
Language specific
English
Alba speech corpus
https://doi.org/10.7488/ds/2506
Parallel Audiobook Corpus
https://doi.org/10.7488/ds/2468
VCTK
https://doi.org/10.7488/ds/2645
Hurricane natural speech corpus - higher quality version
https://doi.org/10.7488/ds/2482
Repeated Harvard Sentence Prompts corpus version 0.5
The Voice Conversion Challenge 2018 database
https://doi.org/10.7488/ds/2337
The Voice Conversion Challenge 2016 database
https://doi.org/10.7488/ds/1575
LibriTTS corpus
CSTR NAM TIMIT Plus corpus
http://homepages.inf.ed.ac.uk/jyamagis/page3/page57/page57.html
The LJ Speech Dataset
https://keithito.com/LJ-Speech-Dataset/
DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices
https://archive.org/details/daps_dataset
French
The SIWIS French Speech Synthesis Database
https://doi.org/10.7488/ds/1705
Japanese
JVS corpus
https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus
JSUT corpus
https://sites.google.com/site/shinnosuketakamichi/publication/jsut
Norwegian
NB Tale - a basic acoustic phonetic speech database for Norwegian
https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-31&lang=en
Tuva Speech Database
https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-44&lang=en
Romanian
Romanian Speech Synthesis (RSS) Database
The SWARA Corpus
https://speech.utcluj.ro/swarasc/
Articulatory data
mngu0
Audiovisual
TCD-TIMIT (audio visual multi-speaker database)
https://sigmedia.tcd.ie/TCDTIMIT/node/1
The GRID audiovisual sentence corpus
http://spandh.dcs.shef.ac.uk/gridcorpus/