Corpora

Alba speech corpus

https://doi.org/10.7488/ds/2506

Parallel Audiobook Corpus

https://doi.org/10.7488/ds/2468

VCTK

https://doi.org/10.7488/ds/1994

The SIWIS French Speech Synthesis Database

https://doi.org/10.7488/ds/1705

Hurricane natural speech corpus - higher quality version

https://doi.org/10.7488/ds/2482

The Voice Conversion Challenge 2018 database

https://doi.org/10.7488/ds/2337

The Voice Conversion Challenge 2016 database

https://doi.org/10.7488/ds/1575

Repeated Harvard Sentence Prompts corpus version 0.5

https://doi.org/10.7488/ds/39

CSTR NAM TIMIT Plus corpus

http://homepages.inf.ed.ac.uk/jyamagis/page3/page57/page57.html

mngu0

http://www.mngu0.org

Romanian Speech Synthesis (RSS) Database

http://romaniantts.com/rssdb/

The SWARA Corpus

https://speech.utcluj.ro/swarasc/

The Simple4All Tundra Corpus

http://tundra.simple4all.org

NB Tale - a basic acoustic phonetic speech database for Norwegian

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-31&lang=en

Tuva Speech Database

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-44&lang=en

LibriTTS corpus

http://www.openslr.org/60/

The GRID audiovisual sentence corpus

http://spandh.dcs.shef.ac.uk/gridcorpus/

TCD-TIMIT (audio visual multi-speaker database)

https://sigmedia.tcd.ie/TCDTIMIT/node/1

Idlak/Living-Audio-Dataset

https://github.com/Idlak/Living-Audio-Dataset

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

https://github.com/Kyubyong/CSS10

JVS corpus

https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus

JSUT corpus

https://sites.google.com/site/shinnosuketakamichi/publication/jsut

PTDB-TUG: Pitch Tracking Database from Graz University of Technology

https://www.spsc.tugraz.at/databases-and-tools/ptdb-tug-pitch-tracking-database-from-graz-university-of-technology.html

DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices

https://archive.org/details/daps_dataset

The LJ Speech Dataset

https://keithito.com/LJ-Speech-Dataset/

Anonymous

Search

Corpora

Namespaces

More

Page actions

Contents

Alba speech corpus

Parallel Audiobook Corpus

VCTK

The SIWIS French Speech Synthesis Database

Hurricane natural speech corpus - higher quality version

The Voice Conversion Challenge 2018 database

The Voice Conversion Challenge 2016 database

Repeated Harvard Sentence Prompts corpus version 0.5

CSTR NAM TIMIT Plus corpus

mngu0

Romanian Speech Synthesis (RSS) Database

The SWARA Corpus

The Simple4All Tundra Corpus

NB Tale - a basic acoustic phonetic speech database for Norwegian

Tuva Speech Database

LibriTTS corpus

The GRID audiovisual sentence corpus

TCD-TIMIT (audio visual multi-speaker database)

Idlak/Living-Audio-Dataset

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

JVS corpus

JSUT corpus

PTDB-TUG: Pitch Tracking Database from Graz University of Technology

DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices

The LJ Speech Dataset

Navigation

Navigation

Special pages

Wiki tools

Wiki tools

Anonymous

Search

Corpora

Alba speech corpus

Parallel Audiobook Corpus

VCTK

The SIWIS French Speech Synthesis Database

Hurricane natural speech corpus - higher quality version

The Voice Conversion Challenge 2018 database

The Voice Conversion Challenge 2016 database

Repeated Harvard Sentence Prompts corpus version 0.5

CSTR NAM TIMIT Plus corpus

mngu0

Romanian Speech Synthesis (RSS) Database

The SWARA Corpus

The Simple4All Tundra Corpus

NB Tale - a basic acoustic phonetic speech database for Norwegian

Tuva Speech Database

LibriTTS corpus

The GRID audiovisual sentence corpus

TCD-TIMIT (audio visual multi-speaker database)

Idlak/Living-Audio-Dataset

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

JVS corpus

JSUT corpus

PTDB-TUG: Pitch Tracking Database from Graz University of Technology

DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices

The LJ Speech Dataset

Navigation

Wiki tools

Page tools