Corpora: Difference between revisions

Latest revision as of 17:25, 24 February 2020

By default, each corpus contains text and audio. Corpora that contain additional information are classified under a dedicated section (e.g. Audiovisual for audiovisual speech).

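Multilingual

Idlak/Living-Audio-Dataset

https://github.com/Idlak/Living-Audio-Dataset

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

https://github.com/Kyubyong/CSS10

The Simple4All Tundra Corpus

http://tundra.simple4all.org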

Language specific

English

Alba speech corpus

https://doi.org/10.7488/ds/2506

Parallel Audiobook Corpus

https://doi.org/10.7488/ds/2468

VCTK

https://doi.org/10.7488/ds/2645

Hurricane natural speech corpus - higher quality version

https://doi.org/10.7488/ds/2482

Repeated Harvard Sentence Prompts corpus version 0.5

https://doi.org/10.7488/ds/39

The Voice Conversion Challenge 2018 database

https://doi.org/10.7488/ds/2337

The Voice Conversion Challenge 2016 database

https://doi.org/10.7488/ds/1575

LibriTTS corpus

http://www.openslr.org/60/

CSTR NAM TIMIT Plus corpus

http://homepages.inf.ed.ac.uk/jyamagis/page3/page57/page57.html

The LJ Speech Dataset

https://keithito.com/LJ-Speech-Dataset/

DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices

https://archive.org/details/daps_dataset

French

The SIWIS French Speech Synthesis Database

https://doi.org/10.7488/ds/1705

Japanese

JVS corpus

https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus

JSUT corpus

https://sites.google.com/site/shinnosuketakamichi/publication/jsut

Norwegian

NB Tale - a basic acoustic phonetic speech database for Norwegian

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-31&lang=en

Tuva Speech Database

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-44&lang=en

Romanian

Romanian Speech Synthesis (RSS) Database

http://romaniantts.com/rssdb/

The SWARA Corpus

https://speech.utcluj.ro/swarasc/

Articulatory data

mngu0

http://www.mngu0.org

Audiovisual

TCD-TIMIT (audio visual multi-speaker database)

https://sigmedia.tcd.ie/TCDTIMIT/node/1

The GRID audiovisual sentence corpus

http://spandh.dcs.shef.ac.uk/gridcorpus/

Non classified

PTDB-TUG: Pitch Tracking Database from Graz University of Technology

https://www.spsc.tugraz.at/databases-and-tools/ptdb-tug-pitch-tracking-database-from-graz-university-of-technology.html

@@ Line 1: / Line 1: @@
+'''By default, each corpus is text/audio. Some corpora contain more information. In this case, they are classified under a dedicated section (i.e. Audiovisual for audiovisual speech)'''
-== Alba speech corpus ==
+== Multilingual ==
+=== Idlak/Living-Audio-Dataset ===
+https://github.com/Idlak/Living-Audio-Dataset
+=== CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ===
+https://github.com/Kyubyong/CSS10
+=== The Simple4All Tundra Corpus ===
+http://tundra.simple4all.org
+== Language specific ==
+=== English ===
+==== Alba speech corpus ====
 https://doi.org/10.7488/ds/2506
-== Parallel Audiobook Corpus ==
+==== Parallel Audiobook Corpus ====
 https://doi.org/10.7488/ds/2468
-== VCTK  ==
+==== VCTK  ====
-https://doi.org/10.7488/ds/1994
+https://doi.org/10.7488/ds/2645
-== The SIWIS French Speech Synthesis Database ==
+==== Hurricane natural speech corpus - higher quality version ====
-https://doi.org/10.7488/ds/1705
+https://doi.org/10.7488/ds/2482
-== Hurricane natural speech corpus - higher quality version ==
+==== Repeated Harvard Sentence Prompts corpus version 0.5 ====
-https://doi.org/10.7488/ds/2482
+https://doi.org/10.7488/ds/39
-== The Voice Conversion Challenge 2018 database  ==
+==== The Voice Conversion Challenge 2018 database  ====
 https://doi.org/10.7488/ds/2337
-== The Voice Conversion Challenge 2016 database ==
+==== The Voice Conversion Challenge 2016 database ====
 https://doi.org/10.7488/ds/1575
-== Repeated Harvard Sentence Prompts corpus version 0.5 ==
+==== LibriTTS corpus ====
-https://doi.org/10.7488/ds/39
+http://www.openslr.org/60/
-== CSTR NAM TIMIT Plus corpus ==
+==== CSTR NAM TIMIT Plus corpus ====
 http://homepages.inf.ed.ac.uk/jyamagis/page3/page57/page57.html
-== mngu0 ==
+==== The LJ Speech Dataset ====
-http://www.mngu0.org
+https://keithito.com/LJ-Speech-Dataset/
+==== DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices ====
+https://archive.org/details/daps_dataset
+=== French ===
+==== The SIWIS French Speech Synthesis Database ====
+https://doi.org/10.7488/ds/1705
+=== Japanese ===
+==== JVS corpus ====
+https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus
+==== JSUT corpus ====
+https://sites.google.com/site/shinnosuketakamichi/publication/jsut
-== Romanian Speech Synthesis (RSS) Database ==
-http://romaniantts.com/rssdb/
-== The SWARA Corpus ==
-https://speech.utcluj.ro/swarasc/
-== The Simple4All Tundra Corpus ==
+=== Norwegian ===
-http://tundra.simple4all.org
-== NB Tale - a basic acoustic phonetic speech database for Norwegian ==
+==== NB Tale - a basic acoustic phonetic speech database for Norwegian ====
 https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-31&lang=en
-== Tuva Speech Database ==
+==== Tuva Speech Database ====
 https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-44&lang=en
-== LibriTTS corpus ==
+=== Romanian ===
-http://www.openslr.org/60/
-== The GRID audiovisual sentence corpus ==
+==== Romanian Speech Synthesis (RSS) Database ====
-http://spandh.dcs.shef.ac.uk/gridcorpus/
+http://romaniantts.com/rssdb/
-== TCD-TIMIT (audio visual multi-speaker database) ==
+==== The SWARA Corpus ====
-https://sigmedia.tcd.ie/TCDTIMIT/node/1
+https://speech.utcluj.ro/swarasc/
-== Idlak/Living-Audio-Dataset ==
-https://github.com/Idlak/Living-Audio-Dataset
-== CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ==
+== Articulatory data ==
-https://github.com/Kyubyong/CSS10
-== JVS corpus ==
+=== mngu0 ===
-https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus
+http://www.mngu0.org
-== JSUT corpus ==
+== Audiovisual ==
-https://sites.google.com/site/shinnosuketakamichi/publication/jsut
-== PTDB-TUG: Pitch Tracking Database from Graz University of Technology ==
+=== TCD-TIMIT (audio visual multi-speaker database) ===
-https://www.spsc.tugraz.at/databases-and-tools/ptdb-tug-pitch-tracking-database-from-graz-university-of-technology.html
+https://sigmedia.tcd.ie/TCDTIMIT/node/1
+=== The GRID audiovisual sentence corpus ===
+http://spandh.dcs.shef.ac.uk/gridcorpus/
-== DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices ==
+== Non classified ==
-https://archive.org/details/daps_dataset
+=== PTDB-TUG: Pitch Tracking Database from Graz University of Technology ===
+https://www.spsc.tugraz.at/databases-and-tools/ptdb-tug-pitch-tracking-database-from-graz-university-of-technology.html
-== The LJ Speech Dataset ==
-https://keithito.com/LJ-Speech-Dataset/
