Corpora: Difference between revisions

From SynSIG
(Created page with " == Alba speech corpus == https://doi.org/10.7488/ds/2506 == Parallel Audiobook Corpus == https://doi.org/10.7488/ds/2468 == VCTK == https://doi.org/10.7488/ds/1994 == The...")
 
No edit summary
 
Line 1: Line 1:
'''By default, each corpus is text/audio. Some corpora contain more information. In this case, they are classified under a dedicated section (i.e. Audiovisual for audiovisual speech)'''


== Alba speech corpus ==
== Multilingual ==
 
=== Idlak/Living-Audio-Dataset ===
https://github.com/Idlak/Living-Audio-Dataset
 
=== CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ===
https://github.com/Kyubyong/CSS10
 
=== The Simple4All Tundra Corpus ===
http://tundra.simple4all.org
 
 
== Language specific ==
=== English ===
 
==== Alba speech corpus ====
https://doi.org/10.7488/ds/2506
https://doi.org/10.7488/ds/2506


== Parallel Audiobook Corpus ==
==== Parallel Audiobook Corpus ====
https://doi.org/10.7488/ds/2468
https://doi.org/10.7488/ds/2468


== VCTK  ==
==== VCTK  ====
https://doi.org/10.7488/ds/1994
https://doi.org/10.7488/ds/2645


== The SIWIS French Speech Synthesis Database ==
==== Hurricane natural speech corpus - higher quality version ====
https://doi.org/10.7488/ds/1705
https://doi.org/10.7488/ds/2482


== Hurricane natural speech corpus - higher quality version ==
==== Repeated Harvard Sentence Prompts corpus version 0.5 ====
https://doi.org/10.7488/ds/2482
https://doi.org/10.7488/ds/39


== The Voice Conversion Challenge 2018 database  ==
==== The Voice Conversion Challenge 2018 database  ====
https://doi.org/10.7488/ds/2337
https://doi.org/10.7488/ds/2337


== The Voice Conversion Challenge 2016 database ==
==== The Voice Conversion Challenge 2016 database ====
https://doi.org/10.7488/ds/1575
https://doi.org/10.7488/ds/1575


== Repeated Harvard Sentence Prompts corpus version 0.5 ==
==== LibriTTS corpus ====
https://doi.org/10.7488/ds/39
http://www.openslr.org/60/


== CSTR NAM TIMIT Plus corpus ==
==== CSTR NAM TIMIT Plus corpus ====
http://homepages.inf.ed.ac.uk/jyamagis/page3/page57/page57.html
http://homepages.inf.ed.ac.uk/jyamagis/page3/page57/page57.html


== mngu0 ==
==== The LJ Speech Dataset ====
http://www.mngu0.org
https://keithito.com/LJ-Speech-Dataset/
 
==== DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices ====
https://archive.org/details/daps_dataset
 
 
=== French ===
 
==== The SIWIS French Speech Synthesis Database ====
https://doi.org/10.7488/ds/1705
 
 
=== Japanese ===
 
==== JVS corpus ====
https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus
 
==== JSUT corpus ====
https://sites.google.com/site/shinnosuketakamichi/publication/jsut


== Romanian Speech Synthesis (RSS) Database ==
http://romaniantts.com/rssdb/


== The SWARA Corpus ==
https://speech.utcluj.ro/swarasc/


== The Simple4All Tundra Corpus ==
=== Norwegian ===
http://tundra.simple4all.org


== NB Tale - a basic acoustic phonetic speech database for Norwegian ==
==== NB Tale - a basic acoustic phonetic speech database for Norwegian ====
https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-31&lang=en
https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-31&lang=en


== Tuva Speech Database ==
==== Tuva Speech Database ====
https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-44&lang=en
https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-44&lang=en


== LibriTTS corpus ==
=== Romanian ===
http://www.openslr.org/60/


== The GRID audiovisual sentence corpus ==
==== Romanian Speech Synthesis (RSS) Database ====
http://spandh.dcs.shef.ac.uk/gridcorpus/
http://romaniantts.com/rssdb/


== TCD-TIMIT (audio visual multi-speaker database) ==
==== The SWARA Corpus ====
https://sigmedia.tcd.ie/TCDTIMIT/node/1
https://speech.utcluj.ro/swarasc/


== Idlak/Living-Audio-Dataset ==
https://github.com/Idlak/Living-Audio-Dataset


== CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ==
== Articulatory data ==
https://github.com/Kyubyong/CSS10


== JVS corpus ==
=== mngu0 ===
https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus
http://www.mngu0.org


== JSUT corpus ==
== Audiovisual ==
https://sites.google.com/site/shinnosuketakamichi/publication/jsut


== PTDB-TUG: Pitch Tracking Database from Graz University of Technology ==
=== TCD-TIMIT (audio visual multi-speaker database) ===
https://www.spsc.tugraz.at/databases-and-tools/ptdb-tug-pitch-tracking-database-from-graz-university-of-technology.html
https://sigmedia.tcd.ie/TCDTIMIT/node/1


=== The GRID audiovisual sentence corpus ===
http://spandh.dcs.shef.ac.uk/gridcorpus/


== DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices ==
== Non classified ==
https://archive.org/details/daps_dataset
=== PTDB-TUG: Pitch Tracking Database from Graz University of Technology ===
 
https://www.spsc.tugraz.at/databases-and-tools/ptdb-tug-pitch-tracking-database-from-graz-university-of-technology.html
== The LJ Speech Dataset ==
https://keithito.com/LJ-Speech-Dataset/

Latest revision as of 17:25, 24 February 2020

By default, each corpus is text/audio. Some corpora contain more information. In this case, they are classified under a dedicated section (i.e. Audiovisual for audiovisual speech)

Multilingual

Idlak/Living-Audio-Dataset

https://github.com/Idlak/Living-Audio-Dataset

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

https://github.com/Kyubyong/CSS10

The Simple4All Tundra Corpus

http://tundra.simple4all.org


Language specific

English

Alba speech corpus

https://doi.org/10.7488/ds/2506

Parallel Audiobook Corpus

https://doi.org/10.7488/ds/2468

VCTK

https://doi.org/10.7488/ds/2645

Hurricane natural speech corpus - higher quality version

https://doi.org/10.7488/ds/2482

Repeated Harvard Sentence Prompts corpus version 0.5

https://doi.org/10.7488/ds/39

The Voice Conversion Challenge 2018 database

https://doi.org/10.7488/ds/2337

The Voice Conversion Challenge 2016 database

https://doi.org/10.7488/ds/1575

LibriTTS corpus

http://www.openslr.org/60/

CSTR NAM TIMIT Plus corpus

http://homepages.inf.ed.ac.uk/jyamagis/page3/page57/page57.html

The LJ Speech Dataset

https://keithito.com/LJ-Speech-Dataset/

DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices

https://archive.org/details/daps_dataset


French

The SIWIS French Speech Synthesis Database

https://doi.org/10.7488/ds/1705


Japanese

JVS corpus

https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus

JSUT corpus

https://sites.google.com/site/shinnosuketakamichi/publication/jsut


Norwegian

NB Tale - a basic acoustic phonetic speech database for Norwegian

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-31&lang=en

Tuva Speech Database

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-44&lang=en

Romanian

Romanian Speech Synthesis (RSS) Database

http://romaniantts.com/rssdb/

The SWARA Corpus

https://speech.utcluj.ro/swarasc/


Articulatory data

mngu0

http://www.mngu0.org

Audiovisual

TCD-TIMIT (audio visual multi-speaker database)

https://sigmedia.tcd.ie/TCDTIMIT/node/1

The GRID audiovisual sentence corpus

http://spandh.dcs.shef.ac.uk/gridcorpus/

Non classified

PTDB-TUG: Pitch Tracking Database from Graz University of Technology

https://www.spsc.tugraz.at/databases-and-tools/ptdb-tug-pitch-tracking-database-from-graz-university-of-technology.html