Corpora

From SynSIG
Revision as of 17:00, 24 February 2020 by Sebastienlemaguer (talk | contribs) (Created page with " == Alba speech corpus == https://doi.org/10.7488/ds/2506 == Parallel Audiobook Corpus == https://doi.org/10.7488/ds/2468 == VCTK == https://doi.org/10.7488/ds/1994 == The...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Alba speech corpus

https://doi.org/10.7488/ds/2506

Parallel Audiobook Corpus

https://doi.org/10.7488/ds/2468

VCTK

https://doi.org/10.7488/ds/1994

The SIWIS French Speech Synthesis Database

https://doi.org/10.7488/ds/1705

Hurricane natural speech corpus - higher quality version

https://doi.org/10.7488/ds/2482

The Voice Conversion Challenge 2018 database

https://doi.org/10.7488/ds/2337

The Voice Conversion Challenge 2016 database

https://doi.org/10.7488/ds/1575

Repeated Harvard Sentence Prompts corpus version 0.5

https://doi.org/10.7488/ds/39

CSTR NAM TIMIT Plus corpus

http://homepages.inf.ed.ac.uk/jyamagis/page3/page57/page57.html

mngu0

http://www.mngu0.org

Romanian Speech Synthesis (RSS) Database

http://romaniantts.com/rssdb/

The SWARA Corpus

https://speech.utcluj.ro/swarasc/

The Simple4All Tundra Corpus

http://tundra.simple4all.org

NB Tale - a basic acoustic phonetic speech database for Norwegian

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-31&lang=en

Tuva Speech Database

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-44&lang=en

LibriTTS corpus

http://www.openslr.org/60/

The GRID audiovisual sentence corpus

http://spandh.dcs.shef.ac.uk/gridcorpus/

TCD-TIMIT (audio visual multi-speaker database)

https://sigmedia.tcd.ie/TCDTIMIT/node/1

Idlak/Living-Audio-Dataset

https://github.com/Idlak/Living-Audio-Dataset

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

https://github.com/Kyubyong/CSS10

JVS corpus

https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus

JSUT corpus

https://sites.google.com/site/shinnosuketakamichi/publication/jsut

PTDB-TUG: Pitch Tracking Database from Graz University of Technology

https://www.spsc.tugraz.at/databases-and-tools/ptdb-tug-pitch-tracking-database-from-graz-university-of-technology.html


DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices

https://archive.org/details/daps_dataset

The LJ Speech Dataset

https://keithito.com/LJ-Speech-Dataset/