Help:Contents: Difference between revisions

From SynSIG
* Questionnaire:
** Download the questionnaire from http://wissap.iiit.ac.in/blizzard2014/Blizzard2014_Ques.txt, complete it, and return it by the deadline given in the Timeline.

Revision as of 14:13, 6 February 2015

Read these first

Registration and license agreement

1. Register by emailing blizzard@festvox.org. We need to know your team name, the name of the main contact person, your affiliation, and contact details including email address, postal address and phone number. Please state which datasets you want to download and which tasks you are planning to submit entries to (this is non-binding, but is helpful for our planning).

2. Complete the license for the Indian language data, which can be accessed via http://wissap.iiit.ac.in/blizzard2014/

Data download

  • Indian languages: About 2 hours of speech data in each of six Indian languages (Assamese, Gujarati, Hindi, Rajasthani, Tamil and Telugu), recorded by native professional speakers in high-quality studio environments. Text is provided in UTF-8 format. No other information, such as segment labels, is provided.
  • These speech databases are provided by a group of institutions (IIIT Hyderabad, IIT Madras, DAIICT, SSN College of Engineering, IIT Mandi, and IIT Guwahati), with support and collaboration from DeitY, Govt. of India.
  • You will be sent a username and password to download the data after accepting the license. Download links can be found via http://wissap.iiit.ac.in/blizzard2014/
  • Note that the official release for this challenge is only the speech data (wav/ directory), sampled at 16 kHz, and the corresponding text in UTF-8 format (train.done.data).
  • Development tools, available via http://www.cstr.ed.ac.uk/projects/blizzard/, include:
    • Submitted synthetic speech and listener scores for some previous Blizzard Challenges, which may be helpful during development
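
After downloading, it may be worth checking that the data matches the description of the official release given above (speech in a wav/ directory sampled at 16 kHz, text in UTF-8). The following is a minimal sketch using only the Python standard library. The function names are hypothetical, the flat `*.wav` layout inside wav/ is an assumption, and no assumption is made about the internal line format of train.done.data beyond its being UTF-8 text.

```python
import wave
from pathlib import Path

EXPECTED_RATE = 16000  # 16 kHz, as stated for the official release


def check_wav_rates(wav_dir, expected_rate=EXPECTED_RATE):
    """Return (filename, rate) pairs for WAV files whose sample rate differs.

    Assumes the WAV files sit directly inside wav_dir (an assumption,
    not something the release notes specify).
    """
    mismatches = []
    for path in sorted(Path(wav_dir).glob("*.wav")):
        with wave.open(str(path), "rb") as wf:
            rate = wf.getframerate()
        if rate != expected_rate:
            mismatches.append((path.name, rate))
    return mismatches


def count_text_lines(text_path):
    """Decode the text file as UTF-8 and count its non-empty lines.

    Raises UnicodeDecodeError if the file is not valid UTF-8.
    """
    with open(text_path, encoding="utf-8") as f:
        return sum(1 for line in f if line.strip())
```

Calling `check_wav_rates("wav")` from the unpacked data directory should return an empty list if every file is at 16 kHz, and `count_text_lines("train.done.data")` gives a quick count to compare against the number of WAV files.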