Speech Synthesis Workshop (SSW): Difference between revisions

From SynSIG
No edit summary
Line 32: Line 32:
* [http://www.isca-speech.org/archive_open/ssw3/index.html SSW3] (Dedicated to the memory of Christian Benoît), November 26-29, 1998, Jenolan Caves House, Blue Mountains, Australia.  
* [http://www.isca-speech.org/archive_open/ssw3/index.html SSW3] (Dedicated to the memory of Christian Benoît), November 26-29, 1998, Jenolan Caves House, Blue Mountains, Australia.  
* [http://www.isca-speech.org/archive_open/ssw4/index.html SSW4], August 29 - September 1, 2001, Atholl Palace Hotel, Pitlochry, Perthshire, Scotland.
* [http://www.isca-speech.org/archive_open/ssw4/index.html SSW4], August 29 - September 1, 2001, Atholl Palace Hotel, Pitlochry, Perthshire, Scotland.
* [http://www.ssw5.org/ SSW5], June 14-16, 2004, Pittsburgh, PA, USA.
* [http://www.isca-speech.org/archive_open/ssw5/ SSW5], June 14-16, 2004, Pittsburgh, PA, USA.
* [http://www.isca-speech.org/archive_open/ssw6/index.html SSW6], August 22-24, 2007, Bonn , Germany.
* [http://www.isca-speech.org/archive_open/ssw6/index.html SSW6], August 22-24, 2007, Bonn , Germany.
* [http://isw3.naist.jp/~tomoki/ssw7/www/ SSW7], September 22-24, 2010, Kyoto, Japan.
* [http://isw3.naist.jp/~tomoki/ssw7/www/ SSW7], September 22-24, 2010, Kyoto, Japan.

Revision as of 23:38, 8 September 2017

At an international conference on speech processing, a speech scientist once held up a tube of toothpaste (whose brand was "Signal") and, squeezing it in front of the audience, coined the phrase "This is speech synthesis; speech recognition is the art of pushing the toothpaste back into the tube."

One could turn this very simplistic view the other way round: users are generally much more tolerant of speech recognition errors than they are willing to listen to unnatural speech. There is magic in a speech recognizer that transcribes continuous radio speech into text with a word accuracy as low as 50%; in contrast, even a perfectly intelligible speech synthesizer is only moderately tolerated by users if it delivers nothing more than "robot voices". Delivering both intelligibility and naturalness has been the holy grail of speech synthesis research for the past 30 years. More recently, expressivity has been added as a major objective of speech synthesis.

Add to this the engineering costs (computational cost, memory cost, design cost for making another synthetic voice or another language) which have to be taken into account, and you'll start to have an idea of the challenges underlying text-to-speech synthesis.

Major challenges call for major meetings: the Speech Synthesis Workshops (SSWs), which are held every three years under the auspices of ISCA's SynSIG. SSWs provide a unique occasion for people in the speech synthesis area to meet each other. They contribute to establishing a feeling that we are all participating in a joint effort towards intelligible, natural, and expressive synthetic speech.

List of ISCA ITRW Speech Synthesis Workshops (SSW)

(Full papers are available on line)

  • SSW1, September 25-28, 1990, Autrans, France.
  • SSW2, September 12-15, 1994, Mohonk Mountain House, New Paltz, NY, USA.
  • SSW3 (Dedicated to the memory of Christian Benoît), November 26-29, 1998, Jenolan Caves House, Blue Mountains, Australia.
  • SSW4, August 29 - September 1, 2001, Atholl Palace Hotel, Pitlochry, Perthshire, Scotland.
  • SSW5, June 14-16, 2004, Pittsburgh, PA, USA.
  • SSW6, August 22-24, 2007, Bonn , Germany.
  • SSW7, September 22-24, 2010, Kyoto, Japan.
  • SSW8, August 31 - September 2, 2013, Barcelona, Spain.
  • SSW9, September 13-15, 2016, Sunnyvale, California, USA.