Latest revision as of 17:17, 3 November 2023

Call for participation

The Blizzard Challenge 2023 Workshop is the culmination of the Blizzard Challenge 2023. Blizzard Challenge is an annual challenge to compare corpus-based speech synthesis on common databases and a large listening test. The aims of the workshop are to present the results from the listening tests and for participants in the Challenge to describe their systems.

Who can attend the workshop ?

The workshop is open to all and we encourage participation from anyone interested in speech synthesis. However, please follow the registration procedure below.

Who can submit a paper to the workshop ?

All participants in the Blizzard Challenge 2023 are required to submit a paper describing their entry. The paper submission instructions can be found at Blizzard Challenge 2023 Rules #PAPER.

Organizers of the Blizzard Challenge 2023

Olivier Perrotin & Gérard Bailly (Université Grenoble Alpes)
Simon King (University of Edinburgh)

Location and date

Venue: Same as SSW12 https://ssw2023.org/index.php/venue/

Maison de la Création et de l’Innovation – MaCI
339 Av. Centrale, 38400 Saint-Martin-d’Hères
Grenoble, France

Date: Tuesday 29th August 2023

Registration

If you are registered for SSW12 then you may attend the Blizzard Challenge 2023 without further registration - simply bring your SSW12 badge with you.
If you are attending the Blizzard Challenge 223 without registering for SSW12, please email blizzard-challenge-organisers@googlegroups.com to register.

Programme

We will have Live Oral Presentation for all teams that physically attend the workshop. The duration of each presentation is 15 min (half presentation, half Q&A).
We will watch Pre-recorded videos from all teams that cannot physically attend the workshop. The duration of each presentation is 6 min (3 min video + 3 min Q&A).

All times below are in Paris time (GMT+2).

09:00 - 10:15 | Welcome message and Summary of Blizzard Challenge 2023 (Presenter: Olivier Perrotin)

10:15 - 10:30 | Break

10:30 - 11:45 | System presentations: FastSpeech-based models (live - 5 * 15 min)
- LIUM-TTS - Laboratoire d'Informatique Le Mans Université (LIUM)
- GIPSA-lab - Univ. Grenoble Alpes, CNRS, Grenoble INP, France
- IMS - University of Stuttgart, Institute for Natural Language Processing, Germany
- MuLanTTS - Microsoft
- Samsung TTS - Samsung Electronics HQ and Samsung Research China, Beijing

11:45 - 12:00 | System presentations : FastSpeech- and Tacotron-based models (remote - 2 * 6 min)
- SCUT SCSE (remote) - South China University of Technology
- FireRedTTS (remote) - Xiaohongshu Inc.

12:00 - 13:30 | Lunch

13:30 - 14:30 | System presentations: Tacotron-based models (live - 4 * 15 min)
- AudioLabs - International Audio Laboratories Erlangen
- TTS-Cube - Adobe Systems, SCC
- La Forge - Ubisoft
- DeepZen - DeepZen Ltd.

14:30 - 15:00 | Break

15:00 - 16:00 | System presentations: Stochastic models (live and remote)
- Idiap - Idiap Research Institute, Martigny, Switzerland
- BIGAI - Beijing Institute of General Artificial Intelligence
- ~~CASIA Speech (remote) - Institute of Automation, Chinese Academy of Sciences~~ (no-show; no video provided)
- Xiaomi-ASLP (remote) - Xiaomi AI Lab and Audio Speech and Language Processing Group (ASLP@NPU), Northwestern Polytechnical University (video provided but no-show for Q&A)
- ~~Fruit Shell (remote) - University of Chinese Academy of Sciences~~ (no-show; no video provided)
- 10AI (remote) - Beijing Yiling Intelligence Technology Co., Ltd.
- IOA-ThinkIT (remote) - Institute of Acoustics of the Chinese Academy of Sciences (video provided but no-show for Q&A)
16:00 - 16:30 | Closing

Published proceedings

Proceedings are available at: https://www.isca-speech.org/archive/blizzard_2023/index.html

Slides of the summary presentation are available at: https://www.isca-speech.org/archive/pdfs/blizzard_2023/intro.pdf

Blizzard material (syntheses, results, statistical analyses) are available at: https://www.cstr.ed.ac.uk/projects/blizzard/data.html

@@ Line 36: / Line 36: @@
 * We will watch '''Pre-recorded videos''' from all teams that cannot physically attend the workshop. The duration of each presentation is 6 min (3 min video + 3 min Q&A).
-The full program with team slots will be given according to the pending confirmation of attendance of some teams. We will try to keep the beginning and end time of the day unchanged.
 All times below are in Paris time (GMT+2).
-* 09:00 - 10:00 | Welcome message and Summary of Blizzard Challenge 2023 (Presenter: Olivier Perrotin)
+* 09:00 - 10:15 | Welcome message and Summary of Blizzard Challenge 2023 (Presenter: Olivier Perrotin)
+* 10:15 - 10:30 | Break
-* 10:00 - 10:30 | System presentations: ''FastSpeech-based models'' (live - 2 * 15 min)
+* 10:30 - 11:45 | System presentations: ''FastSpeech-based models'' (live - 5 * 15 min)
 ** LIUM-TTS - Laboratoire d'Informatique Le Mans Université (LIUM)
 ** GIPSA-lab - Univ. Grenoble Alpes, CNRS, Grenoble INP, France
-* 10:30 - 11:00 | Break
-* 11:00 - 11:45 | System presentations: ''FastSpeech-based models'' (live - 3 * 15 min)
 ** IMS - University of Stuttgart, Institute for Natural Language Processing, Germany
 ** MuLanTTS - Microsoft
 ** Samsung TTS - Samsung Electronics HQ and Samsung Research China, Beijing
 * 11:45 - 12:00 | System presentations : ''FastSpeech- and Tacotron-based models'' (remote - 2 * 6 min)
 ** SCUT SCSE (remote) - South China University of Technology
 ** FireRedTTS (remote)  - Xiaohongshu Inc.
 * 12:00 - 13:30 | Lunch
@@ Line 68: / Line 60: @@
 ** La Forge - Ubisoft
 ** DeepZen - DeepZen Ltd.
 * 14:30 - 15:00 | Break
 * 15:00 - 16:00 | System presentations: ''Stochastic models'' (live and remote)
 ** Idiap - Idiap Research Institute, Martigny, Switzerland
 ** BIGAI - Beijing Institute of General Artificial Intelligence
-** CASIA Speech (remote) - Institute of Automation, Chinese Academy of Sciences
+** <s>CASIA Speech (remote) - Institute of Automation, Chinese Academy of Sciences</s> (no-show; no video provided)
-** Xiaomi-ASLP (remote) - Xiaomi AI Lab and Audio Speech and Language Processing Group (ASLP@NPU), Northwestern Polytechnical University
+** Xiaomi-ASLP (remote) - Xiaomi AI Lab and Audio Speech and Language Processing Group (ASLP@NPU), Northwestern Polytechnical University (video provided but no-show for Q&A)
-** Fruit Shell (remote) - University of Chinese Academy of Sciences
+** <s>Fruit Shell (remote) - University of Chinese Academy of Sciences</s> (no-show; no video provided)
 ** 10AI (remote) - Beijing Yiling Intelligence Technology Co., Ltd.
-** IOA-ThinkIT (remote) - Institute of Acoustics of the Chinese Academy of Sciences
+** IOA-ThinkIT (remote) - Institute of Acoustics of the Chinese Academy of Sciences (video provided but no-show for Q&A)
+* 16:00 - 16:30 | Closing
+= Published proceedings =
-* 16:00 - 16:30 | Closing
+Proceedings are available at: https://www.isca-speech.org/archive/blizzard_2023/index.html
-= Published proceedings =
+Slides of the summary presentation are available at: https://www.isca-speech.org/archive/pdfs/blizzard_2023/intro.pdf
-TBA
+Blizzard material (syntheses, results, statistical analyses) are available at: https://www.cstr.ed.ac.uk/projects/blizzard/data.html

Anonymous

Search

Blizzard Challenge 2023 Workshop: Difference between revisions

Namespaces

More

Page actions

Latest revision as of 17:17, 3 November 2023

Contents

Call for participation

Who can attend the workshop ?

Who can submit a paper to the workshop ?

Organizers of the Blizzard Challenge 2023

Location and date

Registration

Programme

Published proceedings

Navigation

Navigation

Special pages

Wiki tools

Wiki tools

Anonymous

Search

Blizzard Challenge 2023 Workshop: Difference between revisions

Latest revision as of 17:17, 3 November 2023

Call for participation

Who can attend the workshop ?

Who can submit a paper to the workshop ?

Organizers of the Blizzard Challenge 2023

Location and date

Registration

Programme

Published proceedings

Navigation

Wiki tools

Page tools