Education: Difference between revisions
From SynSIG
Line 60: | Line 60: | ||
== Keynotes & tutorials at International conferences and workshops == | == Keynotes & tutorials at International conferences and workshops == | ||
* [http://www.eusipco2017.org/wp-content/uploads/2017/09/SimonKing_Keynote-talk_EUSIPCO_2017.pdf Simon King | * [http://www.eusipco2017.org/wp-content/uploads/2017/09/SimonKing_Keynote-talk_EUSIPCO_2017.pdf Simon King, Speech synthesis: where did the signal processing go? @ EUSIPCO2016] | ||
* [http://www.speech.zone/courses/one-off/merlin-interspeech2017/ Deep Learning for Text-to-Speech Synthesis, using the Merlin toolkit @ Interspeech 2017] | * [http://www.speech.zone/courses/one-off/merlin-interspeech2017/ Simon King, Oliver Watts, Srikanth Ronanki, Zhizheng Wu, Felipe Espic, Deep Learning for Text-to-Speech Synthesis, using the Merlin toolkit @ Interspeech 2017] | ||
* [https://www.superlectures.com/interspeech2016/isca-medalist-for-leadership-and-extensive-contributions-to-speech-and-language-processing | * [https://www.superlectures.com/interspeech2016/isca-medalist-for-leadership-and-extensive-contributions-to-speech-and-language-processing John Makhoul: A 50-year retrospective on speech and languag processing @ Interspeech 2016] | ||
* [https://www.superlectures.com/odyssey2016/voice-conversion-and-spoofing-countermeasures-for-speaker-verification Voice conversion and spoofing countermeasures for speaker verification @ Odyssey 2016] | * [https://www.superlectures.com/odyssey2016/voice-conversion-and-spoofing-countermeasures-for-speaker-verification Haizhou Li, Voice conversion and spoofing countermeasures for speaker verification @ Odyssey 2016] | ||
* [https://www.superlectures.com/odyssey2016/understanding-individual-level-speech-variability-from-novel-speech-production-data-to-robust-speaker-recognition Understanding individual-level speech variability: From novel speech production data to robust speaker recognition @ Odyssey 2016] | * [https://www.superlectures.com/odyssey2016/understanding-individual-level-speech-variability-from-novel-speech-production-data-to-robust-speaker-recognition Shri Narayanan, Understanding individual-level speech variability: From novel speech production data to robust speaker recognition @ Odyssey 2016] | ||
* [https://www.superlectures.com/iscslp2014/tutorial-4-deep-learning-for-speech-generation-and-synthesis Deep Learning for Speech Generation and Synthesis @ ISCSLP 2014] | * [https://www.superlectures.com/iscslp2014/tutorial-4-deep-learning-for-speech-generation-and-synthesis Yao Qian and Frank K. Soong, Deep Learning for Speech Generation and Synthesis @ ISCSLP 2014] | ||
* [https://www.superlectures.com/odyssey2014/speaking-in-adverse-conditions-from-behavioural-observations-to-intelligibility-enhancing-speech-modifications Speaking in adverse conditions: from behavioural observations to intelligibility-enhancing speech modifications @ Odyssey 2014] | * [https://www.superlectures.com/odyssey2014/speaking-in-adverse-conditions-from-behavioural-observations-to-intelligibility-enhancing-speech-modifications Martin Cooke, Speaking in adverse conditions: from behavioural observations to intelligibility-enhancing speech modifications @ Odyssey 2014] | ||
* [https://www.superlectures.com/asru2011/speech-synthesis-as-a-statistical-machine-learning-problem Keiichi Tokuda, Speech Synthesis as A Statistical Machine Learning Problem @ ASRU 2011] | |||
== Other kind of teaching materials == | == Other kind of teaching materials == |
Revision as of 04:06, 12 September 2017
SPCC
The Speech Processing Courses in Crete (SPCC) are targeting to teach graduate students and researchers the latest advancements of speech processing covering theory, hands on, and establishing contacts between the academics and industry. The school will provide the chance to students and professionals to meet world leaders in speech technology, exchanging ideas, sharing experiences and vision. The Summer School is organized by the University of Crete, Greece.
2017
- webpage: http://spcc.csd.uoc.gr/
- the school topic is: Towards Intelligible and Conversational Speech Synthesis Engines, The topic includes:
- Modern Acoustic Modelling Approaches (DNN/LSTM, WaveNet)
- Text Normalization and Linguistic Analysis
- Prosody
- Advanced Vocoders and Modifications (Voice Conversion)
- Intelligibility and Cognitive Effort in Speech Synthesis
2016
- webpage: http://spcc.csd.uoc.gr/SPCC2016/
- the school topic is: Advancements in Modern Speech Synthesis Engines, The topic includes:
- Advanced Speech Signal Modelling and Modifications
- Current Acoustic Modelling Approaches
- Challenges in Fornt-End Processing
- Listening Context Aware Speech Synthesis Systems
- Text Normalization and Linguistic Analysis
Lecture slides used for SPCC 2016 are available online
- Dr. Yannis Agiomyrgiannakis, Google UK : Vocaine the Vocoder
- Prof. Yannis Stylianou, University of Crete : Adaptive Sinusoidal Models
- Prof. Yannis Stylianou, University of Crete : Sinusoidal Modeling
- Prof. Simon King, University of Edinburgh : Text Processing for Speech Synthesis
- Dr. Spyros Raptis, ILSP Athena : Unit-Selection-based Text-To-Speech Synthesis
- Prof. Simon King, University of Edinburgh : Speech Synthesis with Hidden Markov Models
- Dr. Vassilis Tsiaras, University of Crete : Linear Dynamical Models in Speech Synthesis
- Dr. Yannis Agiomyrgiannakis, Google UK : Vocoder-side Voice Morphing for TTS
- Dr. Heiga Zen, Google UK : Artificial Neural Network based Speech Synthesis
- Dr. Masami Akamine, Toshiba Japan : Closed Loop Diphone-based Text-To-Speech Synthesis
- Prof. Yannis Stylianou, University of Crete : Speech Intelligibility
- Prof. Simon King, University of Edinburgh : Evaluating Speech Synthesis
- Prof. Simon King, University of Edinburgh : Hybrid Speech Synthesis
2015
- webpage: http://spcc.csd.uoc.gr/SPCC2015/
- the school topic is: From Diphones to Modern Speech Synthesis Engines, The topic includes:
- Speech Signal Modelling and Modifications
- Acoustic Modelling: HMM, LDM, DNN
- Approaches: Diphones, Unit Selection, Statistical, Hybrid
- Listening Context Aware speech synthesis systems
2014
- webpage: http://spcc.csd.uoc.gr/SPCC2014/
Lecture slides used for SPCC 2014 are available online
- Prof. Yannis Stylianou, University of Crete, Greece : Welcome and Introduction
- Prof. Yannis Stylianou, University of Crete, Greece : Speech Production and Modeling
- Prof. Yannis Stylianou, University of Crete, Greece : Voice Pathology
- Prof. Rainer Martin, Intitute of Communication Acoustics, Germany : Signal Processing for Hearing Aids
- Prof. Gerasimos Potamianos, University of Thessaly, Greece : Automatic Speech Recognition
- Dr Milica Gasic, University of Cambridge, U.K. : Spoken Dialogue Systems I
- Dr Milica Gasic, University of Cambridge, U.K. : Spoken Dialogue Systems II
- Dr Olivier Pietquin, Univeristy of Lille 1, France : Statistical Dialogue Modeling
- Dr Nassos Katsamanis, NTUA, Athens : Distant Speech Recognition
Keynotes & tutorials at International conferences and workshops
Other kind of teaching materials
- An introductory course on speech processing (in French and English) by Thierry Dutoit, , Faculté Polytechnique de Mons, Belgium.
- TTSBOX, A Matlab tutorial toolbox on corpus-based Text-to-Speech synthesis, by Thierry Dutoit, Faculté Polytechnique de Mons, Belgium.
Samples
- Access to some speech samples
Educational Softwares
- See our list of educational softwares
Historical images
Take a look at our gallery of historical images.
External Links
- See our external references page
- Dennis Klatt's History of Speech Synthesis,
- Examples of Synthesized Speech,
- "The Talking Computer": Text to Speech Synthesis (J.P. Olive in Hal's Legacy, MITPress),
- German-TTS and emotional synthesis ([1] and [2]) demo by Felix Burkhardt.