Real time Text to Speech (Audio)

kevin · July 19, 2016, 4:08pm

Hi community,

I would like to make an avatar speak according to a given text.

I’m good at skinning animations, but have never tried the audio part in PlayCanvas.

Any suggestion on which text to speech API I should use and how to integrate it with PlayCanvas?

I can only think of google translation API, e.g.
http://translate.google.com/translate_tts?ie=UTF-8&total=1&idx=0&textlen=32&client=tw-ob&q=hello%20play%20canvas%20community&tl=En-gb

Thanks!

max · July 19, 2016, 4:13pm

There is a Web Speech API, that allows you to implement text-to-speech and speech-to-text.
That is only supported on some platforms: http://caniuse.com/#feat=speech-synthesis

kevin · July 20, 2016, 2:34pm

Hi Max,

It works fine. But it can only speak with the native system language and voice. The API call “window.speechSynthesis.getVoices()” returns an empty list on PlayCanvas. But it runs correct out of PlayCanvas. Any idea what went wrong?

Below is my code:

pc.script.create('SpeechTest', function (app) {
    // Creates a new SpeechTest instance
    var SpeechTest = function (entity) {
        this.entity = entity;
    };

    SpeechTest.prototype = {
        // Called once after all resources are loaded and before the first update
        initialize: function () {

            this.speak("It will automatically generate a wav file which you can easily get with an HTTP request through any");
        },

        // Called every frame, dt is time in seconds since last update
        update: function (dt) {
        },
        
        speak: function(phrase) {
            if(phrase === "") return;
            
            var speech = new SpeechSynthesisUtterance(phrase);
            var voices = window.speechSynthesis.getVoices();
            speech.voice = voices.filter(function(voice) { return voice.name == 'Google UK English Male'; })[0];
            window.speechSynthesis.speak(speech);
        }
    };

    return SpeechTest;
});

max · July 20, 2016, 2:50pm

Have you tried that method on exactly same platforms but in different pages?
I know that some platforms do have only few voices or even only one.

kevin · July 20, 2016, 3:21pm

Yes, on Chrome, but different pages. The method seems to have some certain delay.

kevin · July 21, 2016, 11:00am

I’m confused that the speechSynthesis works in a blank webpage but does not work in PlayCanvas.

Could you please help me solve this issue?

max · July 21, 2016, 11:17am

Voices are loaded in async in browser, so you need to get voices on callback, check this answer: http://stackoverflow.com/questions/21513706/getting-the-list-of-voices-in-speechsynthesis-of-chrome-web-speech-api#answer-22978802