I’d rather not self-promote, as I’ve already shared a post in the relevant section.
But after using this method for a few days, I can confirm its usefulness, so I will duplicate the information in this section.
Now my J4205 can easily play back responses from LLMs.
In the future, I expect the Wyoming protocol, local servers, and cloud integrations for various TTS engines to gain streaming support. At that point this method will be less relevant.
But for now, my integration might already prove useful to someone out there.
- Copy the `streaming_tts_proxy` folder from this repository to your custom integrations.
- Restart the system.
- Add the streaming tts proxy integration.
- Specify the host and port (use `core-piper:10200` for the add-on).
- Specify the language and voice in the settings.
- Start testing.
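If the integration cannot connect, it may help to first confirm that something is listening on the host and port you entered. Below is a minimal sketch using only the Python standard library; `is_reachable` is a hypothetical helper name of mine, not part of the integration, and it only checks that a TCP connection opens, not that the server actually speaks Wyoming.

```python
import socket


def is_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened.

    This is only a plain TCP check (an assumption of this sketch);
    it does not verify that the server speaks the Wyoming protocol.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS failures.
        return False


# Example call (host and port taken from the steps above):
#   is_reachable("core-piper", 10200)
```

Note that a name like `core-piper` typically resolves only from inside Home Assistant's own container network, so run the check from there or substitute the IP address of your server.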
The only requirement is that your server must be able to generate 1 second of speech in no more than 1 second (RTF ≤ 1), preferably slightly faster.
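To make the RTF requirement concrete, here is a rough sketch of how you might compute it from your own measurements. The function names are mine, and the default audio format (22050 Hz, 16-bit, mono) is an assumption; check what your voice actually outputs.

```python
def pcm_duration_seconds(num_bytes: int, sample_rate: int = 22050,
                         sample_width: int = 2, channels: int = 1) -> float:
    """Duration of raw PCM audio. The defaults are an assumption, not a given."""
    return num_bytes / (sample_rate * sample_width * channels)


def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent generating / duration of the generated audio."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def can_stream(synthesis_seconds: float, audio_seconds: float) -> bool:
    """True when the server keeps up with playback (RTF <= 1)."""
    return real_time_factor(synthesis_seconds, audio_seconds) <= 1.0
```

For example, if your server needs 0.8 s to produce 1 s of audio, the RTF is 0.8 and playback should not stall; at 1.2 s per second of audio, the stream would fall behind.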
In the diagrams I tried to show how this works with different satellite versions; there are some inaccuracies, but the idea should be clear.