Channel: Configuration - Home Assistant Community

TTS streaming support


Are there any TTS engines that work in streaming mode aside from piper?

I have the latest version, and streaming mode works with local piper: if there is a long answer from the LLM, the assistant starts reading it before the whole output is on screen. However, there is a noticeable delay. That is especially annoying when the response is short (which is understandable, since synthesis is done on the local CPU).

It works much better with a GPU-accelerated piper. The slightly inconvenient part is that the assistant menu offers no easy way to tell two piper instances apart, but it’s not the end of the world.

However the voice quality is not as good as some other TTS engines out there.

In particular I like Kokoro, since it strikes a good compromise between voice quality and speed. It supports streaming when I use its web interface. However, when used from HA it waits until the whole output is on the screen. I’ve tried this implementation, which provides a Wyoming proxy to the OpenAI API, and this implementation, which adds an integration that can be pointed at an OpenAI API endpoint and then exposes it inside HA. In both cases it waits until the whole text is on the screen.

Looking at the diagram for the Wyoming proxy, it appears that the proxy intentionally waits for the whole text to arrive before passing it on to the TTS engine.
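To make it concrete what I mean by streaming on the proxy side, here is a rough sketch of the behaviour I would expect instead: buffer the incoming text and hand each complete sentence to the TTS engine as soon as it is available, rather than waiting for the end. This is not code from either implementation above. The function name `stream_sentences` and the simple punctuation-based splitting rule are my own assumptions for illustration.

```python
import re
from typing import Iterable, Iterator

# Assumed sentence boundary: '.', '!' or '?' followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def stream_sentences(chunks: Iterable[str]) -> Iterator[str]:
    """Yield complete sentences as soon as enough text has arrived.

    `chunks` stands in for the partial text coming from the LLM.
    Each yielded sentence could be sent to the TTS engine immediately.
    """
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        parts = _SENTENCE_END.split(buffer)
        # All parts except the last one are finished sentences.
        for sentence in parts[:-1]:
            if sentence:
                yield sentence
        # The last part may still be incomplete, so keep it buffered.
        buffer = parts[-1]
    # Flush whatever is left once the text stream ends.
    if buffer.strip():
        yield buffer.strip()


if __name__ == "__main__":
    llm_output = ["Hello the", "re. How are", " you? I am fi", "ne"]
    for sentence in stream_sentences(llm_output):
        print(sentence)
```

With something like this in the proxy, the first sentence could be synthesized while the LLM is still generating the rest, which is roughly what piper seems to do already.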

Does anyone know about either other Wyoming proxies that support streaming or other integrations that can do it?

And is this something that needs to be set up additionally on the HA side or on the TTS side? I suspect the latter, but I could be missing something.


