I’m using https://github.com/rhasspy/piper mostly to create some audiobooks and read some posts/news, but the voices available are not always comfortable to listen to.
Do you guys have any recommendation for a voice changer to process these audio files?
Preferably it’ll have a CLI so I can include it in my pipeline to process RSS feeds, but I don’t mind having to work through an UI.
Bonus points if it can process the audio streams.
That’s called text to speech, not a voice changer. A voice changer is the thing in the Darth Vader halloween masks.
There’s been discussion on TTS programs here recently: https://lemm.ee/search?q=tts&type=All&listingType=All&communityId=185&page=1&sort=TopAll
Or you can search via your local instance/interface.
Text to speech is what piper is doing.
What I’m looking for is called voice changer since I want to change a voice which already read something.
That’s exactly what I want: “the thing in the Darth Vader halloween masks” but for linux, preferably via CLI to ingest audio files and be able to configure it to change the voice as I want, not only Darth Vader.
Oh, I see. I think it would still be easier to either use a different voice in piper (the github page talks about this) or use a different tts program entirely.
So, all of the awkward pauses, the lack of inflection - you’re saying keep those, just change who it sounds like is speaking?