Do you guys think there will ever be a FOSS voice assistant?

milkytoast@kbin.social · 2 years ago

Do you guys think there will ever be a FOSS voice assistant?

Mathieu@beehaw.org · 2 years ago

Home Assistant invested quite a bit into the technology to create a FOSS voice assistant over the past year. It still needs quite a bit of work, but the foundation is there; it supports wake words (“Hey …”), speech-to-text to hear your command, interpretation and command processing, and text-to-speech to return results.
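
To make those four stages concrete, here is a rough sketch of how they chain together. This is not Home Assistant's code: the `VoicePipeline` class, the stub functions, and the "hey assistant" wake phrase are all made up for illustration, and the "audio" is just UTF-8 text in a bytes object so the example runs without a microphone or any speech models.

```python
from dataclasses import dataclass
from typing import Callable, Optional

WAKE_PHRASE = b"hey assistant"  # hypothetical wake phrase for this sketch


@dataclass
class VoicePipeline:
    """Chains the four stages: wake word, speech-to-text, intent handling, text-to-speech."""
    detect_wake_word: Callable[[bytes], bool]
    speech_to_text: Callable[[bytes], str]
    handle_intent: Callable[[str], str]
    text_to_speech: Callable[[str], bytes]

    def run(self, audio: bytes) -> Optional[bytes]:
        # Ignore anything that does not start with the wake word.
        if not self.detect_wake_word(audio):
            return None
        text = self.speech_to_text(audio)
        reply = self.handle_intent(text)
        return self.text_to_speech(reply)


# Stub stages. Real ones would wrap a wake word engine, an STT model, and so on.
def fake_wake_word(audio: bytes) -> bool:
    return audio.lower().startswith(WAKE_PHRASE)


def fake_speech_to_text(audio: bytes) -> str:
    # Drop the wake phrase and "transcribe" the rest.
    return audio[len(WAKE_PHRASE):].decode("utf-8").strip().lower()


COMMANDS = {"turn on the lights": "Lights on."}  # tiny stand-in command library


def fake_intent_handler(text: str) -> str:
    return COMMANDS.get(text, "Sorry, I don't know that command.")


def fake_text_to_speech(reply: str) -> bytes:
    return reply.encode("utf-8")


pipeline = VoicePipeline(
    detect_wake_word=fake_wake_word,
    speech_to_text=fake_speech_to_text,
    handle_intent=fake_intent_handler,
    text_to_speech=fake_text_to_speech,
)
```

Because each stage is just a callable, any one of them can be swapped out without touching the others, which is roughly why other projects can reuse the same pieces.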

The downsides are that it’s still quite technical to set up primarily due to the lack of commercially available hardware, and the command library is fairly small at this point.

With some of this foundational work out of the way, I expect Home Assistant to move forward quickly to improve, and other projects can work off the same pieces if they desire to as well.

Here’s their year-end post about it: https://www.home-assistant.io/blog/2023/12/13/year-of-the-voice-chapter-5/

milkytoast@kbin.social · 2 years ago

should have clarified, I’m not looking for a home assistant, I’m looking for a voice assistant on my phone. either way super excited to see where they take this

Atemu@lemmy.ml · 2 years ago

I don’t see how being home-assistant excludes it from working on your phone. The only difference is that your phone acts as the “satellite” rather than a stationary device.

Litanys@lem.cochrun.xyz · 2 years ago

While that’s true, all the things they built are individual and open source, so it wouldn’t take too much work for someone who knows how to package things up for a phone app. That said, you’d need another device to do all the processing.

seang96@spgrn.com · 2 years ago

I have been trying to get the Home Assistant voice assistant to work in my Kubernetes cluster. The documentation is nearly nonexistent for configuring it without using their dedicated core OS version with the add-on store.

The second issue is the $13 ESPHome voice assistant setup they have: the integration requires a UDP port for every audio stream. Home Assistant currently picks a random UDP port, which sucks for Kubernetes; with Docker you have to use host networking mode. Someone made a patch that allows you to specify your own range, but I haven’t gotten it working with the patch yet. It looks like there may be an issue with the ESPHome device not using the correct channel for the microphone, so nothing is being recorded.
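
For anyone wondering why a fixed range helps: with a random port you cannot know ahead of time which port to expose from a container, but with a known range you can expose just that range. Below is a minimal sketch of the idea, not the actual patch and not Home Assistant's code. The function name `bind_udp_in_range` and the 40000-40010 range are made up for illustration.

```python
import socket
from typing import Tuple


def bind_udp_in_range(host: str, first_port: int, last_port: int) -> Tuple[socket.socket, int]:
    """Bind a UDP socket to the first free port in [first_port, last_port]."""
    for port in range(first_port, last_port + 1):
        sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
        try:
            sock.bind((host, port))
        except OSError:
            # Port already taken (or not bindable): clean up and try the next one.
            sock.close()
            continue
        return sock, port
    raise OSError(f"no free UDP port in range {first_port}-{last_port}")


# One socket per audio stream, all drawn from a predictable range.
stream_a, port_a = bind_udp_in_range("127.0.0.1", 40000, 40010)
stream_b, port_b = bind_udp_in_range("127.0.0.1", 40000, 40010)
```

The second call skips whatever port the first stream already holds, so each stream still gets its own port, just from a range you can open up in your container or firewall config.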

sub_o@beehaw.org · 2 years ago

I don’t know if it fits all the prereqs of FOSS, but there’s Mycroft and there’s also Jasper

But I have no idea how advanced they are, or how good their 3rd party integrations are.

SecurityPro@lemmy.ml · 2 years ago

Mycroft is defunct

UprisingVoltage@feddit.it · 2 years ago

Source?

paperemail@links.rocks · 2 years ago

They probably mean this: https://mycroft.ai/blog/update-from-the-ceo-part-1/

SecurityPro@lemmy.ml · 2 years ago

https://www.theregister.com/2023/02/13/linux_ai_assistant_killed_off/

Wikipedia entry: “In early 2023, Mycroft AI ceased development.”

Arcka@midwest.social · 2 years ago

FYI, a fork is being carried on by interested parties: https://community.mycroft.ai/t/faq-ovos-neon-and-the-future-of-the-mycroft-voice-assistant/13496

spaduf@slrpnk.net · 2 years ago

Probably for the best. They’d been spinning their wheels while sucking most of the oxygen out of the room for several years now. Time for somebody else to give it a go

Cyborganism@lemmy.ca · 2 years ago

Is it delivery? No! It’s Dicio.

I’ll see myself out…

xoggy@programming.dev · 2 years ago

I don’t mean to Dicio but that was a bad pun.

Cyborganism@lemmy.ca · 2 years ago

😂

Sharpiemarker@startrek.website · 2 years ago

Don’t forget your pizza

dom@lemmy.ca · 2 years ago

Home Assistant is getting into voice assistants. I’m considering getting a few to try it out

IcyPenguin@beehaw.org · 2 years ago

There’s Mycroft AI

Daxtron2@startrek.website · 2 years ago

That’s super cool, might set up a pi to run this

IcyPenguin@beehaw.org · 2 years ago

Mycroft actually sells devices (pretty similar to the Amazon Echo lineup) that are powered by Raspberry Pis. But making your own is much cheaper.

rar@discuss.online · 2 years ago

There could be a software implementation that works perfectly fine on desktop PCs, especially Linux, but the problem is hardware. I don’t see commercial smartphone manufacturers giving access for the ‘unauthorized uses’ that FOSS projects usually rely on.

4dpuzzle@beehaw.org · 2 years ago

You’re right. The ‘open source’ Android phones are the perfect example. But FOSS needs to stop relying on these fascist hardware stacks and opt for better open modular platforms. We have examples for such things, like the Framework laptops or Fairphones. It’s somewhat tolerable for laptops. But we are still too far behind in terms of mobiles and desk boxes needed for these sorts of projects.

drwankingstein@lemmy.dbzer0.com · 2 years ago

considering android allows you to actively change the default assistant it won’t be a problem, we already have plenty of apps that use overlays that are foss so that’s not an issue either, so I really have no idea what you think would be locked down here.

dicio is just kind of a clunky app

Azzu@lemm.ee · 2 years ago

I don’t know Dicio, but I mean can you just leave the app open? Because that’s essentially what the other assistants are, just devices with the app always open.

If you can leave the app open, and it otherwise complies with your requirements, then we already have a FOSS voice assistant, it just doesn’t have its own dedicated hardware yet. But if you would dedicate some hardware to it, like an old phone, then it could be largely equivalent.

Jeena@jemmy.jeena.net · 2 years ago

I’m using https://rhasspy.readthedocs.io/en/latest/ together with HomeAssistant which does what you describe. It combines a lot of different things into one nice UI, one of the things is listening to a wake word with help of one of those:

Raven
Porcupine
Snowboy
Mycroft Precise
Pocketsphinx
External Command

With some of them you can even train it to use your own wake word.

Jeena@jemmy.jeena.net · 2 years ago

Oh and because last year was the year of the voice in HomeAssistant, they built it into HomeAssistant itself so you don’t need to set it up as clumsily as I did: https://www.home-assistant.io/blog/2023/10/12/year-of-the-voice-chapter-4-wakewords/

RachelRodent@lemmy.dbzer0.com · 2 years ago

there is kinda, you can get koboldAI which is open source and get an actually foss model to run with it and slap the program kobold assistant on top of it

stallmer@sopuli.xyz · 2 years ago

There’s one you can use with Home Assistant that works pretty well for home automation commands.

I’ve just found that I don’t really like using voice control for things…

Drinvictus@discuss.tchncs.de · 2 years ago

There is no reason why it should not exist other than the fact that there really is no interest. Except for a few uses here and there (driving for example), voice assistants are just gimmicks.

state_electrician@discuss.tchncs.de · 2 years ago

I mean, that’s like your personal opinion and not some objective fact.

Drinvictus@discuss.tchncs.de · 2 years ago

Not really. Even Amazon, Apple and Google have been investing in assistants less and less. They have had massive layoffs from voice assistant teams.

https://www.theregister.com/2022/11/23/voice_assistants_fail/

https://www.bbc.com/news/business-64371426

"But it’s not clear whether they are money-making opportunities. Reports say most interactions are relatively simple tasks like checking the weather, or playing music.

More broadly, according to one report, over the past three years voice assistant use has been falling and another report suggests that the adoption of smart speakers is slowing."

So no, it’s not just “my opinion”. But sure, just downvote and fuck off.

milkytoast@kbin.social · 2 years ago

I think that that just means that there’s not much point in developing them further. they’re still great for the simple tasks, like texting while driving.

state_electrician@discuss.tchncs.de · 2 years ago

I mean, that only says something about the money making part and not the pure usability. For many people the commercial options are a gimmick, sure. But are these options, with a clear focus on milking the customers for money, really the ultimate state for voice assistants? I’d argue they are not. There is a space for free voice assistants that let users control their data and that still provide a value. Beyond users with disabilities that make it hard to impossible to use computers, voice assistants won’t ever do something you cannot do with a computer (which includes smartphones). If that makes you consider them a gimmick then I don’t have an argument. But I think it is nice and convenient to be able to use a computer with your voice while doing something else.