frogman [he/him]@beehaw.org to

Technology@beehaw.orgEnglish · 2 years ago

Mozilla Firefox new alt-text generator powered by "fully private on-device AI model"

hacks.mozilla.org

cross-posted to:
opensource@lemmy.ml

222

Mozilla Firefox new alt-text generator powered by "fully private on-device AI model"

hacks.mozilla.org

frogman [he/him]@beehaw.org to

Technology@beehaw.orgEnglish · 2 years ago

cross-posted to:
opensource@lemmy.ml

Experimenting with local alt text generation in Firefox Nightly – Mozilla Hacks - the Web developer blog

hacks.mozilla.org

Firefox 130 will feature an on-device AI model that automatically generates alt-text for images, integrated into its built-in PDF editor.

New accessibility feature coming to Firefox, an “AI powered” alt-text generator.

"Starting in Firefox 130, we will automatically generate an alt text and let the user validate it. So every time an image is added, we get an array of pixels we pass to the ML engine and a few seconds after, we get a string corresponding to a description of this image (see the code).

…

Our alt text generator is far from perfect, but we want to take an iterative approach and improve it in the open.

…

We are currently working on improving the image-to-text datasets and model with what we’ve described in this blog post…"

Chat

Quokka@quokk.au
link
fedilink
arrow-up
1·
2 years ago
Any multimodal llm could do this in a heart beat locally.

And OpenAI has made their shit freely available to run locally, it’s like the worst company to use as an example.
- photonic_sorcerer@lemmy.dbzer0.com
  link
  fedilink
  English
  arrow-up
  3·
  2 years ago
  Where is this freely available multimodal GPT4 you speak of?
  - Quokka@quokk.au
    link
    fedilink
    arrow-up
    1·
    2 years ago
    deleted by creator

Technology@beehaw.org

technology@beehaw.org

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@beehaw.org

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

130 users / day
1.11K users / week
2.56K users / month
6.56K users / 6 months
1 local subscriber
42.4K subscribers
4.66K Posts
74.3K Comments
Modlog