mozz@mbin.grits.dev to

Technology@beehaw.org · 2 years ago

"No, seriously. All those things Google couldn't find anymore? Top of the search pile. Queries that generated pages of spam in Google results? Fucking pristine on Kagi – the right answers, over and ov

pluralistic.net

161

"No, seriously. All those things Google couldn't find anymore? Top of the search pile. Queries that generated pages of spam in Google results? Fucking pristine on Kagi – the right answers, over and ov

pluralistic.net

mozz@mbin.grits.dev to

Technology@beehaw.org · 2 years ago

Pluralistic: Too big to care (04 Apr 2024) – Pluralistic: Daily links from Cory Doctorow

pluralistic.net

Chat

Irdial@lemmy.sdf.org
link
fedilink
English
arrow-up
8·
2 years ago
My issue with Kagi is that it relies on aggregate results from other search engine indices
- 👍Maximum Derek👍@discuss.tchncs.de
  link
  fedilink
  English
  arrow-up
  9·
  2 years ago
  So do DDG and a lot of other search engines. In addition to the time and cost of running a spider and maintaining a database (for little to no technological benefit these days), a lot of server admins will block crawlers that aren’t googlebot or msnbot/bingbot.
- Dark Arc@social.packetloss.gg
  link
  fedilink
  English
  arrow-up
  7·
  2 years ago
  It has its own index in addition to aggregating results.
- mozz@mbin.grits.devOP
  link
  fedilink
  arrow-up
  5·
  2 years ago
  Why is that an issue?
  - Irdial@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    9·
    2 years ago
    I don’t get the impression that Kagi intends to compete with major search engines. It is clearly marketed toward privacy-focused, tech-minded individuals. You can take that one of two ways. Either you are frustrated with the erosion of search engine quality due to advertising, or you disagree with the predatory practices such as data mining that comes along with such advertising. In both cases, the only real way to signal to major search engines that you disagree with these practices is to stop using their services (including their APIs).
    
    For example, I have been using DuckDuckGo for decades. At first, I had to compromise search result quality, but now it has enough users and support that results are on-par with the likes of Google.
    
    I do not think that Kagi is bad or that people should not use it. It simply isn’t for me, because it does not actually address the reasons I do not use search engines like Google.
    - Nia_The_Cat@beehaw.org
      link
      fedilink
      arrow-up
      17·
      2 years ago
      deleted by creator
      - Irdial@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        5·
        2 years ago
        Yes, and I largely disagree with it :/
        
        Atemu@lemmy.ml
        link
        fedilink
        arrow-up
        2·
        2 years ago
        I think you’re underestimating how huge of an undertaking a half-decent search index is, much less a good one.
        
        Nia_The_Cat@beehaw.org
        link
        fedilink
        arrow-up
        2·
        edit-2
        2 years ago
        deleted by creator
        
        jevans ⁂@lemmy.ml
        link
        fedilink
        arrow-up
        4·
        edit-2
        6 months ago
        deleted by creator
        
        Zworf@beehaw.org
        link
        fedilink
        arrow-up
        2·
        edit-2
        2 years ago
        Lol. I typed the name of my hometown and the two first results were escort sites from that area.
        
        I mean, either it knows me really well and their privacy claims are wrong 🤭 Or it has a funny way of prioritising indexes.
        
        Blaze (he/him)@lemmy.zip
        link
        fedilink
        arrow-up
        1·
        2 years ago
        Thanks
        
        Nia_The_Cat@beehaw.org
        link
        fedilink
        arrow-up
        1·
        2 years ago
        deleted by creator
        
        Zworf@beehaw.org
        link
        fedilink
        arrow-up
        1·
        2 years ago
        On the other hand, it doesn’t really matter so much anymore.
        
        LLM is the new search. I can ask it the actual question I have and it will give me the answer. If it’s not exactly what I need I can ask it to specify further.
        
        Contrast that with a search engine that just gives me a ton of bookmarks to sift through to see if they actually might answer my question or are just clickbait.
        
        Of course there’s still some times when you need search, like when you need to find an actual website, or when you need a source reference. But really the need for me is greatly reduced now.
        
        TehPers@beehaw.org
        link
        fedilink
        English
        arrow-up
        2·
        2 years ago
        Be careful relying on LLMs for “searching”. I’m speaking from experience here - getting actually accurate results from the current generation of LLMs, even with RAG, is difficult. You might get accurate results most of the time (even 80% or more), but it can be difficult to identify the inaccurate results due to the confidence models present their output with when hallucinating.
        
        Also, if your LLM isn’t doing retrieval-augmented generation (RAG), then it isn’t actually a search and won’t find results more recent than the data it was trained off of.
        
        Zworf@beehaw.org
        link
        fedilink
        arrow-up
        1·
        2 years ago
        I know. But I’m often not really looking for accuracy. I just need to know something for myself. Most of the stuff I look up is absolutely not critically important. It’s not like I’m trying to write a PhD dissertation or something.
        
        I know it can be inaccurate but I can verify the results (and they usually are totally fine).
    - mozz@mbin.grits.devOP
      link
      fedilink
      arrow-up
      7·
      edit-2
      2 years ago
      I think it’s not that complicated – Kagi’s search results are just far more useful. I think it’s marketed at people who want good search results, not anything dealing with privacy (although, Kagi doesn’t log your searches, so it’s fully private for most everyday definitions) – your viewpoint for you makes perfect sense to me and sure I respect it, but I don’t think it’s right to say that people are linking their credit cards to do a have-to-be-logged-in-first search on Kagi chiefly for reasons of privacy focus.
      
      (I just tried the same experiment Doctorow tried, of searching for something that I’d been unable to find through Google, and Kagi did the same thing for me that it did for him (i.e. found it). That’s actually not important enough for me to pay for Kagi, but “Google is shit now” is no fringe opinion and it’s pretty easy to verify that Kagi does in practice work markedly better.)

Technology@beehaw.org

technology@beehaw.org

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@beehaw.org

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

191 users / day
957 users / week
2.51K users / month
6.56K users / 6 months
1 local subscriber
42.4K subscribers
4.66K Posts
74.3K Comments
Modlog