I’m @froztbyte more or less everywhere that matters

  • 21 Posts
  • 2.54K Comments
Joined 3 years ago
cake
Cake day: July 2nd, 2023

help-circle
  • slowly burning through the latest BTB eps (on epstein), getting to the point in ep3 about the discussions/events around ~2016 and just … god

    the remarks and then-recent actions, wrt affecting the internet and the social technological comms structure of humanity as a whole, and then rapidity of a bunch of shit starting to turn to shit 2014~2016 (as I’ve remarked on in previous posts)…

    I’d want to see more threads checked into and researched in depth (and I know that some stuff (partly?) also had their own drivers), but fucking hell there’s a lot of apparent overlap. dunno if I can take on that investigation (my stats derivation/calculation skills border on a warcrime), but other than that would be interesting to see some analyses



















  • in today’s news about magical prompts that super totes give you superpowers:

    We introduced SKILLSBENCH, the first benchmark to systematically evaluate Agent Skills as first-class artifacts. Across 84 tasks, 7 agent-model configurations, and 7,308 trajectories under three conditions (no Skills, curated Skills, self-generated Skills), our evaluation yields four key findings: (1) curated Skills provide substantial but variable benefit (+16.2 percentage points average, with high variance across domains and configurations); (2) self-generated Skills provide negligible or negative benefit (–1.3pp average), demonstrating that effective Skills require human-curated domain expertise

    I am jack’s surprised face

    …and given I have other yaks, I shall not step on my “software and tools don’t have to suck” soapbox right now