“Notably, O3-MINI, despite being one of the best reasoning models, frequently skipped essential proof steps by labeling them as “trivial”, even when their validity was crucial.”

  • Soyweiser@awful.systems
    link
    fedilink
    English
    arrow-up
    3
    arrow-down
    1
    ·
    edit-2
    6 days ago

    Yeah, I know, it is a personal thing from me. I have more of those, think it isn’t helpful to use certain too general terms in specific cases as then you cast a too wide net. I fun at parties. (It is also me poking fun at how the soviets called everybody who disagreed with them a reactionary)