• theneverfox@pawb.social
    link
    fedilink
    English
    arrow-up
    7
    ·
    13 hours ago

    I think deepseek shook them enough to realize what should have been obvious for a while… Brute force doesn’t beat new techniques, and spending the most might not be the safest bet

    There’s a ton of new techniques being developed all the time to do things more efficiently, and if you don’t need a crazy context window, in many use cases you can get away with much smaller models that don’t need massive datacenters