• Voroxpete@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    6
    ·
    edit-2
    19 days ago

    Training data. They’re absolutely desperate for it.

    Each tiny incremental improvement to an LLM requires an exponential increase in the anount of training data, to the point where Earth is literally running out, and the most accessible sources like Reddit and Twitter have locked the doors and put up price lists.

    They’re starving dogs hunting for scraps.