@kautau

kautau@lemmy.world · 10 hours ago

It’s bad to break copyright if you do it, but it’s fine if they do it to train their models

kautau@lemmy.world · 11 hours ago

It’s probably deepseek r1, which is a “reasoning” model so basically it has sub-models doing things like running computation while the “supervisor” part of the model “talks to them” and relays back the approach. Trying to imitate the way humans think. That being said, models are getting “agentic” meaning they have the ability to run software tools against what you send them, and while it’s obviously being super hyped up by all the tech bro accellerationists, it is likely where LLMs and the like are headed, for better or for worse.