What AI services are you selfhosting? Or, have tested and passed on

kiol@lemmy.world · 1 year ago

What AI services are you selfhosting? Or, have tested and passed on

L_Acacia@lemmy.ml · 1 year ago

As much as I’d like to praise the open-weight models. Nothing comes close to Claude sonnet in my experience too. I use local models when info are sensitive and claude when the problem requires being somewhat competent.

What setup do you use for coding? I might have a tip for minimizing claude cost you depending on what your setup is.

ikidd@lemmy.world · 1 year ago

I’m using vscode/Roocode with Gosucoder shortprompt, with Requesty providing models. Generally I’ll use R1 to outline a project and Claude to implement. The shortprompt seems to reduce the context quite a bit and hence the cost. I’ve heard about Cursor but haven’t tried it yet.

When you’re using local models, which ones are you using? The ones I mention don’t seem to give me much I can use, but I’m also probably asking more of them because I see what Claude can do. It might also be a problem with how Roocode uses them, though when I just jump into a chat and ask it to spit out code, I don’t get much better.

L_Acacia@lemmy.ml · edit-2 1 year ago

If you are willing to pay 10$ a month. You should get GithubCopilot, it provides near unlimited claude 3.5 usage. RooCode can hook into the github copilot api, and use it for its generations.

I use Qwen Coder and Mistral small locally too. It works ok, but its nowhere near GPT/Claude in terms of response quality.

What AI services are you selfhosting? Or, have tested and passed on

What AI services are you selfhosting? Or, have tested and passed on

Testing Indiedroid Nova w/ 16gb ram - Learning Together