I wanted Claude Code-style workflows without sending code to the cloud, so I built Loki

Dark-Alex-17@lemmy.world · 8 hours ago

I wanted Claude Code-style workflows without sending code to the cloud, so I built Loki

MalReynolds@slrpnk.net · 2 hours ago

Ollama is enshittifying at a rate of knots, have you got a way to use llama-server (or preferably llama-swap) instead ?

JollyForeheadRidges@lemmy.zip · 16 minutes ago

Crap. I was just starting to play with Ollama and thought it might be a good balance between running local models and using one of the proprietary services.

Could you elaborate on what’s happening with them / what to watch out for?

Dark-Alex-17@lemmy.world · 2 hours ago

Looking at Llama-swap, since it says it supports OpenAI-compatible API, it should just work natively already. Just set up the client to be type: openai-compatible and fill in the URL and provide the models. Should work out of the box!

MalReynolds@slrpnk.net · 2 hours ago

Hope so, bet it doesn’t without some tweaking though, OpenAI-compatible seldom is, and ollama is bad for that. Still, worth checking out, I’ll have a go at it sometime soonish and perhaps you’ll see a PR (or some doco in the best case scenario).

Dark-Alex-17@lemmy.world · 2 hours ago

Looking forward to it! Heads up in case you missed it: I had settled on renaming it to Coyote, so sometime this week will be a breaking change and release to get that done.

Biggest pains are just going to be updating the repo tokens for Crates.io and renaming the homebrew repo.

MalReynolds@slrpnk.net · 2 hours ago

K, I’ll circle back in a week or so…