SuspiciousCarrot78@aussie.zone to Selfhosted@lemmy.worldEnglish · 17 hours agoDo you host your own AI?message-squaremessage-square151linkfedilinkarrow-up1124file-text
arrow-up1124message-squareDo you host your own AI?SuspiciousCarrot78@aussie.zone to Selfhosted@lemmy.worldEnglish · 17 hours agomessage-square151linkfedilinkfile-text
minus-squareSuspiciousCarrot78@aussie.zoneOPlinkfedilinkEnglisharrow-up5·7 hours agoLlama.cpp or death!
minus-squaretristynalxander@mander.xyzlinkfedilinkEnglisharrow-up1·15 minutes agoIt’s not that hard to use llama.cpp directly anyway. Why would I use a wrapper when I can just run a python script?
minus-squarebrucethemoose@lemmy.worldlinkfedilinkEnglisharrow-up1·1 hour agoOr exllama! Vllm, sglang, Lorax. Koboldcpp, Aphrodite, text-generation-webui, LM Studio, powerinfer, ktransformers, mlc-LLM, really whatever floats your boat. Just not ollama, specifically.
Llama.cpp or death!
It’s not that hard to use
llama.cppdirectly anyway. Why would I use a wrapper when I can just run a python script?Or exllama! Vllm, sglang, Lorax. Koboldcpp, Aphrodite, text-generation-webui, LM Studio, powerinfer, ktransformers, mlc-LLM, really whatever floats your boat. Just not ollama, specifically.