Do you host your own ML / AI / LLM? What do you use, and what do you use it for?

  • Franconian_Nomad@feddit.org
    link
    fedilink
    English
    arrow-up
    2
    ·
    4 hours ago

    Yeah, a higher quant would be nice, I actually try not to go below Q5, but you can domino’s so much with 16GB of VRAM and the ddr4 system RAM.

    But I must say I‘m pretty impressed by Qwen3.6-35b, not only from its capabilities but also from hardware requirements. MoE for the win I guess.

    RWKV sounds interesting, have to look into it, thanks!