SuspiciousCarrot78@aussie.zone to Selfhosted@lemmy.worldEnglish · 13 hours agoDo you host your own AI?message-squaremessage-square133linkfedilinkarrow-up192arrow-down125file-text
arrow-up167arrow-down1message-squareDo you host your own AI?SuspiciousCarrot78@aussie.zone to Selfhosted@lemmy.worldEnglish · 13 hours agomessage-square133linkfedilinkfile-text
minus-squarefubarx@lemmy.worldlinkfedilinkEnglisharrow-up1·6 hours agoFound vLLM to be the most efficient local runtime service. And “ray” as a good (but complicated) way to distribute the load: https://docs.ray.io/
Found vLLM to be the most efficient local runtime service. And “ray” as a good (but complicated) way to distribute the load: https://docs.ray.io/