Lizonghang/prima.cpp prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters **Language:**C++ Total stars: 101 Stars trend: 15 Apr 2025 2am ██ +16 3am ██▋ +21 4am █ +8 5am ▌ +4 6am █ +8 7am ▌ +4 8am ▍ +3 9am ▍ +3 10am ▏ +1 11am ▋ +5 12pm ▋ +5 #cplusplus #distributedai, #llamacpp, #llminference, #ondevicellms


More photos from syxyzevaduqe