(1) Use the prebuilt container image published for llama-cpp-python:
https://github.com/abetlen/llama-cpp-python/pkgs/container/llama-cpp-python
(2) Mount Kubernetes storage (e.g. a PersistentVolumeClaim) at /models,
    and set the MODEL environment variable to the path of the right llama-model.gguf
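Steps (1) and (2) can be sketched as a single Deployment manifest. This is a minimal sketch, not a drop-in config: the names (llama-server, models-pvc), the :latest tag, and the model filename are assumptions to adapt to your cluster.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: llama-server                # hypothetical name
spec:
  replicas: 1
  selector:
    matchLabels:
      app: llama-server
  template:
    metadata:
      labels:
        app: llama-server
    spec:
      containers:
        - name: llama-cpp-python
          # Image from the GHCR package linked above; pin a specific tag in practice.
          image: ghcr.io/abetlen/llama-cpp-python:latest
          ports:
            - containerPort: 8000
          env:
            - name: MODEL
              value: /models/llama-model.gguf   # point at your .gguf file
          volumeMounts:
            - name: models
              mountPath: /models                # k8s storage mounted as /models
      volumes:
        - name: models
          persistentVolumeClaim:
            claimName: models-pvc               # hypothetical PVC holding the model
```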
(3) Expose port 8000 through a LoadBalancer Service so the server is reachable from outside the cluster
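Step (3) as a manifest sketch; the name and selector are assumptions that should match whatever labels your Deployment uses:

```yaml
apiVersion: v1
kind: Service
metadata:
  name: llama-server          # hypothetical name
spec:
  type: LoadBalancer          # asks the cloud provider for an external IP
  selector:
    app: llama-server         # must match the pod labels of your Deployment
  ports:
    - port: 8000
      targetPort: 8000
```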
(4) Browse to http://<external-ip>:8000/docs to explore the API in the auto-generated OpenAPI (Swagger) UI
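Besides the interactive /docs page, the server exposes OpenAI-compatible endpoints you can call directly. A sketch, assuming the LoadBalancer assigned 203.0.113.10 (substitute the EXTERNAL-IP that `kubectl get svc` reports for your Service); this needs the live server, so run it only against your deployment:

```shell
# Hypothetical external IP assigned by the LoadBalancer; substitute your own.
EXTERNAL_IP=203.0.113.10

# POST a completion request to the OpenAI-compatible endpoint.
curl -s "http://${EXTERNAL_IP}:8000/v1/completions" \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Q: What is a GGUF file? A:", "max_tokens": 32}'
```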