Local LLM: Ollama, llama.cpp, vLLM - Running Models on Self-Hosted Hardware — AI Engineering | MindForge