chat ai using llama.cpp
Henry is a fast, local-first AI chatbot built with Flask and llama.cpp. It runs quantized LLMs on your machine using the GGUF format — no cloud APIs, no data leaving your device.
misc
misc debug here.
Let me know
Take part in the project