Android / llama.cpp / GGUF

LLM tester with llama.cpp

LLM tester with llama.cpp is an Android local-LLM testing app that lets you load GGUF models, tune inference and prompt-template settings, manage shared MCP / Function Definitions settings, inspect logs, and expose an Ollama/OpenAI-compatible API plus the bundled WebUI from one app.

  • Supports both model downloads from a URL and importing local .gguf files from the device.
  • Lets you combine generation settings, Think behavior, custom chat templates, shared MCP settings, and Function Definitions JSON.
  • Can start an on-device Ollama/OpenAI-compatible API and WebUI on the same port, including endpoints such as /api/chat and /v1/chat/completions.

Screenshots

LLM tester with llama.cpp main screen
Main screen with prompt input, model output, processing logs, and API server controls.
LLM tester with llama.cpp settings screen
Settings screen for model URLs, local GGUF import, configuration management, and inference parameters.
LLM tester with llama.cpp template settings
Template, API, language, and log settings, including System Prompt and custom chat template fields.

Key Features

Operational Notes