Bundled plugin guides
llama.cpp Provider
llama-cpp is the official external provider plugin for local GGUF embeddings.
It owns the node-llama-cpp runtime dependency used by
memorySearch.provider: "local".
Install it before using local memory embeddings:
openclaw plugins install @openclaw/llama-cpp-providerThe main openclaw npm package does not include node-llama-cpp. Keeping the
native dependency in this plugin prevents normal OpenClaw npm updates from
deleting a manually installed runtime inside the OpenClaw package directory.
Configuration
Set the memory search provider to local:
{ agents: { defaults: { memorySearch: { provider: "local", local: { modelPath: "hf:ggml-org/embeddinggemma-300m-qat-q8_0-GGUF/embeddinggemma-300m-qat-Q8_0.gguf", }, }, }, },}The default model is embeddinggemma-300m-qat-Q8_0.gguf. You can also point
local.modelPath at a local .gguf file.
Native Runtime
Use Node 24 for the smoothest native install path. Source checkouts using pnpm may need to approve and rebuild the native dependency:
pnpm approve-buildspnpm rebuild node-llama-cppFor lower-friction local embeddings, use a local service provider such as Ollama or LM Studio instead.