Prerequisites
- A working OpenClaw or Hermes Agent install (the host).
- A media worker that bridges the live Teams call into the plugin over an HMAC WebSocket (the worker renders the avatar tile and forwards audio/video; it dials this plugin).
- A realtime voice provider key (for realtime mode) - e.g. OpenAI or Azure OpenAI. Streaming mode uses your host’s configured STT/TTS instead.
- OpenClaw (npm)
- Hermes (pip)
Configure
Add a Full reference → OpenClaw configuration.
msteams-voice entry to your OpenClaw config - sharedSecret must match the worker: