Voice-first interaction
Players interact using natural language. The AI interprets intent rather than matching fixed phrases, confirms each step, and adapts its responses to context.
- Push-to-talk: hold the controller trigger while speaking for reliable ASR capture.
- AI confirmation: the AI clarifies ambiguous requests and guides users interactively.
- TTS + captions: spoken responses are paired with on-screen subtitles for accessibility.
- Disambiguation: when a request matches multiple objects, the system numbers and highlights the candidates.
- Fallback controls: controller or gaze click for confirmation, tool pickup, and locomotion in noisy environments.
Example intents: “Open the carpentry workshop”, “Repeat that step”, “Switch to Yoruba”, “Show my progress”, “What did I do wrong?”
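The disambiguation behaviour described above can be sketched as follows. This is a minimal illustration and not the project's implementation: `SceneObject`, `find_candidates`, `build_prompt`, and `resolve_choice` are hypothetical names, the substring match stands in for whatever intent and entity resolution the real system uses, and the number-word table is deliberately small.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SceneObject:
    """Hypothetical stand-in for an interactable object in the scene."""
    object_id: str
    label: str


# Assumed vocabulary for spoken replies; a real system would cover more.
NUMBER_WORDS = {"one": 1, "two": 2, "three": 3, "four": 4, "five": 5}


def find_candidates(query, objects):
    """Return objects whose label contains the query (case-insensitive)."""
    needle = query.strip().lower()
    return [obj for obj in objects if needle in obj.label.lower()]


def build_prompt(candidates):
    """Number the candidates so the player can answer with a number."""
    options = ", ".join(
        f"{i}: {obj.label}" for i, obj in enumerate(candidates, start=1)
    )
    return f"Which one do you mean? {options}"


def resolve_choice(reply, candidates):
    """Map a reply such as 'two' or '2' to a candidate, or None if invalid."""
    token = reply.strip().lower()
    index = NUMBER_WORDS.get(token)
    if index is None and token.isdecimal():
        index = int(token)
    if index is not None and 1 <= index <= len(candidates):
        return candidates[index - 1]
    return None


scene = [
    SceneObject("h1", "claw hammer"),
    SceneObject("h2", "sledge hammer"),
    SceneObject("s1", "hand saw"),
]
matches = find_candidates("hammer", scene)
if len(matches) > 1:
    # The prompt would be spoken via TTS and shown as a caption,
    # while the numbered candidates are highlighted in the scene.
    print(build_prompt(matches))
```

When `resolve_choice` returns `None` (for example, the reply was not understood or the number is out of range), the caller can repeat the prompt or hand off to the controller or gaze fallback.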