Visual mediums dominated recent decades of human computer interaction. The rise of speech as user interface highlights dependency on visual cues. We infer duration, progress, expectations and capabilities from color and layout. An intuition led visual experience eliminates the need to ask questions. We arrive at a predetermined destination like with GPS directions.
Speech alone must guide users to respond as expected or ask the right questions. Questions arise that a visual interface never allows. Such questions derail experiences if natural reentry points are absent. It’s easy to identify when things go wrong, but hard to imagine how they might go wrong without experiencing it. That’s why practicing a script is vital with audio experiences!
Let’s prototype an audio only experience for memorizing notes on a guitar fretboard.
That last question might sidetrack the conversation.
Assistant asked
What note is on the 2nd fret of the 1st string?
Reasonable responses might be questions:
We can give expectations before play begins, but the listener may not hear or understand. Repetition or clarifications would be natural in an unscripted conversation. The flow may get derailed. Let’s continue with a best case scenario.
This conversation could repeat as long as desired. The prompt to continue is a choice. Launching into the next quiz question would be acceptable. Either way exiting on command is important. Bad experiences occur when we force users to answer a question before exit. Never waste their time or discourage reuse.
Repeating a question is often necessary to keep the conversation moving:
The user finally received value from this conversation. They learned from their mistake. Enticing them to continue is our next task. We’ll gamify their experience by counting consecutive correct answers. They may choose to exit anyway. We’ll exit quickly with a memorable remark.
There are corners of that conversation we have not discussed. Let’s ignore them for now. Next time we’ll inspect the issues and how visuals enhance this experience.