Another factor that defines a context unit of a conversation, often underrated, is the TIMING of turn-taking.
That's especially true for a real-word real-time assistant as (your) Chris navigator.
I think TIMING (mainly the human utterances time distribution) is an important attribute to be added to the list. For example if user say
Call Alicia
The bot could reply:
Shall I call Alicia A on her mobile?
Afterward a possible user silence could be managed with a reprompt or a default choice (the mobile phone) if the user do not answer in X seconds.
Or the conversation (and the domain contect) could terminate after Y seconds of silence in the middle of a turn-taking.
What do you think about it?
Thanks