This speech-gaze data set was collected in the Interior Decoration domain.
<object name="dresser_desk" id="12"> <properties> <type text="dresser#n#4">desk dresser table vanity</type> </properties> </object> ...In the above example,
<user_input> <gaze> <selection> <entity text="chair_green">0.190656</entity> <entity text="door">0.069329</entity> <entity text="lamp_bank">0.229842</entity> <entity text="dresser">0.510173</entity> </selection> </gaze> <speech> <entity_annotation>chair_green</entity_annotation> <transcription>I like the fact that there's a chair in front of the</transcription> <waveform>wav\2_5100_4190.wav</waveform> </speech> </user_input>In the above example,