Gesture use during public speaking can be analyzed as an aspect of speaker performance as well as an indicator of speaker emotions. Recognizing gesture quality rather than identifying specific gestures presents an underexplored challenge compared to traditional gesture recognition in virtual reality (VR). To that end, we use a VR headset and controllers to create a database of 162 five-second-long gestures. Next, we ask ten judges to evaluate the quality of gestures in three dimensions, i.e., dynamics, range, and between-hand distance, on a low-medium-high scale with average Fleiss Kappa 0.39 inter-rater agreement. A comparison of various classifiers revealed that the Support Vector Machine (SVM) yielded the best results, achieving classification accuracy of 70-85% for each quality dimension.
Dr Sławomir Tadeja
Slawomir K. Tadeja is a postdoctoral research associate in the Cyber-Human Lab belonging to the Institute of Manufacturing at the University of Cambridge. Previously,...