David Demirdjian - Academia.edu (original) (raw)
Uploads
Papers by David Demirdjian
In this paper, we present a novel technique for classifying multimodal temporal events. Our main ... more In this paper, we present a novel technique for classifying multimodal temporal events. Our main contribution is the introduction of temporal random forests (TRFs), an extension of random forests (and decision trees in general) to the time domain. The approach is relatively simple and able to discriminatively learn event classes while performing feature selection in an implicit fashion. We describe here our ongoing research and present experiments performed on gesture and audio-visual speech recognition datasets comparing our method against state-of-the-art algorithms.
Virtual Reality, Jul 12, 2005
Lecture Notes in Computer Science, 2004
Springer eBooks, Nov 3, 2007
This paper presents a patch-based approach for pose estimation from single images using a kerneli... more This paper presents a patch-based approach for pose estimation from single images using a kernelized density voting scheme. We introduce a boosting-like algorithm that models the density using a mixture of weighted ‘weak’ estimators. The ‘weak’ density estimators and corresponding weights are learned iteratively from a training set, providing an efficient method for feature selection. Given a query image, voting
This paper presents a novel method for learning classes of temporal sequences using a bag-of-feat... more This paper presents a novel method for learning classes of temporal sequences using a bag-of-features approach. We define a temporal sequence as a bag of temporal features and show how this representation can be used for the recognition and segmentation of temporal events. A codebook of temporal descriptors, representing the local temporal texture, is automatically constructed from a set of
Navigating virtual environments usually requires a wired interface, game console, or keyboard. Th... more Navigating virtual environments usually requires a wired interface, game console, or keyboard. The advent of perceptual interface techniques allows a new option, the passive and untethered sensing of users' pose and gesture to allow them maneuver through virtual worlds. We show new algorithms for passive, real-time articulated tracking with standard cameras and personal computers. Several different interaction styles are compared,
... Science and Artificial Intilligens, MIT 200 Technology Square, Cambridge, MA 20139, USA konra... more ... Science and Artificial Intilligens, MIT 200 Technology Square, Cambridge, MA 20139, USA konrad@csail.mit.edu, demirdji@csail.mit.edu, trevor@csail.mit ... First we performed a Wizard ofOZ (WOz) study with the major aim of gathering qualitative data like: what gestures are most ...
In this paper, we present a novel technique for classifying multimodal temporal events. Our main ... more In this paper, we present a novel technique for classifying multimodal temporal events. Our main contribution is the introduction of temporal random forests (TRFs), an extension of random forests (and decision trees in general) to the time domain. The approach is relatively simple and able to discriminatively learn event classes while performing feature selection in an implicit fashion. We describe here our ongoing research and present experiments performed on gesture and audio-visual speech recognition datasets comparing our method against state-of-the-art algorithms.
Virtual Reality, Jul 12, 2005
Lecture Notes in Computer Science, 2004
Springer eBooks, Nov 3, 2007
This paper presents a patch-based approach for pose estimation from single images using a kerneli... more This paper presents a patch-based approach for pose estimation from single images using a kernelized density voting scheme. We introduce a boosting-like algorithm that models the density using a mixture of weighted ‘weak’ estimators. The ‘weak’ density estimators and corresponding weights are learned iteratively from a training set, providing an efficient method for feature selection. Given a query image, voting
This paper presents a novel method for learning classes of temporal sequences using a bag-of-feat... more This paper presents a novel method for learning classes of temporal sequences using a bag-of-features approach. We define a temporal sequence as a bag of temporal features and show how this representation can be used for the recognition and segmentation of temporal events. A codebook of temporal descriptors, representing the local temporal texture, is automatically constructed from a set of
Navigating virtual environments usually requires a wired interface, game console, or keyboard. Th... more Navigating virtual environments usually requires a wired interface, game console, or keyboard. The advent of perceptual interface techniques allows a new option, the passive and untethered sensing of users' pose and gesture to allow them maneuver through virtual worlds. We show new algorithms for passive, real-time articulated tracking with standard cameras and personal computers. Several different interaction styles are compared,
... Science and Artificial Intilligens, MIT 200 Technology Square, Cambridge, MA 20139, USA konra... more ... Science and Artificial Intilligens, MIT 200 Technology Square, Cambridge, MA 20139, USA konrad@csail.mit.edu, demirdji@csail.mit.edu, trevor@csail.mit ... First we performed a Wizard ofOZ (WOz) study with the major aim of gathering qualitative data like: what gestures are most ...