Home page of Victor Ng-Thow-Hing

About my research

Links to electronic copies of most of my publications can be found on my academia.edu page.

The common theme that ties my various research interests together is a desire to understand and model human and animal behavior, either with computer graphics and animation or on mechanical devices like humanoid robots. My perspective has moved towards a systems-based approach rather than looking at isolated components. However, to do research at that wider, holistic level, I often had to work on individual components first to make sure they were modeled correctly. This has enabled me to pursue interests in human-machine interaction, which often requires working with systems of components that interact with each other in interesting ways.

The following are research projects that I've worked on, and continue to be interested in, from my graduate work at the University of Toronto and my current position at the Honda Research Institute USA. The most recent research activity is listed first.


Augmented Reality for the Car

The automobile industry is quickly moving towards cars connected to the internet. Although this means more information for the driver, there is a real danger of driver distraction. We are exploring the use of augmented reality windshields to create ways of enhancing the driver's experience while minimizing distraction. Not only do we work on the technical algorithms for fast computer vision and localization, but we also follow a user-centered design thinking approach to develop appropriate applications, evaluating them with a configurable driving simulator to refine our designs before field tests.


Multi-modal Facial Expression Recognition

My team wanted to continue our theme of non-verbal expression by allowing our humanoid robot to pick up on emotional cues given by facial expressions. We reasoned that if the robot could build up a model of its human partner's emotional state, we could detect conditions like frustration or happiness. We had to overcome several problems: dealing with the small image sizes of faces seen from the robot's cameras, handling moving faces since people generally don't stand still, and recognizing facial expressions accurately while the person was talking. For the latter case, some facial expression recognition algorithms depend on the mouth shape; however, when we talk, the mouth shape is not stable and can lead to false categorizations of expression. We used a multi-modal model that varies the feature masks used to weight features from different parts of the face, depending on whether the person is talking or not. We used Mitchel Benovoy's biologically-inspired models for robust recognition over large distances.
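
To make the mask-switching idea concrete, here is a small illustrative sketch in Python; the region names, weights and random feature vectors are placeholders, not our actual system.

    # Hypothetical sketch: weight facial-feature regions differently depending on
    # whether the person is currently talking, so an unstable mouth shape does not
    # dominate the expression classification.
    import numpy as np

    # Per-region feature weights (masks) for the two speaking states.
    MASKS = {
        "talking":     {"eyes": 1.0, "brows": 1.0, "mouth": 0.2},
        "not_talking": {"eyes": 1.0, "brows": 1.0, "mouth": 1.0},
    }

    def weighted_features(region_features, is_talking):
        """Concatenate per-region feature vectors, scaled by the active mask."""
        mask = MASKS["talking" if is_talking else "not_talking"]
        parts = [np.asarray(v) * mask[name] for name, v in region_features.items()]
        return np.concatenate(parts)

    # Example: features extracted per region (random placeholders here).
    feats = {"eyes": np.random.rand(8), "brows": np.random.rand(4), "mouth": np.random.rand(6)}
    x_talking = weighted_features(feats, is_talking=True)   # mouth down-weighted
    x_silent  = weighted_features(feats, is_talking=False)  # full-face features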


Gesture Generation

This topic is a very exciting interest of mine as it combines motor control, linguistics and planning. Our goal is to automatically and intelligently determine which gestures are co-produced with various speech utterances. The choice of gesture can be driven by many processes and needs, such as the need for emphasis, affective (emotional) state, the content of the thought to be presented, and personal style. The gesture models we produce exhibit phenomena at all levels of gesture expression, from low-level beats, deictics, iconics and metaphorics to high-level emblems. The model has probabilistic elements at various stages to produce non-repetitive behavior, so you don't get the unnatural precision that often comes with robot displays of motion. This is continuing work which I am actively exploring further.
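
The sketch below gives the general flavour of probabilistic gesture selection; the scoring rules and timing jitter are invented for illustration and are not the actual model.

    # Illustrative only: sample a gesture to co-produce with an utterance. The gesture
    # categories come from the text above; the scoring heuristics are placeholders.
    import random

    GESTURE_TYPES = ["beat", "deictic", "iconic", "metaphoric", "emblem"]

    def gesture_scores(utterance):
        """Toy scoring: prefer deictics for spatial words, beats for emphasis."""
        scores = {g: 1.0 for g in GESTURE_TYPES}
        if any(w in utterance for w in ("this", "that", "there")):
            scores["deictic"] += 2.0
        if utterance.endswith("!"):
            scores["beat"] += 1.5
        return scores

    def choose_gesture(utterance):
        scores = gesture_scores(utterance)
        types, weights = zip(*scores.items())
        gesture = random.choices(types, weights=weights, k=1)[0]  # stochastic, non-repetitive
        onset_jitter = random.gauss(0.0, 0.05)  # vary timing slightly so motion isn't robotic
        return gesture, onset_jitter

    print(choose_gesture("Put the cup over there!"))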

Photo: a child teaching the robot, with skin conductance measurements.

Learning with Kids

As part of the search for useful applications for humanoid robots, we collaborated with Dr. Sandra Okita from Columbia University to design humanoid robots as potential learning partners (not teacher substitutes) for children. In our earlier work, we found that children aged 4-6 were especially open to interaction with humanoid robots, but were often confused if the robot did not respond correctly to social cues such as expectant eye gazes. This has driven new research activity in producing better models for interaction, including cognitive models for attentive behaviors and gesture in communication. New methodologies for conducting human-robot experiments and measurements were developed to allow us to capture and analyze the interaction across multiple modalities, viewpoints and time scales. To help keep us honest, we perform comparative pre- and post-testing on students to determine whether interaction with our robot produces any learning effects.


Human Robot Interactive Turn-taking Scenarios

To test out our ideas on interaction (perception, behavioral models and expression), we focus on various test scenarios that serve as research platforms for improving the quality of interaction and for testing that interaction with people. Our first platform, which has undergone several iterations in its various components, was the Memory Game, the venerable card game where people (and robots) attempt to select pairs of identical cards from a set of face-down cards. The challenge here is to use the robot's on-board perception to identify cards while balancing the tasks of keeping track of the game, monitoring turn-taking, and handling the fact that humans can make mistakes (or cheat).
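
A minimal sketch of the turn-taking bookkeeping might look like the following; it assumes perception already reports which card was flipped and by whom, and the pairing rule is a toy placeholder rather than the actual system.

    # Minimal Memory Game state tracker: whose turn it is, which cards remain,
    # and how to react when a flip is out of turn or invalid.
    class MemoryGame:
        def __init__(self, card_ids):
            self.face_down = set(card_ids)      # cards still in play
            self.current_pick = []              # cards flipped this turn
            self.turn = "human"                 # whose turn it is

        def observe_flip(self, card_id, player):
            if player != self.turn:
                return "out_of_turn"            # e.g. a flip during the robot's turn
            if card_id not in self.face_down:
                return "invalid_card"           # already matched, or misdetected by vision
            self.current_pick.append(card_id)
            if len(self.current_pick) < 2:
                return "pick_second"
            a, b = self.current_pick
            matched = pair_id(a) == pair_id(b)  # pair_id: hypothetical card-to-pair lookup
            if matched:
                self.face_down -= {a, b}
            else:
                self.turn = "robot" if self.turn == "human" else "human"
            self.current_pick = []
            return "match" if matched else "no_match"

    def pair_id(card_id):
        return card_id // 2  # toy pairing: cards 0/1 match, 2/3 match, ...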


Panoramic Attention for Perception

We realized that a big problem is deciding how to filter the sheer amount of sensory information into actionable information that a robot can respond to. Our idea was to build a 3-layer model that starts with low-level attention mechanisms in the visual and auditory modalities, which can quickly isolate regions of interest (ROIs). These ROIs are then fed to a second layer, which we call mid-layer detection, where specialized detectors for faces or objects reside. Finally, this high-level semantic information is sparsely stored in the panoramic attention layer, so named because it is stored in an ego-centric panoramic view from the agent's (robot's) perspective.
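
The three layers chain together roughly as sketched below; detect_rois and run_detectors are stand-ins for the low- and mid-level stages, and the angular binning of the panoramic store is illustrative.

    import math

    def detect_rois(frame):
        """Stand-in for a bottom-up saliency stage that proposes regions of interest."""
        return [(0, 0, 32, 32)]

    def run_detectors(frame, roi):
        """Stand-in for mid-level detectors (faces, objects) run inside an ROI."""
        return [{"label": "face", "pan_offset": 0.0, "tilt_offset": 0.0}]

    class PanoramicAttention:
        """Layer 3: sparse ego-centric store indexed by quantized pan/tilt direction."""
        def __init__(self, bin_degrees=5.0):
            self.bin = math.radians(bin_degrees)
            self.memory = {}                  # (pan_bin, tilt_bin) -> latest percept

        def store(self, pan, tilt, percept):
            self.memory[(round(pan / self.bin), round(tilt / self.bin))] = percept

        def recall(self, pan, tilt):
            return self.memory.get((round(pan / self.bin), round(tilt / self.bin)))

    def perceive(frame, head_pan, head_tilt, attention):
        for roi in detect_rois(frame):                    # layer 1: low-level attention
            for percept in run_detectors(frame, roi):     # layer 2: specialized detectors
                attention.store(head_pan + percept["pan_offset"],
                                head_tilt + percept["tilt_offset"], percept)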


Intelligent Systems for Humanoid Robots

At my current work at the Honda Research Institute USA, I work on multi-agent intelligent systems for modeling human-robot interactions and complex tasks on the 2000 Honda ASIMO humanoid robot. Our goal is to develop autonomous robots using well-designed, reusable interaction models and a variety of perceptual, decision-making and motor control components. I am currently the project leader for human-robot interaction and intelligent systems integration. We've written several major software systems for handling environment maps, sensor fusion, task organization, robot control, and inter-agent communication.


Computational Palaeontology

This project was a collaboration with John Hutchinson (Royal Veterinary College, University of London) and Frank "Clay" Anderson (Stanford University). It originated during a cafe chat around 2003 at Starbucks in Mountain View, CA. I was developing some mass models for my work in Digital Human Modeling at Honda, and thought that by combining these mass models with the B-spline solid model I used for muscle, we could create a very versatile shape primitive for estimating the mass properties of body tissue in animals, both extant and extinct. Well, four years later, in 2007, we finally published our work, with the resulting mass set model applied to a Tyrannosaurus rex skeleton and validated against an ostrich carcass. I really enjoyed this project, and it was done in my off-hours (Honda's not really into dinosaurs), so it was truly a labour of love.
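
The actual work uses B-spline solid primitives, but the underlying idea can be sketched generically: integrate an assumed tissue density over points sampled inside a volumetric primitive to estimate mass, centre of mass and inertia. The density, grid and box shape below are placeholders.

    import numpy as np

    def mass_properties(points, cell_volume, density=1000.0):
        """points: (N,3) sample centres filling the solid; density in kg/m^3."""
        m_cells = density * cell_volume * np.ones(len(points))
        mass = m_cells.sum()
        com = (m_cells[:, None] * points).sum(axis=0) / mass
        r = points - com
        # Inertia tensor about the centre of mass (point-mass approximation per cell).
        I = np.zeros((3, 3))
        for mi, ri in zip(m_cells, r):
            I += mi * ((ri @ ri) * np.eye(3) - np.outer(ri, ri))
        return mass, com, I

    # Example: a crude 1 m x 0.5 m x 0.5 m box sampled on a regular grid.
    xs = np.linspace(0.025, 0.975, 20)
    ys = zs = np.linspace(0.025, 0.475, 10)
    pts = np.array([(x, y, z) for x in xs for y in ys for z in zs])
    print(mass_properties(pts, cell_volume=0.05 * 0.05 * 0.05))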


Motion Planning for Robots

Motion planning is a very important step prior to the actual execution of motion on a robot. The joint trajectories designed to accomplish a task goal must avoid self-collisions and collisions with the environment, and respect kinematic joint limits. I've worked on methods for planning tasks that involve switching between different modalities of motion (like walking and pushing). My main goal is to develop good modes of manipulation that let a robot accomplish higher-level complex tasks. This work is done at the Honda Research Institute USA with my former interns, Kris Hauser and Evan Drumwright.
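
As a generic illustration of sampling-based planning under joint limits and collision constraints (not the specific multi-modal planners mentioned above), a minimal joint-space RRT might look like this; the collision check is supplied by the caller, and step sizes and limits are illustrative.

    import random
    import numpy as np

    def plan_rrt(q_start, q_goal, joint_limits, in_collision,
                 step=0.1, goal_bias=0.1, max_iters=5000):
        lo, hi = np.array(joint_limits).T
        nodes = [np.array(q_start, float)]
        parents = {0: None}
        for _ in range(max_iters):
            target = np.array(q_goal, float) if random.random() < goal_bias \
                     else np.random.uniform(lo, hi)          # sample within joint limits
            i = min(range(len(nodes)), key=lambda k: np.linalg.norm(nodes[k] - target))
            direction = target - nodes[i]
            q_new = np.clip(nodes[i] + step * direction / (np.linalg.norm(direction) + 1e-9),
                            lo, hi)
            if in_collision(q_new):
                continue                                     # reject colliding configurations
            parents[len(nodes)] = i
            nodes.append(q_new)
            if np.linalg.norm(q_new - q_goal) < step:        # close enough: extract the path
                path, k = [], len(nodes) - 1
                while k is not None:
                    path.append(nodes[k]); k = parents[k]
                return path[::-1]
        return None

    # Example with a trivial (always collision-free) check for a 2-DOF arm.
    path = plan_rrt([0.0, 0.0], [1.0, 1.0], [(-3.14, 3.14)] * 2, lambda q: False)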


Complex Task Modeling for Humanoid Robots

The control algorithms for various motor tasks on a robot can vary widely depending on the goals of each task. They range from simple joint-angle trajectories that achieve certain poses, to pointing commands that require task-space control variables as well as perceptual information about objects in the environment. Evan Drumwright and I developed the Task Matrix, a framework that unifies these task programs behind a simple, parameterized, robot-independent abstract interface. The Task Matrix handles concurrency and conflict resolution, and allows complex tasks to be assembled from simpler ones. Recently, the Task Matrix was made to work with the humanoid robot ASIMO. This project originated at the Honda Research Institute, and Evan has continued the work in his PhD thesis. I am also actively developing the Task Matrix for my own research at Honda.
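
One hypothetical reading of the parameterized, robot-independent interface idea is sketched below: each task declares the resources it needs so a scheduler can detect conflicts and run non-conflicting tasks concurrently. The class and method names are illustrative, not the actual Task Matrix API.

    from abc import ABC, abstractmethod

    class Task(ABC):
        resources = set()                    # e.g. {"right_arm"}; used for conflict checks

        @abstractmethod
        def start(self, robot, **params): ...
        @abstractmethod
        def update(self, robot, dt): ...     # returns True while the task is still running

    def conflicts(a, b):
        """Two tasks conflict if they need any of the same body resources."""
        return bool(a.resources & b.resources)

    class PointAt(Task):
        resources = {"right_arm", "head"}
        def start(self, robot, target=None):
            self.target = target             # task-space goal supplied as a parameter
        def update(self, robot, dt):
            robot.move_arm_toward(self.target)   # robot-specific details hidden behind 'robot'
            return not robot.arm_at(self.target)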


Realistic joints for humans and animals

Wei Shao and I worked on a joint component model to more accurately capture the complexity of human joints. With this model, we could build complex joints, such as those in the human spine, shoulder and knee. These joints can be animated at real-time rates and used in interactive applications. With Jianbo Peng, I also worked on automatic methods for building subject-specific skeletons from motion-capture data.
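
As a rough illustration of composing a complex joint from simpler rotational components (the axes and limits below are invented, not the published model):

    import numpy as np

    def rotation(axis, angle):
        """3x3 rotation about a unit axis (Rodrigues' formula)."""
        axis = np.asarray(axis, float) / np.linalg.norm(axis)
        K = np.array([[0, -axis[2], axis[1]],
                      [axis[2], 0, -axis[0]],
                      [-axis[1], axis[0], 0]])
        return np.eye(3) + np.sin(angle) * K + (1 - np.cos(angle)) * (K @ K)

    class CompoundJoint:
        """A joint built from a sequence of (axis, limits) rotational components."""
        def __init__(self, components):
            self.components = components

        def transform(self, angles):
            R = np.eye(3)
            for (axis, (lo, hi)), a in zip(self.components, angles):
                R = R @ rotation(axis, np.clip(a, lo, hi))   # respect per-component limits
            return R

    # Toy shoulder: three rotational components with asymmetric limits (radians).
    shoulder = CompoundJoint([((1, 0, 0), (-1.0, 3.0)),
                              ((0, 1, 0), (-1.5, 1.5)),
                              ((0, 0, 1), (-0.5, 2.5))])
    print(shoulder.transform([0.3, -0.2, 0.1]))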


Musculo-tendon Modeling for humans and animals

Although this topic was my first research passion, it still intrigues and excites me. My current work with humanoid robots makes it difficult to get back into this, but it hasn't stopped me from developing new ideas in whatever spare time I have left these days. I continue to collaborate with my co-authors from my grad school days, who have taken my original models and extended their application to other soft tissues. When I first started this topic, directly considering anatomy when modeling animals in computer graphics and animation was still unheard of; most computer graphics techniques were better suited to mechanical devices like cars and robots. Now we've almost got it right, but there is still something missing, and I hope continued work will be done on building better underlying anatomical rigs for computer graphics creatures.



Physics-Based Animation Systems

During my PhD (1994-2000) at the University of Toronto, I co-developed with Petros Faloutsos a physics-based animation system called DANCE (Dynamic Animation and Control Environment). DANCE could be extended by plug-ins and featured abstract interfaces for controllers, numerical integrators and physical models. It could be used as a physics-simulation playground for testing out a variety of ideas, from virtual stuntmen to biomechanical models of muscle. This was a formative experience for me, as it was with this project that I really started building plug-in-based frameworks with abstract component interfaces in my software.
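
The plug-in idea can be sketched abstractly as follows; the class names and the toy PD-controlled point mass are illustrative, not DANCE's actual API.

    # Controllers, integrators and physical models sit behind abstract interfaces,
    # so new plug-ins can be swapped in without touching the core simulation loop.
    from abc import ABC, abstractmethod

    class Integrator(ABC):
        @abstractmethod
        def step(self, state, deriv_fn, dt): ...

    class ExplicitEuler(Integrator):
        def step(self, state, deriv_fn, dt):
            return [x + dt * dx for x, dx in zip(state, deriv_fn(state))]

    class Controller(ABC):
        @abstractmethod
        def torques(self, state): ...

    class PDController(Controller):
        def __init__(self, q_des, kp=10.0, kd=1.0):
            self.q_des, self.kp, self.kd = q_des, kp, kd
        def torques(self, state):
            q, qd = state
            return self.kp * (self.q_des - q) - self.kd * qd

    def simulate(model_deriv, controller, integrator, state, dt, steps):
        """Core loop: the simulator only sees the abstract interfaces."""
        for _ in range(steps):
            deriv_fn = lambda s: model_deriv(s, controller.torques(s))
            state = integrator.step(state, deriv_fn, dt)
        return state

    # Example: a 1-DOF unit mass driven toward q = 1.0.
    point_mass = lambda s, tau: [s[1], tau]           # (q, qd) -> (qd, qdd)
    print(simulate(point_mass, PDController(1.0), ExplicitEuler(), [0.0, 0.0], 0.01, 500))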

Since then, DANCE has been extended and improved by Ari Shapiro and has its own home page at UCLA.

  • Shapiro, A., V. Ng-Thow-Hing, and P. Faloutsos, Dynamic Animation and Control Environment, in Graphics Interface 2005, Victoria, Canada, May 9-11, 2005. CNN online coverage.
  • Ng-Thow-Hing, V., and P. Faloutsos, Dynamic Animation and Control Environment (DANCE), SIGGRAPH Technical Sketch, in SIGGRAPH Conference Abstracts and Applications, page 198, 2000.
  • Ng-Thow-Hing, V., and P. Faloutsos, DANCE: dynamic animation and control environment, in Graphics Interface ’99 Poster Abstracts, poster presentation, pages 31-32, 1999.