A media space is an environment that integrates audio, video and computer technologies in novel ways to provide its members with new ways of interacting and working together. However, person-to-person communication with a media space from outside it is limited. This thesis presents the user-centered, iterative evolution of the audio video server attendant (AVSA), a system that enables people to access and control the resources of a media space from a conventional videoconferencing room.

A key issue is that the only equipment available in the videoconferencing room is traditional videoconferencing equipment: a camera, monitor, speaker and microphone. Consequently, control must be exercised through speech input in response to visual prompts.

In a larger context, we are seeing a rapid move towards networked interactive information appliances. Conventional efforts "converge" telephone, television and/or computer technologies to produce these appliances. However, because of this approach, the shape that the associated services are assuming is rooted in the appliances of the past. As an example of a networked interactive information appliance not centered on any one technology, the AVSA presents an alternative to this limited perspective.



