A visual display device is provided for delivering a generated image, preferably combinable with environment light, to the eye of a user. The device is lightweight and compact but yields a high quality image. In one embodiment, a color shutter provides a high-density color image. In one embodiment, a shroud protects from stray light and holds optical elements in desired alignment. In one embodiment an image generator is masked by at least two masks to provide for a high quality image without waste. In one embodiment, a removably mounted shield or activatable device can convert the apparatus from a see-through device to an immersion device and back again. In one embodiment, the device can be comfortably mounted to the user's head while still allowing for use of conventional eyeglasses. In one embodiment various controls, such as a mute button, volume control and the like can be provided, such as by mounting on the head-mounted display device.