CS5604, Unit MM

Edward A. Fox
Department of Computer Science, Virginia Tech, Blacksburg VA 24061-0106

Abstract:

Information can be represented in various media. Multimedia information systems may make use of audio, video, images, graphics, animations, and other types of media in addition to numbers and texts. While searching of multimedia databases often is based on searching of text descriptions, or involves vectors of features of the various representations, there are many new issues that must be faced when moving from text to a multiplicity of media. First, there is the problem of representation of each media type. There are special characteristics, and often requirements for large scale data compression. Second, there are issues related to coordinating and synchronizing the multiple media types. This relates to real time scheduling, network issues, and operating system support. Third, there are special computer systems (e.g., standalone systems like CD-TV or CD-I), new software/hardware technologies (e.g., DVI), and numerous standards (e.g., JPEG and MPEG for coding, HyTime for document architecture). These must be understood and related to given user requirements. Finally, there is need for methodologies for developing multimedia applications and for managing such projects. Here object-oriented methods are crucial, human-computer interaction guidelines must be followed, and new, useful metaphors and systems must be developed and refined.

This Unit has four articles that should be studied. In addition, the videotape on Interactive Digital Video can be viewed in class or in McB 110. In the lecture a general overview will be given using KMS or Acrobat. The demonstrations and videotapes help round out the picture.

Unit Highlights

Lecture/Motivating Question:
How do we extend current information systems to handle and support use of multimedia?

Computer Exercises:
Students will work with JPEG and MPEG formats.

Introduction

The area of multimedia is rapidly emerging, in part because it allows computers to communicate in ways more convenient and effective with humans, and in part because of rapid improvements in technology. Improvements have enabled media such as audio and video to be presented by computers, then to be captured in small amounts, then to be compressed in larger amounts, and finally to be more easily managed by computers and networks.

Technology

As is discussed in the first reading, many areas of computer science can and must be applied. A whole new set of jargon has emerged, relating to storage units, compression techniques, networking approaches, and special computer systems.

Of crucial importance is compression. When images are compressed, say by using the new JPEG standard (see 2nd article), approximately 20:1 savings in space is realized. This allows images to be handled on current computers, and large image collections to be managed using CD-ROMs or network servers. JPEG makes use of the discrete cosine transform (DCT), Huffman or arithmetic coding, and some other special tricks that altogether can be handled by special chips or fast host processors.

Standards for video (e.g., MPEG and px64) build upon the JPEG techniques, adding in other methods to remove temporal redundancy (i.e., repetition from one frame of data to the next).

Software

For multimedia to catch on, the next real barrier is software. Without real time operating systems and fast software, multimedia information cannot be presented and manipulated interactively by users. Without good software, developing multimedia applications is prohibitively expensive and time consuming. Without the use of modern object-oriented development efforts, both of the above types of efforts are doomed to early demise because of failures to keep up with rapid change in hardware, extension to myriad operating systems, and cross-platform portability requirements. The third (and fourth, optional) reading deals with many of these issues.

Though just exposed briefly to these concepts, students will find this area to be quite exciting, and will be equipped with some of the key knowledge and concepts needed to comprehend and work in this emerging industry.

Objectives

From the Course Objectives an important goal is: read and understand research contributions ...; you will gain experience by reading the three CACM articles, by the in-class discussion relating to those, and by the videotape presentations and demonstrations of research systems.

This Unit has the following objectives, for students to be able to:

  1. describe the basic methods used in the JPEG and MPEG standards for compression;

  2. describe the model and architecture used for current DVI systems' software --- AVK; and

  3. be able to explain much of the key terminology that relates to multimedia.

Suggested Procedure

There are two main types of effort required, besides the usual exercises (see below) and quiz. First, the readings (see the next section) should be carefully studied, keeping unit objectives in mind (see the previous section). Second, if possible, the videotapes and lectures, and the demonstrations, should supplement the readings.

Computer Exercises

First, you should examine a number of images compressed to varying degrees according to the JPEG standard. The easiest way is for you to study the WWW version. Alternatively, if you want some hands on experience, use some computer that can handle X display, connect to video.cs.vt.edu, and run the program

xv
on the
/u1/fox/ir/jpeg/4.*
files. Also, do
ls -l 4.*
in that directory to see the file sizes. Note that file
4.60.jpg
was produced by command
tifftopnm 4.tif | cjpeg -Q 60 > 4.60.jpg
run on fox.cs.vt.edu earlier.

Based on the file sizes and your assessment of quality, which of the versions of this image would you recommend putting out on WWW? Please explain briefly.

Students should run the program mpeg_play on the

/home/video/abdulla/*.mpg
files using video.cs.vt.edu. Or, use the WWW and run a local MPEG player for the *.mpg files there.

Send to the instructor the name of the X terminal used, and the average frames per second for each of 5 files. (Hint: you may want to pick the smallest files, since that will take less time.) If you are not running X, but work with the WWW, see if you can determine how close to normal speed (30fps) these are. Please explain why these values are not 30.

Also, view a movie about Hawaii and tell the instructor briefly what that MPEG movie is about.

Please be sure to look at the WWW course notes for this unit and if your computer supports it, experiment with some of the audio and video files. Tell the instructor the 3 WWW pages you visited that you found most interesting, and explain briefly why.

Study Questions

You should be able to answer each of the following questions.

  1. How does work in CS relate to this field?
  2. What is the role of DCT and how does it apply to large images? What other image compressions schemes are comparable to it, and which may yield even higher compression ratios?
  3. How does the JPEG standard support needs for users to maintain good quality and yet achieve impressive degrees of compression?
  4. How does JPEG relate to color spaces?
  5. How does MPEG work and how does it build upon JPEG?
  6. How does DVI technology work: what software has and is involved, how much are audio and video compressed, and what are the two main qualities of DVI compression supported?
  7. How does Fluent Machines' approach differ from DVI's with AVK?
  8. How does having a faster DVI processor help improve performance and flexibility in terms of co-processor scheduling and memory bitmap copying?
  9. What is the difference between CD-I and DVI, and what are the main contributions of each?
  10. What is the scope for each of the important new standards: JPEG, MPEG, px64, HyTime?

Comments on Readings

The four articles are relatively diverse, covering various aspects of the field. Respectively, they provide an overview, detailed discussion on compression methods and standards, one approach to systems software support, and finally an approach to authoring software.

FOXE91b

This article provides an overview. It should be read carefully, and students should study and be able to refer to the definitions of acronyms, terms, and phrases. Note particularly the various standards and the approaches to compression.

WALL91

This article should be carefully read up through page 35. The rest can be skimmed. It is important to understand the goals of JPEG and how DCT and Huffman coding work with quantization and zig zag encoding to yield a (variably reduced) compressed bitstream. If the images on page 42 cannot be clearly seen, you may wish to look at the original journal issue, on reserve in the library.

GREE92

Intel's software for DVI is discussed in this article. It is quite interesting, tying in with work on object oriented system, software development, co-processor architectures, and general operating system issues. Please study the glossary carefully. Indeed, the whole article should be read closely. It is interesting to see the failings of the original DVI software, how a new better conceptual model was developed, and how it was implemented using an object oriented approach. Unfortunately, Intel has since discontinued DVI.

PHIL91a

MediaView is an interesting system. It builds upon the metaphor of a long scrolling article, that can be enhanced with annotations that involve various media types. Though the article will not be required, it is of interest in that it shows how a simple approach to authoring can go a long way, and how important object-oriented development tools are when building such a complex piece of software.

Summary of Key Concepts

  1. Compression methods and fast processor now allow us to manipulate images and motion video in digital forms.

  2. Object oriented software methods, and good models and metaphors, are essential to build complex systems and authoring software needed for effective use of multimedia information.

References

1
Edward A. Fox. Guest editor's introduction: Standards and the emergence of digital multimedia systems. Communications of the Association for Computing Machinery, 34(4):26--29, April 1991. For Special Section on Digital Multimedia Systems.

2
James L. Green. The evolution of DVI system software. Communications of the Association for Computing Machinery, 35(1):52--67, January 1992.

3
Richard L. Phillips. MediaView: A general multimedia digital publication system. Communications of the Association for Computing Machinery, 34(7):75--83, July 1991.

4
Gregory K. Wallace. The JPEG still picture compression standard. Communications of the Association for Computing Machinery, 34(4):30--44, April 1991.


Edward A. Fox
Thu Nov 28, 1996