C. Ahlberg, and B. Shneiderman, Visual Information Seeking: Tight Coupling of Dynamic Query Filters with Starfield Displays, in Proc. ACM CHI '94, Boston, MA, April 1994, 313–317.
 A. Akutsu and Y. Tonomura, Video Tomography: An Efficient Method for Camerawork Extraction and Motion Analysis, in Proc. ACM Multimedia Conference, 1994, pp. 349–356.
 F. Arman,A. Hsu, and M. Chiu, Image Processing on Compressed Data for Large Video Databases, in Proceedings of the ACM MultiMedia, California, June 1993, pp. 267–272.
 D. M. Bikel,S. Miller,R. Schwartz, and R. Weischedel, Nymble: A High-Performance Learning Name-finder, in Proc. 5th Conf. on Applied Natural Language Processing (ANLP), Washington DC, April 1997, pp. 194–201.
 J.S. Boreczky and L.A. Rowe, Comparison of Video Shot Boundary Detection Techniques, in SPIE Conf. on Visual Communication and Image Processing, 1996.
 J. Boreczky,A. Girgensohn,G. Golovchinsky, and S. Uchihashi, An Interactive Comic Book Presentation for Exploring Video, in Proc. CHI '00, 2000, pp. 185–192.
 Chang, Moderator, Multimedia Access and Retrieval: The State of the Art and Future Directions, in Proc. ACM Multimedia '99, Orlando, FL, October 1999, pp. 443–445.
 M.G. Christel and K. Pendyala, Informedia Goes to School: Early Findings from the Digital Video Library Project, D-Lib Magazine, September 1996. http://www.dlib.org/dlib/september96/informedia/09christel.html.
 M.G. Christel,D.B. Winkler and C.R. Taylor, Improving Access to a Digital Video Library, in Human-Computer Interaction INTERACT '97: IFIP TC13 International Conference on Human-Computer Interaction, July 1997, Sydney, Australia, S. Howard, J. Hammond, & G. Lindgaard, Eds. London: Chapman & Hall, 1997, pp. 524–531.
 M.G. Christel,M.A. Smith,C.R Taylor, and D.B. Winkler, Evolving Video Skims into Useful Multimedia Abstractions, in Proc. of the CHI '98 Conference on Human Factors in Computing Systems, C. Karat, A. Lund, J. Coutaz, and J. Karat, Eds., Los Angeles, CA, April 1998, pp. 171–178.
 M. Christel, Visual Digests for News Video Libraries, in Proc. ACM Multimedia '99, Orlando FL, Nov. 1999, ACM Press, pp. 303–311.
 M.G. Christel and A.S. Warmack, The Effect of Text in Storyboards for Video Navigation, in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Salt Lake City, UT, May 2001, Vol. III, pp. 1409–1412.
 M., Christel,A.M, Olligschlaeger, and C. Huang, Interactive Maps for a Digital Video Library, IEEE MultiMedia 7(1), 2000, pp. 60–67.
 G. Crane,R. Chavez, et al., Drudgery and Deep Thought, Comm. ACM 44(5), 2001, pp. 34–40.
 M. Christel and D. Martin, Information Visualization within a Digital Video Library, Journal of Intelligent Information Systems 11(3), 1998, pp. 235–257.
 M. Christel,A. Hauptmann,H. Wactlar, and T. Ng, Collages as Dynamic Summaries for News Video, in Proc. ACM Multimedia '02, Juan-les-Pins, France, Dec. 2002, ACM Press.
 P. Clarkson and R. Rosenfeld, Statistical Language Modeling Using the CMU-Cambridge Toolkit, in Proc. Eurospeech '97, Rhodes, Greece, Sept. 1997, Int'l Speech Communication Assoc., pp. 2707–2710.
 Y. Deng,B.S. Manjunath,C. Kenney,M.S. Moore, and H.Shin, An efficient color representation for image retrieval, IEEE Transactions on Image Processing, Vol. 10, No. 1, IEEE, Jan. 2001, pp. 140–147.
 S.G. Eick, Data Visualization Sliders, in Proc. ACM Symposium on User Interface Software and Technology, Marina del Rey, CA, Nov. 1994, ACM Press, pp. 119–120.
 D.A. Forsyth,J. Haddon, and S. Ioffe, Finding Objects by Grouping Primitives, in Shape, Contour and Grouping in Computer Vision, D.A. Forsyth, J.L. Mundy, R. Cipolla, and V. DiGes'u, Eds., Springer-Verlag, 2000, LNCS 1681.
 U. Gargi,R. Kasturi, and S. H. Strayer, Performance Characterization of Video-Shot-Change Detection Methods, IEEE Transaction on Circuits and Systems for Video Technology, Vol. 10, No. 1, February 2000.
 D. Grinberg,J. Lafferty and D. Sleator, A robust parsing algorithm for link grammars, Carnegie Mellon University Computer Science technical report CMU-CS-95-125, and Proc. Fourth International Workshop on Parsing Technologies, Prague, September 1995.
 A. Hampapur,R. Jain, and T. Weymouth, Production Model Based Digital Video Segmentation, Multimedia Tools and Applications, Vol. 1, 1995, pp. 9–46.
 A. Hauptmann,M. Witbrock,A. Rudnicky, and S. Reed, Speech for Multimedia Information Retrieval, User Interface Software and Technology Conference (UIST'95), Pittsburgh, PA, November, 1995
 A. Hauptmann and M. Smith, Text, Speech, and Vision for Video Segmentation: The Informedia Project, in AAAI Fall 1995 Symposium on Computational Models for Integrating Language and Vision.
 M. A. Hearst, TileBars: Visualization of Term Distribution Information in Full Text Information Access, Proceedings of the ACM CHI'95 Conference on Human Factors in Computing Systems, Denver, CO, May, 1995, 59–66.
 M. Hegarty and M.A. Just, Constructing mental models of machines from text and diagrams, Journal of Memory & Language, Dec. 1993, 32(6), pp. 717–742.
 E. Hjelm s and B. K. Low, Face Detection: A survey, Computer Vision and Image Understanding Vol. 83, No. 3, September 2001.
 Q. Iqbal and J. K. Aggarwal, Retrieval by Classification of Images Containing Large Manmade Objects Using Perceptual Grouping, Pattern Recognition Journal. Vol. 35, No. 7, July 2002, pp. 1463–1479.
 Q. Iqbal and J. K. Aggarwal, Perceptual Grouping for Image Retrieval and Classification, in 3rd IEEE Computer Society Workshop on Perceptual Organization in Computer Vision, July 8, 2001, Vancouver, Canada, pp. 19.1–19.4.
 A. Komlodi and L. Slaughter, Visual Video Browsing Interfaces Using Key Frames, in CHI '98 Summary, ACM, New York, 1998, pp. 337–338.
 F. Li,A. Gupta,E. Sanocki,L. He, and Y. Rui, Browsing Digital Video, in Proc. ACM CHI '00, ACM Press, 2000, pp. 169–176.
 A. Large,J. Beheshti,A. Breuleux, and A. Renaud, Multimedia and comprehension: The relationship among text, animation, and captions. Journal of American Society for Information Science, June 1995, 46(5), pp. 340 – 347.
 R. Lienhart et al., Video Abstracting, Communications of the ACM, 40, 12, 1997, pp. 54–62.
 R. Lienhart, Comparison of Automatic Shot Boundary Detection Algorithms, in Proc. Storage and Retrieval for Still Image and Video Databases VII 1999, SPIE 3656–29, January 1999.
 B. S. Manjunath,J.-R. Ohm,V. V. Vinod, and A. Yamada, Color and Texture Descriptors, IEEE Transactions Circuits and Systems for Video Technology, Special Issue on MPEG-7, 2001.
 M. Mauldin, Conceptual Information Retrieval: A Case Study in Adaptive Partial Parsing, Kluwer Academic Press, September 1991.
 M. Mauldin, Information Retrieval by Text Skimming, Ph.D. Thesis, Carnegie Mellon University, August, 1989 (also available as CMU Computer Science technical report CMU-CS-89-193).
 A. Merlino,D. Morey, and M. Maybury, Broadcast News Navigation using Story Segmentation, in Proc. ACM Multimedia '97, ACM Press, 197, pp. 381–391.
 D. Miller,R. Schwartz,R. Weischedel, and R. Stone, Named Entity Extraction for Broadcast News, in Proc. DARPA Broadcast News Workshop, Washington, DC., March 1999. http://www.nist.gov/speech/publications/darpa99/html/ie20/ie20.htm
 J. Nielsen and R.L. Mack, Eds., Usability Inspection Methods, John Wiley & Sons, New York, NY, 1994.
 T. Firmin and M.J. Chrzanowski, An Evaluation of Automatic Text Summarization Systems, in M.T. Maybury, Ed., Advances in Automatic Text Summarization, The MIT Press, Cambridge, MA, 1999.
 G.C. Nugent, Deaf students' learning from captioned instruction: The relationship between the visual and caption display, Journal of Special Education, 1983, 17(2), pp. 227–234.
 K.A. Olsen,R.R. Korfhage, et al., Visualization of a Document Collection: The VIBE System, Information Processing & Management, 29(1), 1993, pp. 69–81.
 C. Pryluck, When Is a Sign Not a Sign, Revised and reprinted, Journal of Dramatic Theory and Criticism, 6(Spring 1992)2, pp. 221–231,
 C. Pryluck, Meaning in Film/Video: Order, Time, and Ambiguity, Journal of Broadcasting, 26(Summer 1982)3, pp. 685–695 (with Charles Teddlie,Richard Sands).
 H. Rowley,T. Kanade, and S. Baluja, Neural Network-Based Face Detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, January 1998.
 T. Sato,T. Kanade,E. Hughes, and M. Smith, Video OCR for Digital News Archive, in Proc. Workshop on Content-Based Access of Image and Video Databases, IEEE, Los Alamitos, CA, 1998, pp. 52–60.
 M. Smith and T. Kanade, Video Skimming and Characterization through the Combination of Image and Language Understanding Techniques, in Computer Vision and Pattern Recognition Conference, San Juan, Puerto Rico, June 1997, pp. 775–781.
 M. Smith, Integration of Image, Audio, and Language Technology for Video Characterization and Variable-Rate Skimming, Ph.D. Thesis, Carnegie Mellon University. January 1998 (also available as text, Kluwer Academic Press, March 2003).
 M. Smith and T. Chen, Image and Video Indexing and Retrieval, in The Handbook of Image and Video Processing, A.C. Bovik, Ed., Academic Press, New York, 2000.
 H. Schneiderman and T. Kanade, Object Detection Using the Statistics of Parts, International Journal of Computer Vision, 2002.
 K-K. Sung and T. Poggio, Example-Based Learning for View-Based Human Face Detection, Pattern Analysis and Machine Intelligence, Vol. 20, No. 1, January 1998.
 Y. Taniguchi,A. Akutsu,Y. Tonomura, and H. Hamada, An Intuitive and Efficient Access Interface to Real-Time Incoming Video Based on Automatic Indexing, in Proc. ACM Multimedia Conf., ACM Press, New York, pp. 25–33.
 S. Uchihashi,J. Foote,A. Girgensohn, and J. Boreczky, Video Manga: Generating Semantically Meaningful Video Summaries, in Proc. ACM Multimedia, ACM Press, 1999, pp. 383–392.
 H. Wactlar,M. Christel,Y. Gong, and A. Hauptmann, Lessons Learned from the Creation and Deployment of a Terabyte Digital Video Library, IEEE Computer, 32(2), Feb. 1999, pp. 66–73.
 W. Xiong and J. C.-M. Lee, Efficient scene change detection and camera motion annotation for video classification, in Computer Vision and Image Understanding, 71, 1998, pp. 166–181.
 B.-L. Yeo and M.M. Yeung, Retrieving and Visualizing Video, Comm. ACM 40 (12), 1997, pp. 43–52.
 H.J. Zhang,C.Y. Low, and S.W Smoliar, Video parsing and browsing using compressed data, Multimedia Tools and Applications, 1, 1995, pp. 89–111.
 H.J. Zhang,S.W. Smoliar,J.H. Wu,C.Y. Low, and A. Kankanhalli, A Video Database System for Digital Libraries, Lecture Notes in Computer Science, 916, 1995, pp. 253–264.
 Technical Notes: Biometric Consortium, www.biometrics.org, 2002.