Y. Rui and T. S. Huang, Image Retrieval: Current Techniques, Promising, Directions and Open Issues, Journal of Visual Communication and Image Representation, Vol. 10, No. 4, April 1999.
 A. Nagasaka and Y. Tanaka, Automatic Video Indexing and Full-Video Search for Object Appearance, in Proc. of IFIP 2nd Working Conf. On Visual Database Systems, pp. 113–127, 1992.
 H. J. Zhang,J. H. Wu,D. Zhong, and S. W. Smoliar, Video Parsing, Retrieval and Browsing: An Integrated and Content-Based Solution, Pattern Recognition, pp. 643–658, Vol. 30, No. 4, 1997.
 Y. Deng and B. S. Manjunath, Content-Based Search of Video Using Color, Texture and Motion, in Proc. of IEEE Intl. Conf. On Image Processing, pp. 534–537, Vol. 2, 1997.
 P. Aigrain,H. J. Zhang, and D. Petkovic, Content-Based Representation and Retrieval of Visual Media: A State-of-the-Art Review, International Journal of Multimedia Tools Applications, pp. 179–202, Vol. 3, November 1996.
 J. S. Boresczky and L. A. Rowe, A Comparison of Video Shot Boundary Detection Techniques, in Storage & Retrieval for Image and Video Databases IV, Proc. SPIE 2670, pp. 170–179, 1996.
 W. Wolf, Key frame selection by motion analysis, in IEEE ICASSP, pp. 1228–1231, Vol. 2, 1996.
 M. Flickner et al., Query by Image and Video Content, IEEE Computer, pp. 23–32, September 1995.
 M. Yeung,B. L. Yeo, and B. Liu, Extracting Story Units from Long Programs for Video Browsing and Navigation, in Proc. IEEE Conf. on Multimedia Computing and Systems, pp. 296–305, 1996.
 P. Aigrain,P. Joly, and V. Longueville, Medium Knowledge Based Macro-segmentation of Video into Sequences, in Intelligent Multimedia Information Retrieval, M. T. Maybury, Ed., pp. 159–173. AAAI/MIT Press, 1997.
 A. G. Hauptmann and M. A. Smith, Text, Speech, and Vision for Video Segmentation: The Informedia Project, in AAAI Fall Symposium, Computational Models for Integrating Language and Vision, Boston, 1995.
 R. Lienhart,S. Pfeiffer, and W. Effelsberg, Scene Determination Based on Video and Audio Features, Technical report, University of Mannheim, November 1998.
 J. S. Boreczky and L.D. Wilcox, A Hidden Markov Model Framework for Video Segmentation Using Audio and Image Features, in IEEE ICASSP, pp. 3741–3744, Vol. 6, Seattle, 1998.
 Y. Rui,T. S. Hunag, and S. Mehrotra, Constructing Table-of-Content for Videos, ACM Multimedia Systems Journal, Special Issue Multimedia Systems on Video Libraries, pp. 359–368, Vol. 7, No. 5, Sep. 1999.
 D. Zhong,H. Zhang, and S.-F. Chang, Clustering Methods for Video Browsing and Annotation, in SPIE Conference on Storage and Retrieval for Image and Video Databases, pp. 239–246, Vol. 2670, 1996.
 U. Gargi,S. Antani, and R. Kasturi, Indexing Text Events in Digital Video Databases, in Proc. 14th Int'l Conf. Pattern Recognition, pp. 916–918, 1998.
 J.C. Shim,C. Dorai, and R. Bolle, Automatic Text Extraction from Video for Content-Based Annotation and Retrieval, in Proc. 14th Int'l Conf. Pattern Recognition, pp. 618–620, 1998.
 J. D. Courtney, Automatic Video Indexing via Object Motion Analysis, Pattern Recognition, pp. 607–625, Vol. 30, No. 4, 1997.
 A. M. Ferman,A. M. Tekalp, and R. Mehrotra, Effective Content Representation for Video, in Proc. of ICIP'98, pp. 521–525, Vol. 3, 1998.
 M. Gelgon and P. Bouthemy, Determining a Structured Spatio-Temporal Representation of Video Content for Efficient Visualization and Indexing, in Proc. 5th Eur. Conf. on Computer Vision, ECCV'98, Freiburg, June 1998.
 Y. F. Ma and H. J. Zhang, Detecting Motion Object by Spatio-Temporal Entropy, in IEEE Int. Conf. on Multimedia and Expo, Tokyo, Japan, August 22–25, 2001.
 R. C. Nelson and R. Polana, Qualitative Recognition of Motion Using Temporal Texture, in Proc. DARPA Image Understanding Workshop, San Diego, CA, pp.555–559, Jan. 1992.
 K. Otsuka,T. Horikoshi,S. Suzuki, and M. Fujii, Feature Extraction of Temporal Texture Based on Spatio-Temporal Motion Trajectory, in Proc. 14th Int. Conf. On Pattern Recognition, ICPR'98, pp. 1047–1051, Aug. 1998.
 P. Bouthemy and R. Fablet, Motion Characterization from Temporal Concurrences of Local Motion-Based Measures for Video Indexing, Int. Conf. on Pattern Recognition, ICPR'98, pp. 905–908, Vol. 1, Australia, Aug. 1998.
 M. Szummer and R. W. Picard, Temporal Texture Modeling, in IEEE ICIP'96, pp. 823–826, Sep. 1996.
 D. Zhong and S.F. Chang, Video Object Model and Segmentation for Content-Based Video Indexing, in IEEE Int. Symp. on Circuits and Systems, Hong Kong, June 1997.
 Y. Deng and B. S. Manjunath,Netra-V: Toward an Object-Based Video Representation, IEEE Transactions CSVT, pp.616–627, Vol. 8, No. 3, 1998.
 S. F. Chang,W. Chen, and H. Sundaram, Semantic Visual Templates: Linking Visual Features to Semantics, in IEEE ICIP, 1998.
 S. Santini and R. Jain, Similarity Measures, IEEE Transactions of Pattern Analysis and Machine Intelligence, pp. 871–883, Vol. 21, No. 9, Sep. 1999.
 A. Tversky, Features of Similarity, Psychological Review, pp.327–352, Vol. 84, No. 4, July 1977.
 M. Flicker,H. Sawhney,W. Niblack,J. Ashley,Q. Huang,B. Dom,M. Gorkani,J. Hafner,D. Lee,D. Petkovic,D. Steele, and P. Yanker, Query by Image and Video Content: The QBIC System, IEEE Computer, pp. 23–32, Vol. 28, No. 9, 1995.
 S. Basu,M. Naphade and J. R. Smith, A Statical Modeling Approach to Content Based Retrieval, in IEEE ICASSP, 2002.
 C. Zhang and T. Chen, An Active Learning Framework for Content Based Information Retrieval, IEEE Transactions on Multimedia, Special Issue on Multimedia Database, pp. 260–268, Vol. 4, No. 2, June 2002.
 Y. Park, Efficient Tools for Power Annotation of Visual Contents: A Lexicographical Approach, ACM Multimedia, pp. 426–428, 2000.
 M. Slaney, Semantic-Audio Retrieval, in IEEE ICASSP, 2002.
 M. R. Naphade,I. Kozintsev,T. S. Huang, and K. Ramchandran, A Factor Graph Framework for Semantic Indexing and Retrieval in Video, in Proc. IEEE Workshop on Content-based Access of Image and Video Libraries, 2000.
 C. S. Lee,W.-Y. Ma, and H. J. Zhang, Information Embedding Based on User's Relevance Feedback for Image Retrieval, Invited paper, in SPIE Int. Conf. Multimedia Storage and Archiving Systems IV, Boston, pp. 19–22, Sep. 1999.
 M. R. Naphade,C. Y. Lin,J. R. Smith,B. Tseng, and S. Basu, Learning to Annotate Video Databases, in SPIE Conference on Storage and Retrieval on Media databases, 2002.
 W. Y. Ma and B. S. Manjunath, Texture Features and Learning Similarity, in IEEE Proceedings CVPR '96, pp. 425–430, 1996.
 D. McG. Squire, Learning a Similarity-Based Distance Measure for Image Database Organization from Human Partitionings of an Image Set, in IEEE Workshop on Applications of Computer Vision (WACV'98), pp. 88–93, 1998.
 A. P. Dempster,N. M. Laird, and D. B. Rubin, Maximum-likelihood from Incomplete Data via the EM Algorithm, Journal of Royal Statistical Society, Ser. B, pp. 1–38, Vol. 39, No. 1, 1977.
 G. McLachlan and T. Krishnan, The EM Algorithm and Extensions, Wiley Series in Probability and Statistics, John Wiley & Sons, New York, 1997.
 R. O. Duda,P. E. Hart, and D. G. Stork, Pattern Classification (2nd Edition), John Wiley & Sons, New York, 2000.
 C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery, pp. 121–167, Vol. 2, No. 2, 1998.
 N. Cristianini and J. Shawe-Taylor, An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods, Cambridge University Press, 2000.
 E. Chang and B. Li, On Learning Perceptual Distance Function for Image Retrieval, in IEEE ICASSP, 2002.
 D. Harman, Relevance Feedback Revisited, in Proc. of the Fifteenth Annual International ACM SIGIR Conf. on Research and Development in Information Retrieval, pp. 1–10, 1992.
 G. Salton and C. Buckley, Improving Retrieval Performance by Relevance Feedback, Journal of the American Society for Information Science, pp. 288–297, Vol. 41, No. 4, 1990.
 Y. Rui,T. S. Huang,M. Ortega, and S. Mehrotra, Relevance Feedback: A Power Tool for Interactive Content-based Image Retrieval, IEEE Transactions on Circuits and Systems for Video Technology, pp. 644–655, Vol. 8, No. 5, Sep. 1998.
 Y. Ishikawa,R. Subramanya, and C. Faloutsos, Mindreader: Query Database Through Multiple Examples, in Proc. of the 24th VLDB Conf., New York, 1998.
 Y. Rui and T. S. Huang, Optimizing Learning in Image Retrieval, in Proc. of IEEE int. Conf. On Computer Vision and Pattern Recognition, Jun. 2000.
 Q. Tian,P. Hong,T. S. Huang, Update Relevant Image Weights for Content-based Image Retrieval Using Support Vector Machines, in Proc. Multimedia and Expo IEEE Int. Conf., pp. 1199–1202, Vol. 2, 2000.
 S. Sull,J. Oh,S. Oh,S. M.-H. Song, and S. W. Lee, Relevance Graph-based Image Retrieval, in Proc. Multimedia and Expo IEEE Int. Conf., pp. 713–716, Vol. 2, 2000.
 N. D. Doulamis,A. D. Doulamis, and S. D. Kollias, Non-linear Relevance Feedback: Improving the Performance of Content-based Retrieval Systems, in Proc. Multimedia and Expo IEEE Int. Conf., pp. 331–334, Vol. 1, 2000.
 T. P. Minka and R. W. Picard, Interactive Learning Using a 'Society of Models', M.I.T Media Laboratory Perceptual Computing Section, Technical Report, No. 349.
 E. Chang and B. Li, MEGA - The Maximizing Expected Generalization Algorithm for Learning Complex Query Concepts, UCSB Technical Report, August 2001.
 S. Tong and E. Chang, Support Vector Machine Active Learning for Image Retrieval, ACM Multimedia, 2001.
 D. A. Cohn,Z. Ghahramani, and M. I. Jordan, Active Learning with Statistical Models, Journal of Artificial Intelligence Research, pp. 129–145, 4, 1996.
 A. Krogh and J. Vedelsby, Neural Network Ensembles, Cross Validation, and Active Learning, in Advances in Neural Information Processing Systems G. Tesauro, D. Touretzky, and T. Leen, Eds., Vol. 7, MIT Press, Cambridge, MA, 1995.
 D. D. Lewis and W. A. Gale, A Sequential Algorithm for Training Text Classifiers, in ACM-SIGIR 94, pp. 3–12, Springer-Verlag, London, 1994.
 M. Kearns,M. Li, and L. Valiant, Learning Boolean Formulae, Journal of ACM, pp. 1298–1328, Vol. 41, No. 6, 1994.