References


[1] J. R. Smith and S. F. Chang, Visualseek: A Fully Automated Content-based Image Query System, in Proceedings of ACM Multimedia, Boston, MA, Nov. 1996.

[2] D. Zhong and S. F. Chang, Spatio-Temporal Video Search Using the Object-based Video Representation, in Proceedings of IEEE International Conference on Image Processing, Santa Barbara, CA, Oct. 1997, Vol. 1, pp. 21–24.

[3] Y. Deng and B. S. Manjunath, Content Based Search of Video Using Color, Texture and Motion, in Proceedings of IEEE International Conference on Image Processing, Santa Barbara, CA, Oct. 1997, Vol. 2, pp. 534–537.

[4] H. Zhang,A. Wang, and Y. Altunbasak, Content-based Video Retrieval and Compression: A Unified Solution, in Proceedings of IEEE International Conference on Image Processing, Santa Barbara, CA, Oct. 1997, Vol. 1, pp. 13–16.

[5] M. M. Yeung and B. Liu, Efficient Matching and Clustering of Video Shots," in Proceedings of IEEE International Conference on Image Processing, Washington, D.C., Oct. 1995, Vol. 1, pp. 338–341.

[6] M. R. Naphade,M. M. Yeung, and B. L. Yeo, A Novel Scheme for Fast and Efficient Video Sequence Matching Using Compact Signatures, in Proceedings of SPIE Storage and Retrieval for Multimedia Databases, Jan. 2000, Vol. 3972, pp. 564–572.

[7] M. R. Naphade and T. S. Huang, Stochastic Modeling of Soundtrack for Efficient Segmentation and Indexing of Video, in Proceedings of SPIE Storage and Retrieval for Multimedia Databases, Jan. 2000, Vol. 3972, pp. 168–176.

[8] D. Ellis, Prediction-driven Computational Auditory Scene Analysis, Ph.D. Thesis, MIT, Cambridge, MA, 1996.

[9] M. Akutsu, A.Hamada, and Y. Tonomura, Video Handling with Music and Speech Detection, IEEE Multimedia, Vol. 5, No. 3, pp. 17–25, 1998.

[10] P. Jang and A. Hauptmann, Learning to Recognize Speech by Watching Television, IEEE Intelligent Systems Magazine, Vol. 14, No. 5, pp. 51–58, 1999.

[11] E. Wold,T. Blum,D. Keislar, and J. Wheaton, Content-based Classification Search and Retrieval of Audio, IEEE Multimedia, Vol. 3, No. 3, pp. 27–36, 1996.

[12] T. Zhang and C. Kuo, An Integrated Approach to Multimodal Media Content Analysis, in Proceedings of SPIE, IS&T Storage and Retrieval for Media Databases, Jan. 2000, Vol. 3972, pp. 506–517.

[13] M. Naphade,T. Kristjansson,B. Frey, and T. S. Huang, Probabilistic Multimedia Objects (Multijects): A Novel Approach to Indexing and Retrieval in Multimedia Systems, in Proceedings of IEEE International Conference on Image Processing, Chicago, IL, Oct. 1998, Vol. 3, pp. 536–540.

[14] M. R. Naphade,R. Wang, and T. S. Huang, Multimodal Pattern Matching for Audio-Visual Query and Retrieval, in Proceedings of SPIE, Storage and Retrieval for Media Databases, Jan. 2001, Vol. 4315, pp. 188–195.

[15] B. L. Yeo and B. Liu, Rapid Scene Change Detection on Compressed Video, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 5, No. 6, pp. 533–544, Dec. 1995.

[16] J. Meng,Y. Juan, and S. F. Chang, Scene Change Detection in a MPEG Compressed Video Sequence, in Proceedings of the SPIE Symposium, San Jose, CA, Feb. 1995, Vol. 2419, pp. 1–11.

[17] H. J. Zhang,C. Y. Low, and S. Smoliar, Video Parsing Using Compressed Data, in Proceedings of SPIE Conference on Image and Video Processing II, San Jose, CA, 1994, pp. 142–149.

[18] M. Naphade,R. Mehrotra,A. M. Ferman,J. Warnick,T. S. Huang, and A. M. Tekalp, A High Performance Shot Boundary Detection Algorithm Using Multiple Cues, in Proceedings of IEEE International Conference on Image Processing, Chicago, IL, Oct. 1998, Vol. 2, pp. 884–887.

[19] S. F. Chang,W. Chen, and H. Sundaram, Semantic Visual Templates - Linking Features to Semantics, in Proceedings of IEEE International Conference on Image Processing, Chicago, IL, Oct. 1998, Vol. 3, pp. 531-535.

[20] R. Qian,N. Hearing, and I. Sezan, A Computational Approach to Semantic Event Detection, in Proceedings of Computer Vision and Pattern Recognition, Fort Collins, CO, June 1999, Vol. 1, pp. 200–206.

[21] W. Wolf, Hidden Markov Model Parsing of Video Programs, in Proceedings of International Conference on Acoustics Signal and Speech Processing, 1997.

[22] M. Ferman and A. M. Tekalp, Probabilistic Analysis and Extraction of Video Content,' in Proceedings of IEEE International Conference on Image Processing, Kobe, Japan, Oct. 1999.

[23] N. Vasconcelos and A. Lippman, Bayesian Modeling of Video Editing and Structure: Semantic Features for Video Summarization and Browsing, in Proceedings of IEEE International Conference on Image Processing, Chicago, IL, Oct. 1998, Vol. 2, pp. 550–555.

[24] M. R. Naphade and T. S. Huang, A Probabilistic Framework for Semantic Video Indexing, Filtering and Retrieval, IEEE Transactions on Multimedia, Special issue on Multimedia over IP, Vol. 3, No. 1, pp. 141–151, March 2001.

[25] M. R. Naphade,I. Kozintsev, and T. S. Huang, Probabilistic Semantic Video Indexing, NIPS 2000, pp. 967–973.

[26] L. R. Rabiner, A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition, Proceedings IEEE, Vol. 77, No. 2, pp. 257–286, Feb. 1989.

[27] M. Brand,N. Oliver, and A. Pentland, Coupled Hidden Markov Models for Complex Action Recognition, in Proceedings of Computer Vision and Pattern Recognition, 1997, pp. 994–999.

[28] Z. Ghahramani and M. Jordan, Factorial Hidden Markov Models, Machine Learning, Vol. 29, pp. 245–273, 1997.

[29] V. Kobla,D. DeMenthon, and D. Doermann, Identifying Sports Video Using Replay, Text and Camera Motion Features, in Proceedings of SPIE Storage and Retrieval for Media Databases, Jan. 2000, Vol. 3972, pp. 332–343.

[30] M. R. Naphade, Video Analysis for Efficient Segmentation, Indexing and Retrieval, M.S. Thesis, University of Illinois at Urbana-Champaign, 1998.

[31] Z. Liu,Y. Wang, and T. Chen, Audio Feature Extraction and Analysis for Scene Segmentation and Classification, VLSI Signal Processing Systems for Signal, Image and Video Technology, Vol. 20, pp. 61–79, Oct. 1998.

[32] E. Scheirer and M. Slaney, Construction and Evaluation of a Robust Multifeatures Speech/Music Discriminator, in Proceedings of IEEE Intl. Conf. on Acoustic, Speech and Signal Processing, Munich, Germany, 1997, Vol. 2, pp. 1331–1334.

[33] J. Nam,A.E. Cetin, and A.H. Tewfik, Speaker Identification and Video Analysis for Hierarchical Video Shot Classification, in Proceedings of IEEE International Conference on Image Processing, Santa Barbara, CA, Oct. 1997, Vol. 2, pp. 550–555.

[34] Y. Wang,Z. Liu, and J. Huang, Multimedia Content Analysis Using Audio and Visual Information, IEEE Signal Processing Magazine, Vol. 17, No. 6, pp. 12–36, Nov. 2000.

[35] H. Wactlar,T. Kanade,M. Smith, and S. Stevens, Intelligent Access to Digital Video: The Informedia Project, IEEE Computer Digital Library Initiative Special Issue, No. 5, May 1996.

[36] Y. Nakamura and T. Kanade, Semantic Analysis for Video Contents Extraction - Spotting by Association in News Video, in Proceedings of ACM International Multimedia Conference, Nov. 1997.

[37] D. D. Saur,Y. P. Tan,S. R. Kulkarni, and P. J. Ramadge, Automated Analysis and Annotation of Basketball Video, in Proceedings of SPIE Symposium, 1997, Vol. 3022, pp. 176–187.

[38] B. Clarkson and A. Pentland, Unsupervised Clustering of Ambulatory Audio and Video, in Proceedings of IEEE International Conference on Accoustics Speech and Signal Processing, 1999.

[39] J. Rehg, K.Murphy, and P. Fieguth, Vision-based Speaker Detection Using Bayesian Networks, in Proceedings of Computer Vision and Pattern Recognition, Fort Collins, CO, June 1999, Vol. 2, pp. 110–116.

[40] M. R. Naphade and T. S. Huang, Recognizing High-level Audio-Visual Concepts Using Context," submitted to IEEE International Conference on Image Processing, 2001.

[41] M. R. Naphade,A. Garg, and T. S. Huang, Duration Dependent Input Output Markov Models for Audio-Visual Event Detection, submitted to IEEE International Conference on Multimedia and Expo, Tokyo, Japan, 2001.

[42] A. Garg,V. Pavlovic,M. Rehg, and T. S. Huang, Integrated Audio/Visual Speaker Detection Using Dynamic Bayesian Networks, in Proceedings of IEEE Conference on Automatic Face and Gesture Recognition, March 2000.

[43] T. Chen and R. Rao, Audio-Visual Integration in Multimodal Communication, IEEE Proceedings, Vol. 86, No. 5, pp. 837–852, 1998.

[44] R. O. Duda and P. E. Hart, Pattern Classification and Scene Analysis, Wiley Eastern, New York, 1973.

[45] H. V. Poor, An Introduction to Signal Detection and Estimation, Springer-Verlag, New York, 2 edition, 1999.

[46] L. E. Baum and T. Petrie, Statistical Inference for Probabilistic Functions of Finite State Markov Chains, Annals of Mathematical Statistics, Vol. 37, pp. 1559–1563, 1966.

[47] Y. Bengio and P. Frasconi, Input/Output HMMs for Sequence Processing, IEEE Transactions on Neural Networks, Vol. 7, No. 5, pp. 1231–1249, 1996.

[48] P. Ramesh and J. Wilpon, Modeling State Durations in Hidden Markov Models for Automatic Speech Recognition, in Proceedings of International Conference on Acoustics, Speech and Signal processing, Mar. 1992, Vol. 1, pp. 381–384.

[49] M. R. Naphade and T. S. Huang, Semantic Video Indexing Using a Probabilistic Framework, in Proceedings of IAPR International Conference on Pattern Recognition, Barcelona, Spain, Sep. 2000, Vol. 3, pp. 83–88.




Handbook of Video Databases. Design and Applications
Handbook of Video Databases: Design and Applications (Internet and Communications)
ISBN: 084937006X
EAN: 2147483647
Year: 2003
Pages: 393

flylib.com © 2008-2017.
If you may any questions please contact us: flylib@qtcs.net