A Multi-stage Approach for Anchor Shot Detection
Abstract
In this paper we present a novel algorithm for anchor shot detection (ASD). ASD is a fundamental step in segmenting news video into stories, which is itself a key issue for the efficient management of news-based digital libraries.
The proposed algorithm builds a set of audio/video templates of anchorperson shots in an unsupervised way, then classifies shots by comparing them to the templates. Audio similarity is evaluated by means of a new index and helps the method outperform a purely video-based approach. The method has been tested on a large database and compared with other state-of-the-art algorithms, demonstrating its effectiveness relative to them.
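The two stages described above (unsupervised template creation, then template-based classification) can be illustrated with a minimal sketch. It is not the authors' implementation: the `Shot` descriptors, the greedy grouping rule, the Euclidean distances, and every threshold are assumptions chosen for illustration, and the paper's new audio similarity index is not reproduced here.

```python
from dataclasses import dataclass
from math import dist


@dataclass
class Shot:
    video: tuple  # hypothetical visual descriptor, e.g. a small colour histogram
    audio: tuple  # hypothetical audio descriptor, e.g. speaker features


def mean_shot(cluster):
    """Average the descriptors of a cluster into a single template."""
    n = len(cluster)
    video = tuple(sum(s.video[i] for s in cluster) / n
                  for i in range(len(cluster[0].video)))
    audio = tuple(sum(s.audio[i] for s in cluster) / n
                  for i in range(len(cluster[0].audio)))
    return Shot(video, audio)


def build_templates(shots, radius=0.2, min_size=3):
    """Greedy unsupervised grouping (an assumed stand-in for the paper's method).

    A shot joins the first cluster whose seed is visually within `radius`;
    only clusters that recur at least `min_size` times become templates,
    on the assumption that anchor shots repeat throughout a news programme.
    """
    clusters = []
    for shot in shots:
        for cluster in clusters:
            if dist(shot.video, cluster[0].video) <= radius:
                cluster.append(shot)
                break
        else:
            clusters.append([shot])
    return [mean_shot(c) for c in clusters if len(c) >= min_size]


def is_anchor(shot, templates, video_thr=0.2, audio_thr=0.2):
    """A shot is labelled anchor if some template matches on video AND audio."""
    return any(
        dist(shot.video, t.video) <= video_thr
        and dist(shot.audio, t.audio) <= audio_thr
        for t in templates
    )
```

Requiring both modalities to match is one simple way to show how audio can reject shots that merely look like the studio set; the actual combination rule used in the paper may differ.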
Author information
Authors and Affiliations
- Dip. di Ingegneria dell’Informazione ed Ingegneria Elettrica, Università degli Studi di Salerno, Via Ponte Don Melillo, I, I-84084, Fisciano (SA), Italy
  L. D’Anna, G. Marrazzo, G. Percannella & M. Vento
- Dipartimento di Informatica e Sistemistica, Università degli Studi di Napoli “Federico II”, Via Claudio 21, I-80125, Napoli, Italy
  C. Sansone
Authors
- L. D’Anna
- G. Marrazzo
- G. Percannella
- C. Sansone
- M. Vento
Editor information
Editors and Affiliations
- Hong Kong University of Science and Technology,
  Dit-Yan Yeung
- Department of Computer Science, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
  James T. Kwok
- Instituto de Telecomunicações, Instituto Superior Técnico, Lisbon, Portugal
  Ana Fred
- Department of Electrical and Electronic Engineering, University of Cagliari, Piazza d’Armi, 09123, Cagliari, Italy
  Fabio Roli
- Faculty of Electrical Engineering, Mathematics and Computer Science, Information and Communication Theory Group, Delft University of Technology, Delft, The Netherlands
  Dick de Ridder
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
D’Anna, L., Marrazzo, G., Percannella, G., Sansone, C., Vento, M. (2006). A Multi-stage Approach for Anchor Shot Detection. In: Yeung, D.-Y., Kwok, J.T., Fred, A., Roli, F., de Ridder, D. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR/SPR 2006. Lecture Notes in Computer Science, vol 4109. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11815921_85
- DOI: https://doi.org/10.1007/11815921_85
- Publisher Name: Springer, Berlin, Heidelberg
- Print ISBN: 978-3-540-37236-3
- Online ISBN: 978-3-540-37241-7
- eBook Packages: Computer Science; Computer Science (R0); Springer Nature Proceedings Computer Science