A Multi-stage Approach for Anchor Shot Detection (original) (raw)

Abstract

In this paper we present a novel algorithm for anchor shot detection (ASD). ASD is a fundamental step for segmenting news video into stories that is among key issues for achieving efficient treatment of news-based digital libraries.

The proposed algorithm creates a set of audio/video templates of anchorperson shots in an unsupervised way, then classifies shots by comparing them to the templates. Audio similarity is evaluated by means of a new index and helps to achieve better performance than a pure video approach. The method has been tested on a wide database and compared with other state-of-the-art algorithms, demonstrating its effectiveness with respect to them.

Chapter PDF

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

References

De Santo, M., Percannella, G., Sansone, C., Vento, M.: An Unsupervised Shot Classification System for News Video Story Detection. In: Abate, A.F., Nappi, M., Sebillo, M. (eds.) Multimedia Database and Image Communication, pp. 93–104. World Scientific Publ., Singapore (2005)
Google Scholar
Gao, X., Tang, X.: Unsupervised Video-Shot Segmentation and Model-Free Anchorperson Detection for News Video Story Parsing. IEEE Transactions on Circuits and Systems for Video Technology 12(9), 765–776 (2002)
Article Google Scholar
Gunsel, B., Ferman, A.M., Tekalp, A.M.: Video Indexing Through Integration of Syntactic and Semantic Features. In: Proc. of Workshop Applications of Computer Vision, Sarasota, FL, pp. 90–95 (1996)
Google Scholar
Swanberg, D., Shu, C.F., Jain, R.: Knowledge Guided Parsing in Video Databases. In: Proc. of SPIE Symposium on Electronic Imaging: Science and Technology, San Jose, CA, pp. 13–24 (1993)
Google Scholar
Smoliar, S.W., Zhang, H.J., Tao, S.Y., Gong, Y.: Automatic Parsing and Indexing of News Video. Multimedia Systems 2(6), 256–265 (1995)
Article Google Scholar
Hanjalic, A., Lagendijk, R.L., Biemond, J.: Semi-Automatic News Analysis, Indexing, and Classification System Based on Topics Preselection. In: Proc. of SPIE, Electronic Imaging: Storage and Retrieval of Image and Video Databases, San Jose (CA) (1999)
Google Scholar
Bertini, M., Del Bimbo, A., Pala, P.: Content-Based Indexing and Retrieval of TV News. Pattern Recognition Letters 22, 503–516 (2001)
Article MATH Google Scholar
Snoek, C.G.M., Worring, M.: Multimodal Video Indexing: A Review of the State-of-the-art. Multimedia Tools and Applications 25, 5–35 (2005)
Article Google Scholar
Eickeler, S., Muller, S.: Content-based video indexing of TV broadcast news using Hidden Markov Models. In: ICASSP 1999, pp. 2997–3000 (1999)
Google Scholar
Qi, W., Gu, L., Jiang, H., Chen, X.R., Zhang, H.J.: Integrating Visual, Audio and Text Analysis for News Video. In: 7th IEEE International Conference on Image Processing, Vancouver, British Columbia, Canada (2000)
Google Scholar
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
MATH Google Scholar
Viola, P., Jones, M.: Rapid Object Detection Using a Boosted Cascade of Simple Features. In: Proc. of the IEEE CVPR Conference, vol. 1, pp. 511–518 (2001)
Google Scholar
Lee, H.Y., Lee, H.K., Ha, Y.H.: Spatial Color Descriptor for Image Retrieval and Video Segmentation. IEEE Transactions on Multimedia 5(3), 358–367 (2003)
Article MathSciNet Google Scholar
Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A Real-Time Text-Independent Speaker Identification System. In: 12th International Conference on Image Analysis and Processing, September 17-19, pp. 632–637. IEEE Computer Society Press, Mantova, Italy (2003)
Chapter Google Scholar
Wang, D., Lu, L., Zhang, H.-J.: Speech Segmentation Without Speech Recognition. In: ICASSP 2003, vol. I, pp. 468–471 (2003)
Google Scholar
Gargi, U., Kasturi, R., Strayer, S.H.: Performance Characterization of Video-Shot-Change Detection Methods. IEEE Trans. on Circuits and Systems for Video Technology 10(1), 1–13 (2000)
Article Google Scholar
De Santo, M., Percannella, G., Sansone, C., Vento, M.: A Comparison of Unsupervised Shot Classification Algorithms for News Video Segmentation. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A.C., de Ridder, D. (eds.) SSPR&SPR 2004. LNCS, vol. 3138, pp. 233–241. Springer, Heidelberg (2004)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Dip. di Ingegneria dell’Informazione ed Ingegneria Elettrica, Università degli Studi di Salerno, Via Ponte Don Melillo, I, I-84084, Fisciano (SA), Italy
L. D’Anna, G. Marrazzo, G. Percannella & M. Vento
Dipartimento di Informatica e Sistemistica, Università degli Studi di Napoli “Federico II”, Via Claudio 21, I-80125, Napoli, Italy
C. Sansone

Authors

L. D’Anna
G. Marrazzo
G. Percannella
C. Sansone
M. Vento

Editor information

Editors and Affiliations

Hong Kong University of Science and Technology,
Dit-Yan Yeung
Department of Computer Science, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
James T. Kwok
Instituto de Telecomunicações, Instituto Superior Técnico, Lisbon, Portugal
Ana Fred
Department of Electrical and Electronic Engineering, University of Cagliari, Piazza d’Armi, 09123, Cagliari, Italy
Fabio Roli
Faculty of Electrical Engineering, Mathematics and Computer Science, Information and Communication Theory Group, Delft University of Technology, Delft, The Netherlands
Dick de Ridder

Rights and permissions

Copyright information

About this paper

Cite this paper

D’Anna, L., Marrazzo, G., Percannella, G., Sansone, C., Vento, M. (2006). A Multi-stage Approach for Anchor Shot Detection. In: Yeung, DY., Kwok, J.T., Fred, A., Roli, F., de Ridder, D. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR /SPR 2006. Lecture Notes in Computer Science, vol 4109. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11815921\_85

Download citation

.RIS
.ENW
.BIB
DOI: https://doi.org/10.1007/11815921\_85
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37236-3
Online ISBN: 978-3-540-37241-7
eBook Packages: Computer Science Computer Science (R0)Springer Nature Proceedings Computer Science

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.