A Multi-stage Approach for Anchor Shot Detection (original) (raw)

Abstract

In this paper we present a novel algorithm for anchor shot detection (ASD). ASD is a fundamental step for segmenting news video into stories that is among key issues for achieving efficient treatment of news-based digital libraries.

The proposed algorithm creates a set of audio/video templates of anchorperson shots in an unsupervised way, then classifies shots by comparing them to the templates. Audio similarity is evaluated by means of a new index and helps to achieve better performance than a pure video approach. The method has been tested on a wide database and compared with other state-of-the-art algorithms, demonstrating its effectiveness with respect to them.

Chapter PDF

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

References

  1. De Santo, M., Percannella, G., Sansone, C., Vento, M.: An Unsupervised Shot Classification System for News Video Story Detection. In: Abate, A.F., Nappi, M., Sebillo, M. (eds.) Multimedia Database and Image Communication, pp. 93–104. World Scientific Publ., Singapore (2005)
    Google Scholar
  2. Gao, X., Tang, X.: Unsupervised Video-Shot Segmentation and Model-Free Anchorperson Detection for News Video Story Parsing. IEEE Transactions on Circuits and Systems for Video Technology 12(9), 765–776 (2002)
    Article Google Scholar
  3. Gunsel, B., Ferman, A.M., Tekalp, A.M.: Video Indexing Through Integration of Syntactic and Semantic Features. In: Proc. of Workshop Applications of Computer Vision, Sarasota, FL, pp. 90–95 (1996)
    Google Scholar
  4. Swanberg, D., Shu, C.F., Jain, R.: Knowledge Guided Parsing in Video Databases. In: Proc. of SPIE Symposium on Electronic Imaging: Science and Technology, San Jose, CA, pp. 13–24 (1993)
    Google Scholar
  5. Smoliar, S.W., Zhang, H.J., Tao, S.Y., Gong, Y.: Automatic Parsing and Indexing of News Video. Multimedia Systems 2(6), 256–265 (1995)
    Article Google Scholar
  6. Hanjalic, A., Lagendijk, R.L., Biemond, J.: Semi-Automatic News Analysis, Indexing, and Classification System Based on Topics Preselection. In: Proc. of SPIE, Electronic Imaging: Storage and Retrieval of Image and Video Databases, San Jose (CA) (1999)
    Google Scholar
  7. Bertini, M., Del Bimbo, A., Pala, P.: Content-Based Indexing and Retrieval of TV News. Pattern Recognition Letters 22, 503–516 (2001)
    Article MATH Google Scholar
  8. Snoek, C.G.M., Worring, M.: Multimodal Video Indexing: A Review of the State-of-the-art. Multimedia Tools and Applications 25, 5–35 (2005)
    Article Google Scholar
  9. Eickeler, S., Muller, S.: Content-based video indexing of TV broadcast news using Hidden Markov Models. In: ICASSP 1999, pp. 2997–3000 (1999)
    Google Scholar
  10. Qi, W., Gu, L., Jiang, H., Chen, X.R., Zhang, H.J.: Integrating Visual, Audio and Text Analysis for News Video. In: 7th IEEE International Conference on Image Processing, Vancouver, British Columbia, Canada (2000)
    Google Scholar
  11. Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
    MATH Google Scholar
  12. Viola, P., Jones, M.: Rapid Object Detection Using a Boosted Cascade of Simple Features. In: Proc. of the IEEE CVPR Conference, vol. 1, pp. 511–518 (2001)
    Google Scholar
  13. Lee, H.Y., Lee, H.K., Ha, Y.H.: Spatial Color Descriptor for Image Retrieval and Video Segmentation. IEEE Transactions on Multimedia 5(3), 358–367 (2003)
    Article MathSciNet Google Scholar
  14. Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A Real-Time Text-Independent Speaker Identification System. In: 12th International Conference on Image Analysis and Processing, September 17-19, pp. 632–637. IEEE Computer Society Press, Mantova, Italy (2003)
    Chapter Google Scholar
  15. Wang, D., Lu, L., Zhang, H.-J.: Speech Segmentation Without Speech Recognition. In: ICASSP 2003, vol. I, pp. 468–471 (2003)
    Google Scholar
  16. Gargi, U., Kasturi, R., Strayer, S.H.: Performance Characterization of Video-Shot-Change Detection Methods. IEEE Trans. on Circuits and Systems for Video Technology 10(1), 1–13 (2000)
    Article Google Scholar
  17. De Santo, M., Percannella, G., Sansone, C., Vento, M.: A Comparison of Unsupervised Shot Classification Algorithms for News Video Segmentation. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A.C., de Ridder, D. (eds.) SSPR&SPR 2004. LNCS, vol. 3138, pp. 233–241. Springer, Heidelberg (2004)
    Chapter Google Scholar

Download references

Author information

Authors and Affiliations

  1. Dip. di Ingegneria dell’Informazione ed Ingegneria Elettrica, Università degli Studi di Salerno, Via Ponte Don Melillo, I, I-84084, Fisciano (SA), Italy
    L. D’Anna, G. Marrazzo, G. Percannella & M. Vento
  2. Dipartimento di Informatica e Sistemistica, Università degli Studi di Napoli “Federico II”, Via Claudio 21, I-80125, Napoli, Italy
    C. Sansone

Authors

  1. L. D’Anna
  2. G. Marrazzo
  3. G. Percannella
  4. C. Sansone
  5. M. Vento

Editor information

Editors and Affiliations

  1. Hong Kong University of Science and Technology,
    Dit-Yan Yeung
  2. Department of Computer Science, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
    James T. Kwok
  3. Instituto de Telecomunicações, Instituto Superior Técnico, Lisbon, Portugal
    Ana Fred
  4. Department of Electrical and Electronic Engineering, University of Cagliari, Piazza d’Armi, 09123, Cagliari, Italy
    Fabio Roli
  5. Faculty of Electrical Engineering, Mathematics and Computer Science, Information and Communication Theory Group, Delft University of Technology, Delft, The Netherlands
    Dick de Ridder

Rights and permissions

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

D’Anna, L., Marrazzo, G., Percannella, G., Sansone, C., Vento, M. (2006). A Multi-stage Approach for Anchor Shot Detection. In: Yeung, DY., Kwok, J.T., Fred, A., Roli, F., de Ridder, D. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR /SPR 2006. Lecture Notes in Computer Science, vol 4109. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11815921\_85

Download citation

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Publish with us