Chinese–Vietnamese bilingual news event summarization based on distributed graph ranking (original) (raw)

Abstract

Multi-language news event summarization aims to quickly obtain the important information from lots of related news texts written in different languages automatically. Considering that the main expressed information for a same event is similar no matter what language it is presented, the paper proposes a novel unified approach to summarize important information from the monolingual and Chinese–Vietnamese bilingual news sets simultaneously. Firstly, analyzing the sentence dependence relationship, making rules to segment sentence into different grammatical parts, a bilingual dictionary is used to set up bilingual feature space. Secondly, Chinese–Vietnamese sentence graph model is calculated distributively. Finally, using the features that graph nodes can boost each other and fusing context information, the sentences are ranked based on whether they can represent the important information. The experimental result shows that our method is effective.

Access this article

Log in via an institution

Subscribe and save

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Notes

References

  1. Gambhir M, Gupta V (2017) Recent automatic text summarization techniques: a survey. Artif Intell Rev 47(1):1–66
    Article Google Scholar
  2. Chopra S, Auli M, Rush AM (2016) Abstractive sentence summarization with attentive recurrent neural networks. In: Proceedings of 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016. ACL, San Diego, CA, US, pp 93–98
  3. Hong K, Conroy JM, Favre B, Kuesza A, Lin H, Nenkova A (2014) A repository of state of the art and competitive baseline summaries for generic news summarization. In: Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014. ELRA, Reykjavik, Iceland, pp 1608–1616
  4. Baralis E, Cagliero L, Mahoto N, Fiori A (2013) GRAPHSUM: discovering correlations among multiple terms for graph-based summarization. Inf Sci 249(2013):96–109
    Article MathSciNet Google Scholar
  5. Luhn HP (1958) The automatic creation of literature abstracts. IBM J Res Dev 2(2):159–165
    Article MathSciNet Google Scholar
  6. Shen D, Sun J T, Li H, Yang Q, Chen Z (2007) Document summarization using conditional random fields. In: Proceedings of 20th International Joint Conference on Artificial Intelligence, IJCAI 2007. IJCAI, Morgan Kaufmann, Hyderabad, India, pp 2862–2867
  7. Li L, Zhou K, Xue GR, Zha H, Yu Y (2009) Enhancing diversity, coverage and balance for summarization through structure learning. In: Proceedings of the 18th International World Wide Web Conference, WWW 2009. ACM, Madrid, Spain, pp 71–80
  8. Wan X (2010) Towards a unified approach to simultaneous single-document and multi-document summarizations. In: Proceedings of the 23rd International Conference on Computational Linguistics, Coling 2010. ACM, Beijing, China, pp 1137–1145
  9. Erkan G, Radev DR (2004) Lexrank: graph-based lexical centrality as salience in text summarization. J Artif Intell Res 22(204):457–479
    Article Google Scholar
  10. Li Y, Li S (2014) Query-focused multi-document summarization: combining a novel topic model with graph-based semi-supervised learning. In: Proceedings of the International Conference on Computational Linguistics, Coling 2014. ACM, Dublin, Ireland, pp 1197–1207
  11. Xu JA, Liu JM, Araki K (2014) A hybrid topic model for multi-document summarization. IEICE Trans Inf Syst 98(5):1089–1094
    Article Google Scholar
  12. Wan X, Li H, Xiao J (2010) Cross-language document summarization based on machine translation quality prediction. In: Proceeding of the Annual Meeting of the Association for Computational Linguistics, ACL2010. ACL, Uppsala, Sweden, pp 917–926
  13. García JF, Carriegos MV (2019) On parallel computation of centrality measures of graphs. J Supercomput 75(3):1410–1428
    Article Google Scholar
  14. Nasir M, Muhammad K, Lloret J, Sangaiah AK, Sajjad M (2019) Fog computing enabled cost-effective distributed summarization of surveillance videos for smart cities. J Parallel Distrib Comput 126(2019):161–170
    Article Google Scholar
  15. Samuel J, Yuan X, Yuan X, Walton B (20100 Mining online full-text literature for novel protein interaction discovery. In: Proceeding of International Workshop on Data Mining for High Throughput Data from Genome-Wide Association Studies. IEEE International Conference on Bioinformatics and Biomedicine, Hong Kong, Dec 18–21
  16. Gu L, Han Y, Wang C, Chen W, Jiao J, Yuan X (2018) Module overlapping structure detection in PPI using an improved link similarity-based Markov clustering algorithm. Neural Comput Appl 31(5):1481–1490
    Article Google Scholar
  17. Li Y, McLean D, Bandar ZA, O’Shea JD, Crockett K (2006) Sentence similarity based on semantic nets and corpus statistics. IEEE Trans Knowl Data Eng 18(8):1138–1150
    Article Google Scholar
  18. Lin C Y, Hovy E (2003) Automatic evaluation of summaries using n-gram co-occurrence statistics. In: Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, NAACL 2003. NAACL, Edmonton, Canada, pp 150–157
  19. Mihalcea R, Tarau P (2005) A language independent algorithm for single and multiple document summarization. Unt Sch Works 2005:19–24
    Google Scholar

Download references

Acknowledgements

The work was supported by National Natural Science Foundation of China (Grant Nos. 61972186, 61732005, 61761026, 61672271 and 61762056), National Key Research and Development Plan (Grant No. 2018YFC0830101), Yunnan high-tech industry development project (Grant No. 201606), Natural Science Foundation of Yunnan Province (Grant No. 2018FB104), and Talent Fund for Kunming University of Science and Technology (Grant No. KKSY201703005).

Author information

Authors and Affiliations

  1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, 650500, China
    Shengxiang Gao, Zhengtao Yu, Yunlong Li, Yusen Wang & Yafei Zhang
  2. Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming, 650500, China
    Shengxiang Gao, Zhengtao Yu, Yunlong Li, Yusen Wang & Yafei Zhang

Authors

  1. Shengxiang Gao
  2. Zhengtao Yu
  3. Yunlong Li
  4. Yusen Wang
  5. Yafei Zhang

Corresponding author

Correspondence toZhengtao Yu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

About this article

Cite this article

Gao, S., Yu, Z., Li, Y. et al. Chinese–Vietnamese bilingual news event summarization based on distributed graph ranking.J Supercomput 76, 1034–1048 (2020). https://doi.org/10.1007/s11227-019-03006-1

Download citation

Keywords