Bottom-up broadcast neural network for music genre classification (original) (raw)

References

  1. Bertin-Mahieux T, Ellis DP, Whitman B, Lamere P (2011) The million song dataset. In: Ismir, vol 2, pp 10
  2. Cano P, Gómez Gutiérrez E, Gouyon F, Herrera Boyer P, Koppenberger M, Ong BS, Serra X, Streich S, Wack N (2006) Ismir 2004 audio description contest
  3. Choi K, Fazekas G, Sandler M, Cho K (2017) Transfer learning for music classification and regression tasks. arXiv:1703.09179
  4. Chollet F et al (2015) Keras
  5. Costa YM, Oliveira LS, Silla CN Jr (2017) An evaluation of convolutional neural networks for music classification using spectrograms. Applied Soft Computing 52:28–38
    Article Google Scholar
  6. Dai J, Liu W, Ni C, Dong L, Yang H (2015) “multilingual” deep neural network for music genre classification. In: Sixteenth annual conference of the international speech communication association
  7. Freitag M, Amiriparian S, Pugachevskiy S, Cummins N (2017) Schuller, B.: audeep: Unsupervised learning of representations from audio with deep recurrent neural networks. J Mach Learn Res 18(1):6340–6344
    MATH Google Scholar
  8. Fu Z, Lu G, Ting KM, Zhang D (2011) A survey of audio-based music classification and annotation. IEEE Trans Multimedia 13(2):303–319
    Article Google Scholar
  9. Hafemann LG, Oliveira LS, Cavalin P (2014) Forest species recognition using deep convolutional neural networks. In: 2014 22nd international conference on Pattern recognition (ICPR), IEEE, pp 1103–1107
  10. Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. ACM SIGKDD Explorations Newsletter 11(1):10–18
    Article Google Scholar
  11. Hamel P, Eck D (2010) Learning features from music audio with deep belief networks. In: ISMIR, vol 10, Utrecht, pp 339–344
  12. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
  13. Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: CVPR, vol 1, pp 3
  14. Ioffe S, Szegedy C (2015) Batch normalization:, Accelerating deep network training by reducing internal covariate shift. arXiv:1502.03167
  15. Jakubik J (2017) Evaluation of gated recurrent neural networks in music classification tasks. In: International conference on information systems architecture and technology, Springer, pp 27–37
  16. Jeong Y, Choi K, Jeong H (2017) Dlr:, Toward a deep learned rhythmic representation for music content analysis. arXiv:1712.05119
  17. Karunakaran N, Arya A (2018) A scalable hybrid classifier for music genre classification using machine learning concepts and spark. In: 2018 International conference on intelligent autonomous systems (ICoIAS), IEEE, pp 128–135
  18. Kereliuk C, Sturm BL, Larsen J (2015) Deep learning and music adversaries. IEEE Trans Multimedia 17(11):2059–2071
    Article Google Scholar
  19. Kingma DP, Ba J (2014) Adam:, A method for stochastic optimization. arXiv:1412.6980
  20. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
  21. Lee J, Nam J (2017) Multi-level and multi-scale feature aggregation using pretrained convolutional neural networks for music auto-tagging. IEEE Signal Process Lett 24(8):1208–1212
    Article Google Scholar
  22. Lee H, Pham P, Largman Y, Ng AY (2009) Unsupervised feature learning for audio classification using convolutional deep belief networks. In: Advances in neural information processing systems, pp1096–1104
  23. Li TL, Chan AB, Chun A (2010) Automatic musical pattern feature extraction using convolutional neural network. In: Proc Int Conf Data mining and applications
  24. Lin M, Chen Q, Yan S (2013) Network in network. arXiv:1312.4400
  25. Lykartsis A, Lerch A (2015) Beat histogram features for rhythm-based musical genre classification using multiple novelty functions. In: Proceedings of the 16th ISMIR Conference, pp 434–440
  26. Marchand U, Peeters G (2016) The extended ballroom dataset
  27. Marchand U, Peeters G (2014) The modulation scale spectrum and its application to rhythm-content analysis. In: DAFX (Digital audio effects)
  28. Marchand U, Peeters G (2016) Scale and shift invariant time/frequency representation using auditory statistics: Application to rhythm description. In: 2016 IEEE 26th international workshop on Machine learning for signal processing (MLSP), IEEE, pp 1–6
  29. McFee B, Raffel C, Liang D, Ellis DP, McVicar M, Battenberg E (2015) Nieto, O.: librosa: Audio and music signal analysis in python. In: Proceedings of the 14th python in science conference, pp 18–25
  30. Medhat F, Chesmore D, Robinson J (2017) Automatic classification of music genre using masked conditional neural networks. In: 2017 IEEE international conference on Data mining (ICDM), IEEE, pp 979–984
  31. Nanni L, Costa YM, Lucio DR, Silla CN Jr, Brahnam S (2017) Combining visual and acoustic features for audio classification tasks. Pattern Recogn Lett 88:49–56
    Article Google Scholar
  32. Nguyen QH, Do TT, Chu TB, Trinh LV, Nguyen DH, Phan CV, Phan TA, Doan DV, Pham HN, Nguyen BP et al (2019) Music genre classification using residual attention network. In: 2019 International conference on system science and engineering (ICSSE), IEEE, pp 115–119
  33. Pons J, Serra X (2017) Designing efficient architectures for modeling temporal features with convolutional neural networks. In: 2017 IEEE international conference on Acoustics, speech and signal processing (ICASSP), IEEE, pp 2472–2476
  34. Pons J, Serra X (2018) Randomly weighted cnns for (music) audio classification. arXiv:1805.00237
  35. Pons J, Lidy T, Serra X (2016) Experimenting with musically motivated convolutional neural networks. In: 2016 14th international workshop on Content-based multimedia indexing (CBMI), IEEE, pp 1–6
  36. Salamon J, Bello JP (2017) Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal Processing Letters 24(3):279–283
    Article Google Scholar
  37. Senac C, Pellegrini T, Mouret F, Pinquier J (2017) Music feature maps with convolutional neural networks for music genre classification. In: Proceedings of the 15th international workshop on content-based multimedia indexing, ACM, pp 19
  38. Sigtia S, Dixon S (2014) Improved music feature learning with deep neural networks. In: 2014 IEEE international conference on Acoustics, speech and signal processing (ICASSP), IEEE, pp 6959–6963
  39. Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
  40. Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5):293–302
    Article Google Scholar
  41. Wang Y, Lin X, Wu L, Zhang W, Zhang Q, Huang X (2015) Robust subspace clustering for multi-view data by exploiting correlation consensus. IEEE Trans Image Process 24(11):3939–3949
    Article MathSciNet Google Scholar
  42. Wang Y, Lin X, Wu L, Zhang W (2017) Effective multi-query expansions:, Collaborative deep networks for robust landmark retrieval. arXiv:1701.05003
  43. Wang Y, Zhang W, Wu L, Lin X, Zhao X (2017) Unsupervised metric fusion over multiview data by graph random walk-based cross-view diffusion. IEEE Trans Neural Netw Learn Syst 28(1):57–70
    Article Google Scholar
  44. Wang Y, Wu L, Lin X, Gao J (2018) Multiview spectral clustering via structured low-rank matrix factorization. IEEE Transactions on Neural Networks and Learning Systems
  45. Yu Y, Luo S, Liu S, Qiao H, Liu Y, Feng L (2020) Deep attention based music genre classification. Neurocomputing 372:84–91
    Article Google Scholar
  46. Zhang W, Lei W, Xu X, Xing X (2016) Improved music genre classification with convolutional neural networks. In: INTERSPEECH, pp 3304–3308

Download references