Bottom-up broadcast neural network for music genre classification

References

Bertin-Mahieux T, Ellis DP, Whitman B, Lamere P (2011) The million song dataset. In: Ismir, vol 2, pp 10
Cano P, Gómez Gutiérrez E, Gouyon F, Herrera Boyer P, Koppenberger M, Ong BS, Serra X, Streich S, Wack N (2006) Ismir 2004 audio description contest
Choi K, Fazekas G, Sandler M, Cho K (2017) Transfer learning for music classification and regression tasks. arXiv:1703.09179
Chollet F et al (2015) Keras
Costa YM, Oliveira LS, Silla CN Jr (2017) An evaluation of convolutional neural networks for music classification using spectrograms. Applied Soft Computing 52:28–38
Dai J, Liu W, Ni C, Dong L, Yang H (2015) “multilingual” deep neural network for music genre classification. In: Sixteenth annual conference of the international speech communication association
Freitag M, Amiriparian S, Pugachevskiy S, Cummins N, Schuller B (2017) auDeep: Unsupervised learning of representations from audio with deep recurrent neural networks. J Mach Learn Res 18(1):6340–6344
Fu Z, Lu G, Ting KM, Zhang D (2011) A survey of audio-based music classification and annotation. IEEE Trans Multimedia 13(2):303–319
Hafemann LG, Oliveira LS, Cavalin P (2014) Forest species recognition using deep convolutional neural networks. In: 2014 22nd international conference on Pattern recognition (ICPR), IEEE, pp 1103–1107
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. ACM SIGKDD Explorations Newsletter 11(1):10–18
Hamel P, Eck D (2010) Learning features from music audio with deep belief networks. In: ISMIR, vol 10, Utrecht, pp 339–344
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: CVPR, vol 1, pp 3
Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv:1502.03167
Jakubik J (2017) Evaluation of gated recurrent neural networks in music classification tasks. In: International conference on information systems architecture and technology, Springer, pp 27–37
Jeong Y, Choi K, Jeong H (2017) DLR: Toward a deep learned rhythmic representation for music content analysis. arXiv:1712.05119
Karunakaran N, Arya A (2018) A scalable hybrid classifier for music genre classification using machine learning concepts and spark. In: 2018 International conference on intelligent autonomous systems (ICoIAS), IEEE, pp 128–135
Kereliuk C, Sturm BL, Larsen J (2015) Deep learning and music adversaries. IEEE Trans Multimedia 17(11):2059–2071
Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv:1412.6980
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
Lee J, Nam J (2017) Multi-level and multi-scale feature aggregation using pretrained convolutional neural networks for music auto-tagging. IEEE Signal Process Lett 24(8):1208–1212
Lee H, Pham P, Largman Y, Ng AY (2009) Unsupervised feature learning for audio classification using convolutional deep belief networks. In: Advances in neural information processing systems, pp 1096–1104
Li TL, Chan AB, Chun A (2010) Automatic musical pattern feature extraction using convolutional neural network. In: Proc Int Conf Data mining and applications
Lin M, Chen Q, Yan S (2013) Network in network. arXiv:1312.4400
Lykartsis A, Lerch A (2015) Beat histogram features for rhythm-based musical genre classification using multiple novelty functions. In: Proceedings of the 16th ISMIR Conference, pp 434–440
Marchand U, Peeters G (2016) The extended ballroom dataset
Marchand U, Peeters G (2014) The modulation scale spectrum and its application to rhythm-content analysis. In: DAFX (Digital audio effects)
Marchand U, Peeters G (2016) Scale and shift invariant time/frequency representation using auditory statistics: Application to rhythm description. In: 2016 IEEE 26th international workshop on Machine learning for signal processing (MLSP), IEEE, pp 1–6
McFee B, Raffel C, Liang D, Ellis DP, McVicar M, Battenberg E, Nieto O (2015) librosa: Audio and music signal analysis in python. In: Proceedings of the 14th python in science conference, pp 18–25
Medhat F, Chesmore D, Robinson J (2017) Automatic classification of music genre using masked conditional neural networks. In: 2017 IEEE international conference on Data mining (ICDM), IEEE, pp 979–984
Nanni L, Costa YM, Lucio DR, Silla CN Jr, Brahnam S (2017) Combining visual and acoustic features for audio classification tasks. Pattern Recogn Lett 88:49–56
Nguyen QH, Do TT, Chu TB, Trinh LV, Nguyen DH, Phan CV, Phan TA, Doan DV, Pham HN, Nguyen BP et al (2019) Music genre classification using residual attention network. In: 2019 International conference on system science and engineering (ICSSE), IEEE, pp 115–119
Pons J, Serra X (2017) Designing efficient architectures for modeling temporal features with convolutional neural networks. In: 2017 IEEE international conference on Acoustics, speech and signal processing (ICASSP), IEEE, pp 2472–2476
Pons J, Serra X (2018) Randomly weighted cnns for (music) audio classification. arXiv:1805.00237
Pons J, Lidy T, Serra X (2016) Experimenting with musically motivated convolutional neural networks. In: 2016 14th international workshop on Content-based multimedia indexing (CBMI), IEEE, pp 1–6
Salamon J, Bello JP (2017) Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal Processing Letters 24(3):279–283
Senac C, Pellegrini T, Mouret F, Pinquier J (2017) Music feature maps with convolutional neural networks for music genre classification. In: Proceedings of the 15th international workshop on content-based multimedia indexing, ACM, pp 19
Sigtia S, Dixon S (2014) Improved music feature learning with deep neural networks. In: 2014 IEEE international conference on Acoustics, speech and signal processing (ICASSP), IEEE, pp 6959–6963
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5):293–302
Wang Y, Lin X, Wu L, Zhang W, Zhang Q, Huang X (2015) Robust subspace clustering for multi-view data by exploiting correlation consensus. IEEE Trans Image Process 24(11):3939–3949
Wang Y, Lin X, Wu L, Zhang W (2017) Effective multi-query expansions: Collaborative deep networks for robust landmark retrieval. arXiv:1701.05003
Wang Y, Zhang W, Wu L, Lin X, Zhao X (2017) Unsupervised metric fusion over multiview data by graph random walk-based cross-view diffusion. IEEE Trans Neural Netw Learn Syst 28(1):57–70
Wang Y, Wu L, Lin X, Gao J (2018) Multiview spectral clustering via structured low-rank matrix factorization. IEEE Transactions on Neural Networks and Learning Systems
Yu Y, Luo S, Liu S, Qiao H, Liu Y, Feng L (2020) Deep attention based music genre classification. Neurocomputing 372:84–91
Zhang W, Lei W, Xu X, Xing X (2016) Improved music genre classification with convolutional neural networks. In: INTERSPEECH, pp 3304–3308
