2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)

Abstract

This paper proposes a supervised deep hashing approach for highly efficient and effective cover song detection. Our system consists of two identical sub-neural networks, each with a hash layer that learns a binary representation of input audio in the form of spectral features. A loss function joins the outputs of the two sub-networks by minimizing the Hamming distance for a pair of audio files covering the same musical work. We further enhance system performance with loudness embedding, beat synchronization, and early fusion of input audio features. The resulting 128-bit hash reaches state-of-the-art performance in terms of mean pairwise accuracy. This system demonstrates the feasibility of memory-efficient, real-time cover song detection with satisfactory accuracy at large scale.
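
The architecture described above is essentially a siamese (twin-branch) network whose final hash layer produces relaxed binary codes, trained with a pairwise loss that shrinks the Hamming distance between codes of recordings covering the same work. The following is a minimal PyTorch sketch of that idea; the branch layout, feature dimension, margin, and helper names (`HashingBranch`, `pairwise_hash_loss`) are illustrative assumptions, not the authors' exact configuration, which additionally relies on loudness embedding, beat synchronization, and early feature fusion.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HashingBranch(nn.Module):
    """One of the two weight-shared sub-networks: spectral features -> K-bit hash code."""
    def __init__(self, feat_dim=512, hash_bits=128):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(feat_dim, 1024), nn.ReLU(),
            nn.Linear(1024, 512), nn.ReLU(),
        )
        # Hash layer: tanh outputs lie in (-1, 1) and are binarized at retrieval time.
        self.hash_layer = nn.Linear(512, hash_bits)

    def forward(self, x):
        return torch.tanh(self.hash_layer(self.encoder(x)))

def pairwise_hash_loss(h1, h2, same_work, margin=64.0):
    """Contrastive-style loss on a relaxed Hamming distance (assumed formulation).

    same_work: 1.0 if the two recordings cover the same musical work, else 0.0.
    The squared Euclidean distance between tanh outputs, divided by 4,
    approximates the Hamming distance between the binarized codes.
    """
    dist = ((h1 - h2) ** 2).sum(dim=1) / 4.0          # relaxed Hamming distance
    pos = same_work * dist                            # pull matching covers together
    neg = (1.0 - same_work) * F.relu(margin - dist)   # push non-covers apart
    return (pos + neg).mean()

# Usage: shared weights mean the same branch encodes both inputs of a pair.
branch = HashingBranch(feat_dim=512, hash_bits=128)
xa, xb = torch.randn(8, 512), torch.randn(8, 512)     # batch of spectral-feature pairs
labels = torch.randint(0, 2, (8,)).float()            # 1 = same work, 0 = different
loss = pairwise_hash_loss(branch(xa), branch(xb), labels)
codes = (branch(xa) > 0).int()                        # 128-bit codes for indexing/retrieval
```

At query time only the binarized codes are stored and compared, which is what makes the approach memory-efficient and fast enough for large-scale retrieval: Hamming distances over 128-bit codes can be computed with bitwise operations rather than full feature matching.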
