Publications

Detailed Information

Music source separation using stacked hourglass networks

Cited 0 time in Web of Science Cited 29 time in Scopus
Authors

Park, Sungheon; Kim, Taehoon; Lee, Kyogu; Kwak, Nojun

Issue Date
2018
Publisher
International Society for Music Information Retrieval
Citation
Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, pp.289-296
Abstract
In this paper, we propose a simple yet effective method for multiple music source separation using convolutional neural networks. Stacked hourglass network, which was originally designed for human pose estimation in natural images, is applied to a music source separation task. The network learns features from a spectrogram image across multiple scales and generates masks for each music source. The estimated mask is refined as it passes over stacked hourglass modules. The proposed framework is able to separate multiple music sources using a single network. Experimental results on MIR-1K and DSD100 datasets validate that the proposed method achieves competitive results comparable to the state-of-the-art methods in multiple music source separation and singing voice separation tasks.
URI
https://hdl.handle.net/10371/206561
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Related Researcher

  • Graduate School of Convergence Science & Technology
  • Department of Intelligence and Information
Research Area Feature Selection and Extraction, Object Detection, Object Recognition

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share