Share Email Print

Proceedings Paper

Segmentation of singing voice within music signals
Author(s): Phabio J. Setubal; Sidnei Noceti Filho; Rui Seara
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

This paper proposes a novel approach to accomplish the automatic segmentation of singing voice within music signals, based on the difference between the dynamic harmonic content of singing voice and that of musical instrument signals. The obtained results are compared with those of another approach proposed in the literature, considering the same music database. For both techniques, an accuracy rate around 80% is obtained, even using a more rigorous performance measure for our approach only. As an advantage, the new procedure presents lower computational complexity. In addition, we discuss other results obtained by extending the tests over the whole database (upholding the same performance level) and by discriminating the error types (boundaries shifted in time, insertion and deletion of singing segments). The analysis of these errors suggests some alternative ways of reducing them, as for example, to adopt a confidence level based on a minimum harmonic content for the input signals. In this way, considering only signals with confidence level equal to one, the obtained performance is improved to almost 87%.

Paper Details

Date Published: 25 October 2004
PDF: 10 pages
Proc. SPIE 5601, Internet Multimedia Management Systems V, (25 October 2004); doi: 10.1117/12.571280
Show Author Affiliations
Phabio J. Setubal, Univ. Federal de Santa Catarina (Brazil)
Sidnei Noceti Filho, Univ. Federal de Santa Catarina (Brazil)
Rui Seara, Univ. Federal de Santa Catarina (Brazil)

Published in SPIE Proceedings Vol. 5601:
Internet Multimedia Management Systems V
John R. Smith; Tong Zhang; Sethuraman Panchanathan, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?