Detecting instruments in classical music: A view on timbre, musical features, and classification
Simmermacher, Christian
This item is not available in full-text via OUR Archive.
If you would like to read this item, please apply for an inter-library loan from the University of Otago via your local library.
If you are the author of this item, please contact us if you wish to discuss making the full text publicly available.
Cite this item:
Simmermacher, C. (2005, November 23). Detecting instruments in classical music: A view on timbre, musical features, and classification (Dissertation, Master of Science). Retrieved from http://hdl.handle.net/10523/1296
Permanent link to OUR Archive version:
http://hdl.handle.net/10523/1296
Abstract:
This thesis describes research on recognition of classical instruments from isolated tones and musical passages. The intention is to evaluate the robustness of the features and the efficiency of the data analysis methods to classify instruments in a dynamic environment.
The information extracted from the samples is based on relevant works in instrument detection and timbre studies, and on knowledge about instrument characteristics. For a better understanding of the perceptual derived features, the human auditory system is briefly reviewed and some perceptual phenomena are mentioned. Different feature extraction methods are chosen without applying sound source separation techniques.
These methods are based on auditory models and filters and are widely used for simulation of the human perception and speech detection. Furthermore, a standardised set of features from the MPEG-7 Instrument timbre description scheme is calculated from the harmonic spectrum of a sound. It is especially designed to capture the timbre of instrument tones.
Two experiments are discussed in this thesis. The first experiment is used to evaluate features and classification methods on a dataset of single tone samples from 20 instruments. A combination of perceptual features shows the highest accuracy with around 90% and a reduced collection of selected features proves most efficient. The second experiment is similar to the first, but it processes musical passages to distinguish between instruments from four major instrument families. Spectral features are predominantly used for this task. Nineteen features combined from the Mel-frequency cepstral coefficients and the MPEG-7 standard achieve an accuracy of around 94%. In both experiments a multilayer perceptron shows the best generalisation for the test set.
The dataset from the second experiment is also used for a user interface that implements the most robust feature extraction technique and the fastest data analysis method. It detects the dominant instrument in a ten second long passage and annotates the calculated information in XML-format.
Date:
2005-11-23
Advisor:
Deng, Da; Cranefield, Stephen
Degree Name:
Master of Science
Degree Discipline:
Information Science
Keywords:
recognition; classical instruments; isolated tones; music; data analysis; timbre; MPEG-7; distinguish between instruments; Mel-frequency cepstral coefficient
Research Type:
Dissertation