# Name: onsets_ISMIR2012_paper ## Contact: Sebastian Böck ## Description: collection of 321 audio excerpts with onset annotations ## Creator: Sebastian Böck and others ## Reference: Boeck_ISMIR_2012.pdf ----- ### Content: audio/ original audio files converted to .flac format annotations/onsets/ onset annotations annotations/giantsteps/ annotations in the GiantSteps project format annotations/.git/ git repo with annotations splits/ file splitting definitions ----- To convert the audio files to .wav use 'flac -d *.flac' If you use this dataset for comparison of the results, please cite the above paper. The same dataset is also used in the DAFx-13 and various other papers. ah_* taken from the data set introduced by Andre Holzapfel in: Three dimensions of pitched instrument onset detection. A. Holzapfel, Y. Stylianou, A.C. Gedik, and B. Bozkurt. IEEE Transactions on Audio, Speech, and Language Processing, 18(6), 2010 al_* taken from the data set introduced by Alexandre Lacoste in: A supervised classification algorithm for note onset detection. A. Lacoste, and D. Eck. EURASIP Journal on Applied Signal Processing, 1, 2007. It can be downloaded from: . The audio files are taken from the ISMIR 2004 ballroom data set located at: . api_* files / annotations by Antonio Pertusa Ibáñez http://grfia.dlsi.ua.es/cm/worklines/pertusa/onset/ODB/ ff123_* samples for testing audio codecs retrieved from http://ff123.net/samples.html gs_* files by Georgios Siamantas jpb_* taken from the data set introduced by Juan Pablo Bello in: A tutorial on onset detection in music signals. J. Bello, L. Daudet, S. Abdallah, C. Duxbury, M. Davies, and M. Sandler. IEEE Transactions on Speech and Audio Processing, 13(5), 2005 lame_* files used by the lame dev crew to test the lame mp3 encoder retrieved from http://lame.sourceforge.net/quality.php mck_* public part of Martin McKinney's beat/tempo set retrieved from: http://www.music-ir.org/mirex/wiki/2006:Audio_Tempo_Extraction mit_* files taken from the MIT media lab retrieved from http://sound.media.mit.edu/media.php sb_* taken from the data set introduced by Sebastian Böck in: Universal Onset Detection with Bidirectional Long Short-Term Memory Neural Networks. F. Eyben, S. Böck, B. Schuller, and A. Graves. Proceedings of the 11th International Society for Music Information Retrieval Conference (ISMIR), 2010. SoundCheck2_* files taken from the Alan Parson Sound Check 2 audio test CD vorbis_* files used for a OGG Vorbis listening test retrieved from http://hem.passagen.se/ingets1/vorbis.htm The origin of the violin sample is unknown. If you find this dataset useful, think about donating beer to the contact person :)