README.md 2.88 KB
Newer Older
Sebastian Böck's avatar
Sebastian Böck committed
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
# Name: onsets_ISMIR2012_paper

## Contact: Sebastian Böck <sebastian.boeck@jku.at>

## Description: collection of 321 audio excerpts with onset annotations

## Creator: Sebastian Böck and others

## Reference: Boeck_ISMIR_2012.pdf

-----

### Content:
audio/                  original audio files converted to .flac format
annotations/onsets/     onset annotations
annotations/giantsteps/ annotations in the GiantSteps project format
annotations/.git/       git repo with annotations 
splits/                 file splitting definitions

-----

To convert the audio files to .wav use 'flac -d *.flac'

If you use this dataset for comparison of the results, please cite the above
paper. The same dataset is also used in the DAFx-13 and various other papers.

ah_* taken from the data set introduced by Andre Holzapfel in:
  Three dimensions of pitched instrument onset detection.
  A. Holzapfel, Y. Stylianou, A.C. Gedik, and B. Bozkurt.
  IEEE Transactions on Audio, Speech, and Language Processing, 18(6), 2010

al_* taken from the data set introduced by Alexandre Lacoste in:
  A supervised classification algorithm for note onset detection.
  A. Lacoste, and D. Eck.
  EURASIP Journal on Applied Signal Processing, 1, 2007.
  It can be downloaded from:
  <http://w3.ift.ulaval.ca/~allac88/dataset.tar.gz>.
  The audio files are taken from the ISMIR 2004 ballroom data set located
  at: <http://mtg.upf.edu/ismir2004/contest/tempoContest/node5.html>.

api_*
  files / annotations by Antonio Pertusa Ibáñez
  http://grfia.dlsi.ua.es/cm/worklines/pertusa/onset/ODB/

ff123_*
  samples for testing audio codecs
  retrieved from http://ff123.net/samples.html

gs_*
  files by Georgios Siamantas

jpb_* taken from the data set introduced by Juan Pablo Bello in:
  A tutorial on onset detection in music signals.
  J. Bello, L. Daudet, S. Abdallah, C. Duxbury, M. Davies, and M. Sandler.
  IEEE Transactions on Speech and Audio Processing, 13(5), 2005

lame_*
  files used by the lame dev crew to test the lame mp3 encoder
  retrieved from http://lame.sourceforge.net/quality.php

mck_*
  public part of Martin McKinney's beat/tempo set
  retrieved from:
  http://www.music-ir.org/mirex/wiki/2006:Audio_Tempo_Extraction

mit_*
  files taken from the MIT media lab
  retrieved from http://sound.media.mit.edu/media.php

sb_* taken from the data set introduced by Sebastian Böck in:
  Universal Onset Detection with Bidirectional Long Short-Term Memory Neural
  Networks.
  F. Eyben, S. Böck, B. Schuller, and A. Graves.
  Proceedings of the 11th International Society for Music Information Retrieval
  Conference (ISMIR), 2010.

SoundCheck2_*
  files taken from the Alan Parson Sound Check 2 audio test CD

vorbis_*
  files used for a OGG Vorbis listening test
  retrieved from http://hem.passagen.se/ingets1/vorbis.htm

The origin of the violin sample is unknown.

If you find this dataset useful, think about donating beer to the contact
person :)