1 <!DOCTYPE html PUBLIC
"-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
4 <meta http-equiv=
"Content-Type" content=
"text/html; charset=UTF-8">
5 <meta http-equiv=
"Content-Style-Type" content=
"text/css">
7 <meta name=
"Generator" content=
"Cocoa HTML Writer">
8 <meta name=
"CocoaVersion" content=
"824.44">
9 <style type=
"text/css">
10 p
.p1
{margin: 0.0px 0.0px 0.0px 0.0px; font: 18.0px Helvetica
}
11 p
.p2
{margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica
; min-height: 14.0px}
12 p
.p3
{margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica
}
13 p
.p4
{margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica
; color: #000000; min-height: 14.0px}
14 p
.p5
{margin: 0.0px 0.0px 0.0px 0.0px; font: 9.0px Monaco
; color: #000000; min-height: 12.0px}
15 p
.p6
{margin: 0.0px 0.0px 0.0px 0.0px; font: 9.0px Monaco
; color: #bf0000}
16 p
.p7
{margin: 0.0px 0.0px 0.0px 0.0px; font: 9.0px Monaco
; color: #606060}
17 p
.p8
{margin: 0.0px 0.0px 0.0px 0.0px; font: 9.0px Monaco
; color: #000000}
18 p
.p9
{margin: 0.0px 0.0px 0.0px 0.0px; font: 9.0px Monaco
; color: #007300}
19 p
.p10
{margin: 0.0px 0.0px 0.0px 0.0px; font: 9.0px Monaco
; min-height: 12.0px}
20 p
.p11
{margin: 0.0px 0.0px 0.0px 0.0px; font: 9.0px Monaco
; color: #952415}
21 p
.p12
{margin: 0.0px 0.0px 0.0px 0.0px; font: 9.0px Monaco
}
22 span
.s1
{color: #000000}
23 span
.s2
{color: #0000bf}
24 span
.s3
{color: #007300}
25 span
.s4
{color: #bf0000}
26 span
.s5
{color: #0a20b3}
27 span
.Apple-tab-span
{white-space:pre
}
31 <p class=
"p1"><b>BeatTrack2
<span class=
"Apple-tab-span"> </span><span class=
"Apple-tab-span"> </span><span class=
"Apple-tab-span"> </span><span class=
"Apple-tab-span"> </span><span class=
"Apple-tab-span"> </span><span class=
"Apple-tab-span"> </span>template matching beat tracker
</b></p>
32 <p class=
"p2"><br></p>
33 <p class=
"p3"><b>#beattick, eighthtick, groovetick, tempo, phase, groove = BeatTrack2.kr(busindex, numfeatures, windowsize=
2, phaseaccuracy=
0.02, lock=
0, weightingscheme)
</b></p>
34 <p class=
"p2"><br></p>
35 <p class=
"p3"><b>busindex [sk]-
</b>Audio input to track, already analysed into N features, passed in via a control bus number from which to retrieve consecutive streams.
<span class=
"Apple-converted-space"> </span></p>
36 <p class=
"p2"><br></p>
37 <p class=
"p3"><b>numfeatures [s]
</b> How many features (ie how many control buses) are provided
</p>
38 <p class=
"p2"><br></p>
39 <p class=
"p3"><b>windowsize [s]
</b> Size of the temporal window desired (
2.0 to
3.0 seconds models the human temporal window). You might use longer values for stability of estimate at the expense of reactiveness.
<span class=
"Apple-converted-space"> </span></p>
40 <p class=
"p2"><br></p>
41 <p class=
"p3"><b>phaseaccuracy [s]
</b> Relates to how many different phases to test. At the default,
50 different phases spaced by
<b>phaseaccuracy
</b> seconds would be tried out for
60bpm;
16 would be trialed for
180 bpm. Larger phaseaccuracy means more tests and more CPU cost.
<span class=
"Apple-converted-space"> </span></p>
42 <p class=
"p2"><br></p>
43 <p class=
"p3"><b>lock [sk]-
</b>If this argument is greater than
0.5, the tracker will lock at its current periodicity and continue from the current phase. Whilst it updates the model's phase and period, this is not reflected in the output until lock goes back below
0.5.
<span class=
"Apple-converted-space"> </span></p>
44 <p class=
"p2"><br></p>
45 <p class=
"p3"><b>weightingscheme [s]
</b> Use (-
2.5) for flat weighting of tempi, (-
1.5) for compensation weighting based on the number of events tested (because different periods allow different numbers of events within the temporal window) or otherwise a bufnum from
0 upwards for passing an array of
120 individual tempo weights; tempi go from
60 to
179 bpm in steps of one bpm, so you must have a buffer of
120 values.
<span class=
"Apple-converted-space"> </span></p>
46 <p class=
"p2"><br></p>
47 <p class=
"p3">This beat tracker is based on exhaustively testing particular template patterns against feature streams; the testing takes place every
0.5 seconds. The two basic templates are a straight (
<b>groove
</b>=
0) and a swung triplet (
<b>groove
</b>=
1) pattern of
16th notes; this pattern is tried out at scalings corresponding to the tempi from
60 to
180 bpm. This is the cross-corellation method of beat tracking. A majority vote is taken on the best tempo detected, but this must be confirmed by a consistency check after a phase estimate. Such a consistency check helps to avoid wild fluctuating estimates, but is at the expense of an additional half second delay. The latency of the beat tracker with default settings is thus at least
2.5 seconds; because of block-based amortisation of calculation, it is actually around
2.8 seconds latency for a
2.0 second temporal window.
<span class=
"Apple-converted-space"> </span></p>
48 <p class=
"p2"><br></p>
49 <p class=
"p3">This beat tracker is designed to be flexible for user needs; you can try out different window sizes, tempo weights and combinations of features. However, there are no guarantees on stability and effectiveness, and you will need to explore such parameters for a particular situation.
<span class=
"Apple-converted-space"> </span></p>
50 <p class=
"p4"><br></p>
51 <p class=
"p5"><br></p>
52 <p class=
"p6">//required for MFCCs used below
</p>
53 <p class=
"p6"><span class=
"s1">b =
</span><span class=
"s2">Buffer
</span><span class=
"s1">.alloc(s,
1024,
1);
</span>//for sampling rates
44100 and
48000</p>
54 <p class=
"p6">//b = Buffer.alloc(s,
2048,
1); //for sampling rates
88200 and
96000</p>
55 <p class=
"p5"><br></p>
56 <p class=
"p6">//this is a one minute pop song; you should load something equivalent for testing
</p>
57 <p class=
"p7"><span class=
"s1">d=
</span><span class=
"s2">Buffer
</span><span class=
"s1">.read(s,
</span>"/Volumes/data/stevebeattrack/samples/100.wav"<span class=
"s1">);
</span></p>
58 <p class=
"p7"><span class=
"s1">d=
</span><span class=
"s2">Buffer
</span><span class=
"s1">.read(s,
</span>"/Volumes/data/stevebeattrack/samples/019.wav"<span class=
"s1">);
</span></p>
59 <p class=
"p5"><br></p>
60 <p class=
"p5"><br></p>
61 <p class=
"p6">//very feature dependent
</p>
63 <p class=
"p8">a=
<span class=
"s2">SynthDef
</span>(
<span class=
"s3">\help_beattrack2_1
</span>,{
<span class=
"s2">arg
</span> vol=
1.0, beepvol=
1.0, lock=
0;
</p>
64 <p class=
"p8"><span class=
"s2">var
</span> in, kbus;
<span class=
"Apple-converted-space"> </span></p>
65 <p class=
"p8"><span class=
"s2">var
</span> trackb,trackh,trackq,tempo, phase, period, groove;
</p>
66 <p class=
"p8"><span class=
"s2">var
</span> bsound,hsound,qsound, beep;
</p>
67 <p class=
"p8"><span class=
"s2">var
</span> fft;
<span class=
"Apple-converted-space"> </span></p>
68 <p class=
"p8"><span class=
"s2">var
</span> feature1, feature2, feature3;
<span class=
"Apple-converted-space"> </span></p>
69 <p class=
"p5"><br></p>
70 <p class=
"p8">in=
<span class=
"s2">PlayBuf
</span>.ar(
1,d.bufnum,
<span class=
"s2">BufRateScale
</span>.kr(d.bufnum),
1,
0,
1);
</p>
71 <p class=
"p6">//in = SoundIn.ar(
0);
<span class=
"Apple-converted-space"> </span></p>
72 <p class=
"p5"><br></p>
73 <p class=
"p6">//Create some features
</p>
74 <p class=
"p8">fft =
<span class=
"s2">FFT
</span>(b.bufnum, in);
</p>
75 <p class=
"p5"><br></p>
76 <p class=
"p8">feature1=
<span class=
"s2">RunningSum
</span>.rms(in,
64);
</p>
77 <p class=
"p8">feature2=
<span class=
"s2">MFCC
</span>.kr(fft,
2);
<span class=
"s4">//two coefficients
</span></p>
78 <p class=
"p8">feature3=
<span class=
"s2">A2K
</span>.kr(
<span class=
"s2">LPF
</span>.ar(in,
1000));
</p>
79 <p class=
"p5"><br></p>
80 <p class=
"p8">kbus=
<span class=
"s2">Out
</span>.kr(
0, [feature1, feature3]++feature2);
<span class=
"Apple-converted-space"> </span></p>
81 <p class=
"p5"><br></p>
82 <p class=
"p6">//Look at four features
</p>
83 <p class=
"p8">#trackb,trackh,trackq,tempo, phase, period, groove=
<span class=
"s2">BeatTrack2
</span>.kr(
0,
4,
2.0,
0.02, lock, -
2.5);
</p>
84 <p class=
"p5"><br></p>
85 <p class=
"p8">beep=
<span class=
"s2">SinOsc
</span>.ar(
1000,
0.0,
<span class=
"s2">Decay
</span>.kr(trackb,
0.1));
</p>
86 <p class=
"p5"><br></p>
87 <p class=
"p8"><span class=
"s2">Out
</span>.ar(
0,
<span class=
"s2">Pan2
</span>.ar((vol*in)+(beepvol*beep),
0.0));
</p>
88 <p class=
"p8">}).play
</p>
90 <p class=
"p5"><br></p>
91 <p class=
"p5"><br></p>
92 <p class=
"p8">a.set(
<span class=
"s3">\vol
</span>,
0.0);
</p>
93 <p class=
"p8">a.set(
<span class=
"s3">\vol
</span>,
1.0);
</p>
94 <p class=
"p5"><br></p>
95 <p class=
"p8">a.set(
<span class=
"s3">\beepvol
</span>,
1.0);
</p>
96 <p class=
"p8">a.set(
<span class=
"s3">\beepvol
</span>,
0.0);
</p>
97 <p class=
"p5"><br></p>
98 <p class=
"p6"><span class=
"s1">a.set(
</span><span class=
"s3">\lock
</span><span class=
"s1">,
1);
</span>//fix it rigidly from current phase/period solution
</p>
99 <p class=
"p6"><span class=
"s1">a.set(
</span><span class=
"s3">\lock
</span><span class=
"s1">,
0);
</span>//unfix, back to tracking
</p>
100 <p class=
"p5"><br></p>
101 <p class=
"p5"><br></p>
102 <p class=
"p5"><br></p>
103 <p class=
"p5"><br></p>
104 <p class=
"p5"><br></p>
105 <p class=
"p5"><br></p>
106 <p class=
"p5"><br></p>
107 <p class=
"p5"><br></p>
108 <p class=
"p8">a.free;
</p>
109 <p class=
"p5"><br></p>
110 <p class=
"p6">//same thing, trying with Onsets UGen raw output
</p>
112 <p class=
"p8">a=
<span class=
"s2">SynthDef
</span>(
<span class=
"s3">\help_beattrack2_1
</span>,{
<span class=
"s2">arg
</span> vol=
1.0, beepvol=
1.0, lock=
0;
</p>
113 <p class=
"p8"><span class=
"s2">var
</span> in, kbus;
<span class=
"Apple-converted-space"> </span></p>
114 <p class=
"p8"><span class=
"s2">var
</span> trackb,trackh,trackq,tempo, phase, period, groove;
</p>
115 <p class=
"p8"><span class=
"s2">var
</span> bsound,hsound,qsound, beep;
</p>
116 <p class=
"p8"><span class=
"s2">var
</span> fft;
<span class=
"Apple-converted-space"> </span></p>
117 <p class=
"p8"><span class=
"s2">var
</span> feature1, feature2, feature3;
<span class=
"Apple-converted-space"> </span></p>
118 <p class=
"p5"><br></p>
119 <p class=
"p8">in=
<span class=
"s2">PlayBuf
</span>.ar(
1,d.bufnum,
<span class=
"s2">BufRateScale
</span>.kr(d.bufnum),
1,
0,
1);
</p>
120 <p class=
"p6">//in = SoundIn.ar(
0);
<span class=
"Apple-converted-space"> </span></p>
121 <p class=
"p5"><br></p>
122 <p class=
"p6">//Create some features
</p>
123 <p class=
"p8">fft =
<span class=
"s2">FFT
</span>(b.bufnum, in);
</p>
124 <p class=
"p5"><br></p>
125 <p class=
"p8">feature1=
<span class=
"s2">Onsets
</span>.kr(fft,odftype:
<span class=
"s3">\mkl
</span>, rawodf:
1);
</p>
126 <p class=
"p5"><br></p>
127 <p class=
"p8">feature2=
<span class=
"s2">Onsets
</span>.kr(fft,odftype:
<span class=
"s3">\complex
</span>, rawodf:
1);
<span class=
"s4">//two coefficients
</span></p>
128 <p class=
"p5"><br></p>
129 <p class=
"p8">kbus=
<span class=
"s2">Out
</span>.kr(
0, [feature1,feature2]);
<span class=
"Apple-converted-space"> </span></p>
130 <p class=
"p5"><br></p>
131 <p class=
"p6">//Look at four features
</p>
132 <p class=
"p8">#trackb,trackh,trackq,tempo, phase, period, groove=
<span class=
"s2">BeatTrack2
</span>.kr(
0,
2,
3.0,
0.02, lock, -
2.5);
</p>
133 <p class=
"p5"><br></p>
134 <p class=
"p8">beep=
<span class=
"s2">SinOsc
</span>.ar(
1000,
0.0,
<span class=
"s2">Decay
</span>.kr(trackb,
0.1));
</p>
135 <p class=
"p5"><br></p>
136 <p class=
"p8"><span class=
"s2">Out
</span>.ar(
0,
<span class=
"s2">Pan2
</span>.ar((vol*in)+(beepvol*beep),
0.0));
</p>
137 <p class=
"p8">}).play
</p>
139 <p class=
"p5"><br></p>
140 <p class=
"p5"><br></p>
141 <p class=
"p5"><br></p>
142 <p class=
"p5"><br></p>
143 <p class=
"p5"><br></p>
144 <p class=
"p5"><br></p>
145 <p class=
"p5"><br></p>
146 <p class=
"p5"><br></p>
147 <p class=
"p6">//favour higher tempi in own weighting scheme
</p>
149 <p class=
"p8">c=
<span class=
"s2">Array
</span>.fill(
120,{
<span class=
"s2">arg
</span> i;
0.5+(
0.5*(i/
120))});
</p>
150 <p class=
"p8">e=
<span class=
"s2">Buffer
</span>.sendCollection(s, c,
1);
</p>
152 <p class=
"p5"><br></p>
153 <p class=
"p5"><br></p>
154 <p class=
"p6">//track audio in (try clapping a beat or beatboxing, but allow up to
6 seconds for tracking to begin) and spawning stuff at quarters, eighths and sixteenths
</p>
156 <p class=
"p9"><span class=
"s2">SynthDef
</span><span class=
"s1">(
</span>\help_beattrack2_2
<span class=
"s1">,{
</span></p>
157 <p class=
"p8"><span class=
"s2">var
</span> trackb,trackh,trackq,tempo;
</p>
158 <p class=
"p8"><span class=
"s2">var
</span> source, kbus;
</p>
159 <p class=
"p8"><span class=
"s2">var
</span> bsound,hsound,qsound;
</p>
160 <p class=
"p5"><br></p>
161 <p class=
"p6"><span class=
"s1">source=
</span><span class=
"s2">SoundIn
</span><span class=
"s1">.ar(
0);
</span>//PlayBuf.ar(
1,d.bufnum,BufRateScale.kr(d.bufnum),
1,
0,
1);
</p>
162 <p class=
"p5"><br></p>
163 <p class=
"p6">//downsampling automatic via kr from ar
</p>
164 <p class=
"p6"><span class=
"s1">kbus=
</span><span class=
"s2">Out
</span><span class=
"s1">.kr(
0,
</span><span class=
"s2">LPF
</span><span class=
"s1">.ar(source,
1000));
</span>//([feature1, feature3]++feature2);
<span class=
"Apple-converted-space"> </span></p>
165 <p class=
"p5"><br></p>
166 <p class=
"p8">#trackb,trackh,trackq,tempo=
<span class=
"s2">BeatTrack2
</span>.kr(
0,
1,weightingscheme:e.bufnum);
</p>
167 <p class=
"p5"><br></p>
168 <p class=
"p8">bsound=
<span class=
"s2">Pan2
</span>.ar(
<span class=
"s2">LPF
</span>.ar(
<span class=
"s2">WhiteNoise
</span>.ar*(
<span class=
"s2">Decay
</span>.kr(trackb,
0.05)),
1000),
0.0);
</p>
169 <p class=
"p5"><br></p>
170 <p class=
"p8">hsound=
<span class=
"s2">Pan2
</span>.ar(
<span class=
"s2">BPF
</span>.ar(
<span class=
"s2">WhiteNoise
</span>.ar*(
<span class=
"s2">Decay
</span>.kr(trackh,
0.05)),
3000,
0.66),-
0.5);
</p>
171 <p class=
"p5"><br></p>
172 <p class=
"p8">qsound=
<span class=
"s2">Pan2
</span>.ar(
<span class=
"s2">HPF
</span>.ar(
<span class=
"s2">WhiteNoise
</span>.ar*(
<span class=
"s2">Decay
</span>.kr(trackq,
0.05)),
5000),
0.5);
</p>
173 <p class=
"p5"><br></p>
174 <p class=
"p8"><span class=
"s2">Out
</span>.ar(
0, source + bsound+hsound+qsound);
</p>
175 <p class=
"p8">}).play;
</p>
177 <p class=
"p10"><br></p>
178 <p class=
"p10"><br></p>
179 <p class=
"p10"><br></p>
180 <p class=
"p10"><br></p>
181 <p class=
"p10"><br></p>
182 <p class=
"p10"><br></p>
183 <p class=
"p10"><br></p>
184 <p class=
"p10"><br></p>
185 <p class=
"p10"><br></p>
186 <p class=
"p10"><br></p>
187 <p class=
"p11">//Research note: Designed by Nick Collins following work by Jean Laroche
</p>
188 <p class=
"p10"><br></p>
189 <p class=
"p11">//geometric tempo placement very similar to linear, and linear easier to deal with looking up related tempi at double and half speed
</p>
191 <p class=
"p12"><span class=
"s5">var
</span> startbps=
1, endbps=
3;
<span class=
"Apple-converted-space"> </span></p>
192 <p class=
"p12"><span class=
"s5">var
</span> numtempi=
100;
</p>
193 <p class=
"p12"><span class=
"s5">var
</span> ratio;
<span class=
"Apple-converted-space"> </span></p>
194 <p class=
"p12"><span class=
"s5">var
</span> tempi, periods;
<span class=
"Apple-converted-space"> </span></p>
195 <p class=
"p10"><br></p>
196 <p class=
"p12">ratio= (endbps/startbps)**((numtempi-
1).reciprocal);
<span class=
"Apple-converted-space"> </span></p>
197 <p class=
"p10"><br></p>
198 <p class=
"p12">tempi=
<span class=
"s5">Array
</span>.geom(numtempi, startbps, ratio);
<span class=
"Apple-converted-space"> </span></p>
199 <p class=
"p10"><br></p>
200 <p class=
"p12">periods= tempi.reciprocal;
</p>
201 <p class=
"p10"><br></p>
202 <p class=
"p12"><span class=
"s5">Post
</span> << (tempi*
60)
<< nl;
<span class=
"Apple-converted-space"> </span></p>
203 <p class=
"p12"><span class=
"s5">Post
</span> << periods
<< nl;
<span class=
"Apple-converted-space"> </span></p>
205 <p class=
"p10"><br></p>
206 <p class=
"p11">//create linear periods
</p>
207 <p class=
"p12"><span class=
"s5">Post
</span> << ((
<span class=
"s5">Array
</span>.series(
120,
1,
2/
120)).reciprocal)
<< nl;
<span class=
"Apple-converted-space"> </span></p>
208 <p class=
"p10"><br></p>
209 <p class=
"p11">//tempo weights
</p>
210 <p class=
"p12"><span class=
"Apple-converted-space"> </span><span class=
"s5">Post
</span> << (
<span class=
"s5">Array
</span>.fill(
120,{
<span class=
"s5">arg
</span> i;
<span class=
"Apple-converted-space"> </span>0.2*((
1.0- ((abs(i-
60))/
60.0))**
0.5) +
0.8; }))
<< nl;
<span class=
"Apple-converted-space"> </span></p>
211 <p class=
"p10"><br></p>