A Wavelet-Based Approach to the Discovery of Themes and Motives in Melodies - PowerPoint PPT Presentation

About This Presentation
Title:

A Wavelet-Based Approach to the Discovery of Themes and Motives in Melodies

Description:

A Wavelet-Based Approach to the Discovery of Themes and Motives in Melodies Gissel Velarde and David Meredith Aalborg University Department of Architecture, Design ... – PowerPoint PPT presentation

Number of Views:41
Avg rating:3.0/5.0
Slides: 27
Provided by: casa3160
Category:

less

Transcript and Presenter's Notes

Title: A Wavelet-Based Approach to the Discovery of Themes and Motives in Melodies


1
  • A Wavelet-Based Approach to the Discovery of
    Themes and Motives in Melodies
  • Gissel Velarde and David Meredith
  • Aalborg UniversityDepartment of Architecture,
    Design Media Technology
  • EuroMAC, September 2014

2
We present
  • A computational method submitted to the MIREX
    2014 Discovery of Repeated Themes Sections task
  • The results on the monophonic version of the JKU
    Patterns Development Database

3
Ground TruthBachs Fugue BWV 889
4
Ground Truth Chopins Mazurka Op. 24, No. 4
5
The idea behind the method
  • In the context of pattern discovery in monophonic
    pieces
  • With a good melodic structure in terms of
    segments, it should be possible to gather similar
    segments into clusters and rank their salience
    within the piece.

6
Considerations
  • a good melodic structure in terms of segments
  • Is considered to be closer to the ground truth
    analysis (See Collins, 2014)
  • It specifies certain segments or patterns
  • These patterns can be overlapping and hierarchical

7
Considerations
  • We also consider other aspects of the problem,
  • representation,
  • segmentation,
  • measuring similarity,
  • clustering of segments and
  • ranking segments according to salience

8
The method
  • The method
  • Follows and extends our approach to melodic
    segmentation and classification based on
    filtering with the Haar wavelet (Velarde, Weyde
    and Meredith, 2013)
  • Uses idea of computing a similarity matrix for
    window connectivity information from a generic
    motif discovery algorithm for sequential data
    (Jensen, Styczynski, Rigoutsos and
    Stephanopoulos, 2006)

9
Wavelet transform
  • Haar Wavelet

A family of functions is obtained by translations
and dilatations of the mother wavelet
  • The wavelet coefficients of the pitch vector v
    for scale s and shift u are defined as the inner
    product

10
Representation (Velarde et al. 2013)
New representation
11
First stage Segmentation (Velarde et al. 2013)
New segmentation
12
Segmentation
Constant segmentation, wavelet zero-crossings or
modulus maxima
First stage segmentation
Distance matrix given a measure
Comparison
Binarized distance matrix given a threshold
Concatenation
Contiguous similar diagonal segments are
concatenated
Comparison
Distance matrix given a measure
By agglomerative clusters from an agglomerative
hierarchical cluster tree
Clustering
Ranking
Criteria sum of the length of occurrences
13
Parameter combinations
  • We tested the following parameter combinations
  • MIDI pitch
  • Sampling rate 16 samples per qn
  • Representation
  • normalized pitch signal, wav coefficients, wav
    coefficients modulus
  • Scale representation at 1 qn
  • Segmentation
  • constant segmentation, zero crossings, modulus
    maxima
  • Scale segmentation at 1 and 4 qn
  • Threshold for concatenation 0, 0.1, 1
  • Distances
  • city-block, Euclidean, DTW
  • Agglomerative clusters from an agglomerative
    hierarchical cluster tree
  • Number of clusters 7
  • Ranking criterion Sum of the length of
    occurrences

14
Evaluation
  • As described at MIREX 2014Discovery of Repeated
    Themes Sections
  • establishment precision, establishment recall,
    and establishment F1 score
  • occurrence precision, occurrence recall, and
    occurrence F1 score
  • three-layer precision, three-layer recall, and
    three-layer F1 score
  • runtime, first five target proportion and first
    five precision
  • standard precision, recall, and F1 score

15
Results
  • On the JKU Patterns Development Database
    monophonic version
  • J. S. Bach, Fugue BWV 889,
  • Beethoven's Sonata Op. 2, No. 1, Movement 3,
  • Chopin's Mazurka Op. 24, No. 4,
  • Gibbons's Silver Swan, and
  • Mozart's Sonata K.282, Movement 2.
  • We selected best combinations according to
    representation and segmentation.

16
Results
Fig 1. Mean F1 score (mean(f1_est, f1_occ(c.75),
3L F1, f1_occ (c.5)) .
17
Results
Fig 2. Standard F1 score
18
Results
Fig 3. Mean Runtime per piece.
19
Our MIREX Submissions VM1 and VM2
  • Combinations selected based on
  • mean F1 score mean(F1_est, F1_occ(c.75), F1_3,
    F1_occ (c.5))
  • standard F1 score
  • VM1 differs from VM2 in the following parameters
  • Normalized pitch signal representation,
  • Constant segmentation at the scale of 1 qn,
  • Threshold for concatenation 0.1.
  • VM2 differs from VM1 in the following parameters
  • Wavelet coefficients representation filtered at
    the scale of 1 qn
  • Modulus maxima segmentation at the scale of 4 qn
  • Threshold for concatenation 1

20
Our MIREX Submissions
 Piece n_P n_Q P_est R_est F1_est P_occ R_occ F1_occ P_3 R_3 F1_3 Runtime FFTP_ FFP P_occ R_occ F1_occ P R F1
 Piece n_P n_Q P_est R_est F1_est (c.75) (c.75) (c.75) P_3 R_3 F1_3 (s) est FFP (c.5) (c.5) (c.5) P R F1
Bach 3 7 0.87 0.95 0.91 0.63 0.72 0.67 0.51 0.65 0.57 8.5 0.95 0.6 0.63 0.72 0.67 0.14 0.33 0.2
Beethoven 7 7 0.92 0.92 0.92 0.98 0.98 0.98 0.86 0.91 0.88 31 0.76 0.8 0.89 0.93 0.91 0.57 0.57 0.57
Chopin 4 7 0.53 0.86 0.66 0.66 0.86 0.75 0.48 0.7 0.57 34.2 0.68 0.47 0.46 0.83 0.6 0 0 0
Gibbons 8 7 0.95 0.95 0.95 0.66 0.93 0.77 0.85 0.79 0.82 17.76 0.77 0.79 0.66 0.93 0.77 0.29 0.25 0.27
Mozart 9 7 0.92 0.79 0.85 0.82 0.96 0.88 0.79 0.69 0.73 23.61 0.67 0.73 0.72 0.92 0.81 0.57 0.44 0.5
mean 6.2 7 0.84 0.89 0.86 0.75 0.89 0.81 0.7 0.75 0.71 23.01 0.77 0.68 0.67 0.87 0.75 0.31 0.32 0.31
SD 2.59 0 0.17 0.07 0.12 0.15 0.11 0.12 0.19 0.1 0.14 10.34 0.11 0.14 0.15 0.09 0.12 0.26 0.22 0.23
Table 1. Results of VM1 on the JKU Patterns
Development Database.
 Piece n_P n_Q P_est R_est F1_est P_occ R_occ F1_occ P_3 R_3 F1_3 Runtime FFTP_ FFP P_occ R_occ F1_occ P R F1
 Piece n_P n_Q P_est R_est F1_est (c.75) (c.75) (c.75) P_3 R_3 F1_3 (s) est FFP (c.5) (c.5) (c.5) P R F1
Bach 3 7 0.56 0.65 0.6 0.89 0.43 0.58 0.39 0.41 0.4 5.07 0.59 0.37 0.56 0.46 0.5 0 0 0
Beethoven 7 7 0.9 0.9 0.9 0.79 0.89 0.84 0.82 0.86 0.84 5.54 0.67 0.75 0.83 0.9 0.86 0 0 0
Chopin 4 7 0.58 0.86 0.69 0.69 0.83 0.75 0.53 0.78 0.64 5.83 0.65 0.44 0.67 0.65 0.66 0 0 0
Gibbons 8 7 0.92 0.88 0.9 0.79 0.84 0.82 0.81 0.73 0.77 2.22 0.7 0.76 0.72 0.69 0.71 0.14 0.13 0.13
Mozart 9 7 0.83 0.71 0.77 0.93 0.93 0.93 0.77 0.63 0.69 5.7 0.56 0.68 0.84 0.88 0.86 0 0 0
mean 6.2 7 0.76 0.8 0.77 0.82 0.78 0.78 0.66 0.68 0.67 4.87 0.63 0.6 0.72 0.71 0.72 0.03 0.03 0.03
SD 2.59 0 0.17 0.11 0.13 0.09 0.2 0.13 0.19 0.17 0.17 1.51 0.06 0.18 0.12 0.18 0.15 0.06 0.06 0.06
Table 2. Results of VM2 on the JKU Patterns
Development Database.
Three Layer F1, (?2(1)1.8, p0.1797)
-gtNo significant difference Standard
F1, (?2(1)4, p0.045)
-gtVM1 preferred Runtime, (?2(1)5,
p0.0253)
-gtVM2 preferred
21
Example Bach's Fugue BWV 889 prototypical
pattern
Example Bach's Fugue BWV 889 prototypical pattern
22
Observations
  • The segmentation stage makes more difference in
    the results, according to the parameters
  • In the first stage segmentation
  • The size of the scale affects the results for
    standard measures and runtimes
  • In the first comparison
  • Zero-crossings segmentation works best with DTW
  • DTW is much more expensive to compute

23
Observations
  • In the comparison (after segmentation),
    City-block is dominant
  • DTW in the comparison after segmentation is not
    in the best combinations
  • Maybe because there is no ritardando or
    accelerando in this dataset and/or representation
  • For standard measures and a smaller segmentation
    scale
  • Pitch signal works better than wavelet
    representation
  • For non standard measures and a larger
    segmentation scale
  • Modulus maxima performs slightly better than
    zero-crossings and constant segmentation

24
Conclusions
  • Our novel wavelet-based method outperforms the
    methods reported by Meredith (2013) and Nieto
    Farbood (2013) on the monophonic version of the
    JKU PDD training dataset, scoring higher on
    precision, recall and F1 score, and reporting
    faster runtimes.

25
Conclusions
  • The segmentation stage makes more difference in
    the results, according to the parameters
  • A small scale for first stage segmentation should
    be preferable for higher values of the standard
    measures and a large scale should be preferable
    for runtime computation.
  • City-block should be preferable after segmentation

26
References
  • 1 T. Collins. Mirex 2014 competition Discovery
    of repeated themes and sections, 2014.
    http//www.music-ir.org/mirex/wiki/2014Discovery_
    of_Repeated_Themes_26_Sections. Accessed on 12
    May 2014.
  • 2 K. Jensen, M. Styczynski, I. Rigoutsos and G.
    Stephanopoulos A generic motif discovery
    algorithm for sequential data, Bioinformatics,
    221, pp. 21-28, 2006.
  • 3 D. Meredith. COSIATEC and SIATECCompress
    Pattern discovery by geometric compression,
    Competition on Discovery of Repeated Themes and
    Sections, MIREX 2013, Curitiba, Brazil, 2013.
  • 4 O. Nieto, and M. Farbood. Discovering
    Musical Patterns Using Audio Structural
    Segmentation Techniques. Competition on Discovery
    of Repeated Themes and Sections, MIREX 2013,
    Curitiba, Brazil, 2013
  • 5 G. Velarde, T. Weyde and D. Meredith An
    approach to melodic segmentation and
    classification based on filtering with the
    Haar-wavelet, Journal of New Music Research,
    424, 325-345, 2013.
Write a Comment
User Comments (0)
About PowerShow.com