• Aucun résultat trouvé

Multiple-F0 Estimation For MIREX 2010

N/A
N/A
Protected

Academic year: 2021

Partager "Multiple-F0 Estimation For MIREX 2010"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-01161264

https://hal.archives-ouvertes.fr/hal-01161264

Submitted on 2 Jul 2015

HAL

is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire

HAL, est

destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Copyright

Multiple-F0 Estimation For MIREX 2010

Chunghsin Yeh, Axel Roebel

To cite this version:

Chunghsin Yeh, Axel Roebel. Multiple-F0 Estimation For MIREX 2010. ISMIR/MIREX, Aug 2010,

Utrecht, France. pp.1-1. �hal-01161264�

(2)

MULTIPLE-F0 ESTIMATION FOR MIREX 2010

Chunghsin Yeh and Axel Roebel IRCAM/CNRS-STMS Paris, France

Analysis-Synthesis team

ABSTRACT

This extended abstract describes the system proposed for MIREX (Music Information Retrieval Evaluation eXchange) 2010 in the Multiple Fundamental Frequency Estima- tion and Tracking contest. The submitted system is based on the one submitted in 2009 with some modifications.

1. INTRODUCTION

The system is based on a frame-by-frame analysis with a tracking mechanism as post-processing. Two parameter settings for the noise level estimation and an optional note correction are submitted. There are four results generated for the first sub-task frame-by-frame evaluation: two pa- rameter settings with or without tracking, and four results generated for the second sub-task note contour evalua- tion: two parameter settings with or without note correc- tion.

2. NOISE LEVEL ESTIMATION

Noise level estimation serves to distinguish the sinusoids from noise in the observed spectrum. Different from the al- gorithm in [1], the distribution fit of the residual spectrum is based on the magnitude distribution of spectral bins. The combined skewness [2] of noise and sinusoidal bins are used as the statistical measure to test against the skewness of Rayleigh distribution. Thresholds for the skewness test and the residual energy are more coherently derived, too.

Two settings for the number of sinusoids to remove at each iteration is submitted: remove 1 or 20 sinusoids at a time.

3. JOINT EVALUATION OF F0 HYPOTHESES AND POLYPHONY INFERENCE

Similar to the candidate extraction method described in [1], the NHRF0s (non-harmonically related F0s) are iteratively extracted and all their harmonics are extracted as HRF0s (harmonically related F0s). Given a set of F0 hypotheses, the hypothetical sources are constructed by partial selec- tion and overlap treatment and the related combination is evaluated by a score function.

This document is licensed under the Creative Commons Attribution-Noncommercial-Share Alike 3.0 License.

http://creativecommons.org/licenses/by-nc-sa/3.0/

c

2010 C. Yeh and A. Roebel.

To infer the number of the sources, we propose an iter- ative combination method. The polyphony inference algo- rithm begins with the most likely F0 hypothesis and then progressively combined with other hypotheses. The scor- ing of hypothetical combinations is used to select the most plausible one and the contribution of an added F0 is veri- fied by the individual score criteria. An F0 hypothesis is considered valid if it either improves the envelope smooth- ness of the hypothetical sources that have partials overlap- ping with its partials, or explains a significant amount of salient peaks. The iteration stops when the most plausible combination does not improves either of the two criteria.

Different from an usual iterative approach, care has been taken to replace a subharmonic-like F0 with a more plau- sible one during the iteration.

4. SOURCE STREAM TRACKING

Source stream tracking aims at tracking the estimated F0s into note contours. Instead of tracking the intermediate F0 estimates (frame-based estimation), it is proposed to connect the F0 candidates across the frames to establish candidate trajectories [3]. The reason to establish candi- date trajectories beforehand is that candidate trajectories are more complete, which provides a good initial estimate of the source streams. However, a candidate trajectory may accidentally connect several notes of the same pitch when, for instance, the notes falling at a harmonic of a NHRF0.

Therefore, the connection is constrained by the intermedi- ate F0 estimates. Due to the frequency resolution limit at low frequencies, a simple correction is optionally applied to further segment the trajectories with respect to the MIDI note numbers.

5. REFERENCES

[1] Yeh, C., “Multiple fundamental frequency estimation of polyphonic recordings” Ph.D. thesis, Universit´e Paris VI, 2008.

[2] Raiford, T. E., “Skewness of combined distributions”, Journal of the American Statistical Association, vol.

37, no. 219, pp. 391393, September 1942.

[3] Chang, W.C., Su, W.Y., Yeh, C., Roebel, A., Rodet, X. “Multiple-F0 tracking based on a high-order HMM model”. Proc. of the 11th Intl. Conf. on Digital Audio Effects (DAFx-08), Espoo, Finland, 2008.

Références

Documents relatifs

Total revenues Service revenues thereof mobile service revenues thereof fixed-line service revenues Equipment revenues Other operating income EBITDA % of total revenues EBIT..

International operations Key performance indicators Financials in EUR million Total revenues Service revenues thereof mobile service revenues thereof fixed-line service

Group total costs and expenses rose by 2.5% year-on-year to EUR 742.9 mn in the second quarter of 2018 reported: +2.7%, mainly driven by higher cost of equipment, increased

In the Belarusian segment, EBITDA rose by 24.1% reported: +28.0% on a local currency basis in the first half of 2017 compared to the same period last year, as the increase in

Segment Bulgaria Key performance indicators Proforma view = Reported view Financials in EUR million Total revenues thereof total service revenues* Wireless revenues thereof

In the second quarter of 2016, Telekom Austria Group results benefitted mostly from strong operations in Austria, with additional support from the demand for data-centric services

As a result of higher revenues and lower operating expenses, EBITDA comparable rose by 6.1% year-on-year in the second quarter of 2015 excluding the above mentioned

Group revenue decline improved to -2.0% year-on-year as growth in Croatia and slowdown of decline in Austria mitigate losses in Bulgaria, Belarus and the Additional Markets