Algorithms reference¶

Analyzes predominant periodicities in a signal given its novelty curve [1] (see NoveltyCurve algorithm) or another onset detection function (see OnsetDetection and OnsetDetectionGlobal)

BpmHistogramDescriptors¶

(standard, streaming)

Computes beats per minute histogram and its statistics for the highest and second highest peak

BpmRubato¶

(standard, streaming)

Extracts the locations of large tempo changes from a list of beat ticks

Danceability¶

(standard, streaming)

Estimates danceability of a given audio signal

HarmonicBpm¶

(standard, streaming)

Extracts bpms that are harmonically related to the tempo given by the ‘bpm’ parameter

LoopBpmConfidence¶

(standard, streaming)

Takes an audio signal and a BPM estimate for that signal and predicts the reliability of the BPM estimate in a value from 0 to 1

LoopBpmEstimator¶

(standard, streaming)

Estimates the BPM of audio loops

Meter¶

(standard, streaming)

Estimates the time signature of a given beatogram by finding the highest correlation between beats

NoveltyCurve¶

(standard, streaming)

Computes the “novelty curve” (Grosche & Müller, 2009) onset detection function

NoveltyCurveFixedBpmEstimator¶

(standard)

Outputs a histogram of the most probable bpms assuming the signal has constant tempo given the novelty curve

OnsetDetection¶

(standard, streaming)

Computes various onset detection functions

OnsetDetectionGlobal¶

(standard, streaming)

Computes various onset detection functions

OnsetRate¶

(standard, streaming)

Computes the number of onsets per second and their position in time for an audio signal

Onsets¶

(standard, streaming)

Computes onset positions given various onset detection functions

PercivalBpmEstimator¶

(standard, streaming)

Estimates the tempo in beats per minute (BPM) from an input signal as described in [1]

PercivalEnhanceHarmonics¶

(standard, streaming)

Implements the ‘Enhance Harmonics’ step as described in [1]

PercivalEvaluatePulseTrains¶

(standard, streaming)

Implements the ‘Evaluate Pulse Trains’ step as described in [1]

RhythmDescriptors¶

(standard, streaming)

Computes rhythm features (bpm, beat positions, beat histogram peaks) for an audio signal

RhythmExtractor¶

(standard, streaming)

Estimates the tempo in bpm and beat positions given an audio signal

RhythmExtractor2013¶

(standard, streaming)

Extracts the beat positions and estimates their confidence as well as tempo in bpm for an audio signal

RhythmTransform¶

(standard, streaming)

Implements the rhythm transform

SingleBeatLoudness¶

(standard, streaming)

Computes the spectrum energy of a single beat across the whole frequency range and on each specified frequency band given an audio segment

SuperFluxExtractor¶

(standard, streaming)

Detects onsets given an audio signal using SuperFlux algorithm

SuperFluxNovelty¶

(standard, streaming)

Onset detection function for Superflux algorithm

SuperFluxPeaks¶

(standard, streaming)

Detects peaks of an onset detection function computed by the SuperFluxNovelty algorithm

TempoCNN¶

(standard, streaming)

Estimates tempo using TempoCNN-based models

TempoScaleBands¶

(standard, streaming)

Computes features for tempo tracking to be used with the TempoTap algorithm

TempoTap¶

(standard, streaming)

Estimates the periods and phases of a periodic signal, represented by a sequence of values of any number of detection functions, such as energy bands, onsets locations, etc

TempoTapDegara¶

(standard, streaming)

Estimates beat positions given an onset detection function

TempoTapMaxAgreement¶

(standard, streaming)

Outputs beat positions and confidence of their estimation based on the maximum mutual agreement between beat candidates estimated by different beat trackers (or using different features)

TempoTapTicks¶

(standard, streaming)

Builds the list of ticks from the period and phase candidates given by the TempoTap algorithm

Math¶

CartesianToPolar¶

(standard, streaming)

Converts an array of complex numbers from cartesian to polar form

Magnitude¶

(standard, streaming)

Computes the absolute value of each element in a vector of complex numbers

PolarToCartesian¶

(standard, streaming)

Converts an array of complex numbers from polar to cartesian form

Statistics¶

CentralMoments¶

(standard, streaming)

Extracts the 0th, 1st, 2nd, 3rd and 4th central moments of an array

Centroid¶

(standard, streaming)

Computes the centroid of an array

Crest¶

(standard, streaming)

Computes the crest of an array

Decrease¶

(standard, streaming)

Computes the decrease of an array defined as the linear regression coefficient

DistributionShape¶

(standard, streaming)

Computes the spread (variance), skewness and kurtosis of an array given its central moments

Energy¶

(standard, streaming)

Computes the energy of an array

Entropy¶

(standard, streaming)

Computes the Shannon entropy of an array

Flatness¶

(standard, streaming)

Computes the flatness of an array, which is defined as the ratio between the geometric mean and the arithmetic mean

GeometricMean¶

(standard, streaming)

Computes the geometric mean of an array of positive values

Histogram¶

(standard, streaming)

Computes a histogram

InstantPower¶

(standard, streaming)

Computes the instant power of an array

Mean¶

(standard, streaming)

Computes the mean of an array

Median¶

(standard, streaming)

Computes the median of an array

PoolAggregator¶

(standard, streaming)

Performs statistical aggregation on a Pool and places the results of the aggregation into a new Pool

PowerMean¶

(standard, streaming)

Computes the power mean of an array

RMS¶

(standard, streaming)

Computes the root mean square (quadratic mean) of an array

RawMoments¶

(standard, streaming)

Computes the first 5 raw moments of an array

SingleGaussian¶

(standard, streaming)

Estimates the single gaussian distribution for a matrix of feature vectors

Variance¶

(standard, streaming)

Computes the variance of an array

Viterbi¶

(standard, streaming)

Estimates the most-likely path by Viterbi algorithm

Tonal¶

ChordsDescriptors¶

(standard, streaming)

Given a chord progression this algorithm describes it by means of key, scale, histogram, and rate of change

ChordsDetection¶

(standard, streaming)

Estimates chords given an input sequence of harmonic pitch class profiles (HPCPs)

ChordsDetectionBeats¶

(standard)

Estimates chords using pitch profile classes on segments between beats

Chromagram¶

(standard, streaming)

Computes the Constant-Q chromagram using FFT

Dissonance¶

(standard, streaming)

Computes the sensory dissonance of an audio signal given its spectral peaks

HPCP¶

(standard, streaming)

Computes a Harmonic Pitch Class Profile (HPCP) from the spectral peaks of a signal

HarmonicPeaks¶

(standard, streaming)

Finds the harmonic peaks of a signal given its spectral peaks and its fundamental frequency

HighResolutionFeatures¶

(standard, streaming)

Computes high-resolution chroma features from an HPCP vector

Inharmonicity¶

(standard, streaming)

Calculates the inharmonicity of a signal given its spectral peaks

Key¶

(standard, streaming)

Computes key estimate given a pitch class profile (HPCP)

KeyExtractor¶

(standard, streaming)

Extracts key/scale for an audio signal

NNLSChroma¶

(standard, streaming)

Extracts treble and bass chromagrams from a sequence of log-frequency spectrum frames

OddToEvenHarmonicEnergyRatio¶

(standard, streaming)

Computes the ratio between a signal’s odd and even harmonic energy given the signal’s harmonic peaks

PitchSalience¶

(standard, streaming)

Computes the pitch salience of a spectrum

SpectrumCQ¶

(standard, streaming)

Computes the magnitude of the Constant-Q spectrum

TonalExtractor¶

(standard, streaming)

Computes tonal features for an audio signal

TonicIndianArtMusic¶

(standard)

Estimates the tonic frequency of the lead artist in Indian art music

Tristimulus¶

(standard, streaming)

Calculates the tristimulus of a signal given its harmonic peaks

TuningFrequency¶

(standard, streaming)

Estimates the tuning frequency give a sequence/set of spectral peaks

TuningFrequencyExtractor¶

(standard, streaming)

Extracts the tuning frequency of an audio signal

Music Similarity¶

ChromaCrossSimilarity¶

(standard, streaming)

Computes a binary cross similarity matrix from two chromagam feature vectors of a query and reference song

CoverSongSimilarity¶

(standard, streaming)

Computes a cover song similiarity measure from a binary cross similarity matrix input between two chroma vectors of a query and reference song using various alignment constraints of smith-waterman local-alignment algorithm

CrossSimilarityMatrix¶

(standard)

Computes a euclidean cross-similarity matrix of two sequences of frame features

Fingerprinting¶

Chromaprinter¶

(standard, streaming)

Computes the fingerprint of the input signal using Chromaprint algorithm

Audio Problems¶

ClickDetector¶

(standard, streaming)

Detects the locations of impulsive noises (clicks and pops) on the input audio frame

DiscontinuityDetector¶

(standard, streaming)

Uses LPC and some heuristics to detect discontinuities in an audio signal

FalseStereoDetector¶

(standard, streaming)

Detects if a stereo track has duplicated channels (false stereo)

GapsDetector¶

(standard, streaming)

Uses energy and time thresholds to detect gaps in the waveform

HumDetector¶

(standard, streaming)

Detects low frequency tonal noises in the audio signal

NoiseBurstDetector¶

(standard, streaming)

Detects noise bursts in the waveform by thresholding the peaks of the second derivative

SNR¶

(standard, streaming)

Computes the SNR of the input audio in a frame-wise manner

SaturationDetector¶

(standard, streaming)

This algorithm outputs the staring/ending locations of the saturated regions in seconds

StartStopCut¶

(standard, streaming)

Outputs if there is a cut at the beginning or at the end of the audio by locating the first and last non-silent frames and comparing their positions to the actual beginning and end of the audio

TruePeakDetector¶

(standard, streaming)

Implements a “true-peak” level meter for clipping detection

Duration/silence¶

Duration¶

(standard, streaming)

Outputs the total duration of an audio signal

EffectiveDuration¶

(standard, streaming)

Computes the effective duration of an envelope signal

FadeDetection¶

(standard, streaming)

Detects fade-in and fade-outs time positions in an audio signal given a sequence of RMS values

SilenceRate¶

(standard, streaming)

Estimates if a frame is silent

StartStopSilence¶

(standard, streaming)

Outputs the frame at which sound begins and the frame at which sound ends

Loudness/dynamics¶

DynamicComplexity¶

(standard, streaming)

Computes the dynamic complexity defined as the average absolute deviation from the global loudness level estimate on the dB scale

Intensity¶

(standard)

Classifies the input audio signal as either relaxed (-1), moderate (0), or aggressive (1)

Larm¶

(standard, streaming)

Estimates the long-term loudness of an audio signal

Leq¶

(standard, streaming)

Computes the Equivalent sound level (Leq) of an audio signal

LevelExtractor¶

(standard, streaming)

Extracts the loudness of an audio signal in frames using Loudness algorithm

Loudness¶

(standard, streaming)

Computes the loudness of an audio signal defined by Steven’s power law

LoudnessEBUR128¶

(standard, streaming)

Computes the EBU R128 loudness descriptors of an audio signal

LoudnessEBUR128Filter¶

(streaming)

An auxilary signal preprocessing algorithm used within the LoudnessEBUR128 algorithm

LoudnessVickers¶

(standard, streaming)

Computes Vickers’s loudness of an audio signal

ReplayGain¶

(standard, streaming)

Computes the Replay Gain loudness value of an audio signal

Extractors¶

BarkExtractor¶

(streaming)

Extracts some Bark bands based spectral features from an audio signal

Extractor¶

(standard)

Extracts all low-level, mid-level and high-level features from an audio signal and stores them in a pool

FreesoundExtractor¶

(standard)

Is a wrapper for Freesound Extractor

LowLevelSpectralEqloudExtractor¶

(standard, streaming)

Extracts a set of level spectral features for which it is recommended to apply a preliminary equal-loudness filter over an input audio signal (according to the internal evaluations conducted at Music Technology Group)

LowLevelSpectralExtractor¶

(standard, streaming)

Extracts all low-level spectral features, which do not require an equal-loudness filter for their computation, from an audio signal

MusicExtractor¶

(standard)

Is a wrapper for Music Extractor

MusicExtractorSVM¶

(standard)

This algorithms computes SVM predictions given a pool with aggregated descriptor values computed by MusicExtractor (or FreesoundExtractor)

Transformations¶

GaiaTransform¶

(standard)

Applies a given Gaia2 transformation history to a given pool

PCA¶

(standard)

Applies Principal Component Analysis based on the covariance matrix of the signal

Synthesis¶

HarmonicMask¶

(standard, streaming)

Applies a spectral mask to remove a pitched source component from the signal

HarmonicModelAnal¶

(standard, streaming)

Computes the harmonic model analysis

HprModelAnal¶

(standard, streaming)

Computes the harmonic plus residual model analysis

HpsModelAnal¶

(standard, streaming)

Computes the harmonic plus stochastic model analysis

ResampleFFT¶

(standard, streaming)

Resamples a sequence using FFT/IFFT

SineModelAnal¶

(standard, streaming)

Computes the sine model analysis

SineModelSynth¶

(standard, streaming)

Computes the sine model synthesis from sine model analysis

SineSubtraction¶

(standard, streaming)

Subtracts the sinusoids computed with the sine model analysis from an input audio signal

SprModelAnal¶

(standard, streaming)

Computes the sinusoidal plus residual model analysis

SprModelSynth¶

(standard, streaming)

Computes the sinusoidal plus residual model synthesis from SPS model analysis

SpsModelAnal¶

(standard, streaming)

Computes the stochastic model analysis

SpsModelSynth¶

(standard, streaming)

Computes the sinusoidal plus stochastic model synthesis from SPS model analysis

StochasticModelAnal¶

(standard, streaming)

Computes the stochastic model analysis

StochasticModelSynth¶

(standard, streaming)

Computes the stochastic model synthesis

Pitch¶

MultiPitchKlapuri¶

(standard)

Estimates multiple pitch values corresponding to the melodic lines present in a polyphonic music signal (for example, string quartet, piano)

MultiPitchMelodia¶

(standard, streaming)

Estimates multiple fundamental frequency contours from an audio signal

PitchCREPE¶

(standard, streaming)

Estimates pitch of monophonic audio signals using CREPE models

PitchContourSegmentation¶

(standard)

Converts a pitch sequence estimated from an audio signal into a set of discrete note events

PitchContours¶

(standard, streaming)

Tracks a set of predominant pitch contours of an audio signal

PitchContoursMelody¶

(standard, streaming)

Converts a set of pitch contours into a sequence of predominant f0 values in Hz by taking the value of the most predominant contour in each frame

PitchContoursMonoMelody¶

(standard, streaming)

Converts a set of pitch contours into a sequence of f0 values in Hz by taking the value of the most salient contour in each frame

PitchContoursMultiMelody¶

(standard, streaming)

Post-processes a set of pitch contours into a sequence of mutliple f0 values in Hz

PitchFilter¶

(standard, streaming)

Corrects the fundamental frequency estimations for a sequence of frames given pitch values together with their confidence values

PitchMelodia¶

(standard, streaming)

Estimates the fundamental frequency corresponding to the melody of a monophonic music signal based on the MELODIA algorithm

PitchSalienceFunction¶

(standard, streaming)

Computes the pitch salience function of a signal frame given its spectral peaks

PitchSalienceFunctionPeaks¶

(standard, streaming)

Computes the peaks of a given pitch salience function

PitchYin¶

(standard, streaming)

Estimates the fundamental frequency given the frame of a monophonic music signal

PitchYinFFT¶

(standard, streaming)

Estimates the fundamental frequency given the spectrum of a monophonic music signal

PitchYinProbabilistic¶

(standard, streaming)

Computes the pitch track of a mono audio signal using probabilistic Yin algorithm

PitchYinProbabilities¶

(standard, streaming)

Estimates the fundamental frequencies, their probabilities given the frame of a monophonic music signal

PitchYinProbabilitiesHMM¶

(standard, streaming)

Estimates the smoothed fundamental frequency given the pitch candidates and probabilities using hidden Markov models

PredominantPitchMelodia¶

(standard, streaming)

Estimates the fundamental frequency of the predominant melody from polyphonic music signals using the MELODIA algorithm

Vibrato¶

(standard, streaming)

Detects the presence of vibrato and estimates its parameters given a pitch contour [Hz]

Segmentation¶

SBic¶

(standard, streaming)

Segments audio using the Bayesian Information Criterion given a matrix of frame features

Machine Learning¶

TensorflowPredict¶

(standard, streaming)

Runs a Tensorflow graph and stores the desired output tensors in a pool

TensorflowPredict2D¶

(standard, streaming)

Makes predictions using models expecting 2D representations

TensorflowPredictCREPE¶

(standard, streaming)

Generates activations of monophonic audio signals using CREPE models

TensorflowPredictEffnetDiscogs¶

(standard, streaming)

Makes predictions using EffnetDiscogs-based models

TensorflowPredictFSDSINet¶

(standard, streaming)

Makes predictions using FSD-SINet models

TensorflowPredictMAEST¶

(standard, streaming)

Makes predictions using MAEST-based models

TensorflowPredictMusiCNN¶

(standard, streaming)

Makes predictions using MusiCNN-based models

TensorflowPredictTempoCNN¶

(standard, streaming)

Makes predictions using TempoCNN-based models

TensorflowPredictVGGish¶

(standard, streaming)

Makes predictions using VGGish-based models