MusicExtractor

standard mode | Extractors category

Inputs

  • filename (string) - the input audiofile

Outputs

  • results (pool) - Analysis results pool with across-frames statistics

  • resultsFrames (pool) - Analysis results pool with computed frame values

Parameters

  • analysisSampleRate (real ∈ (0, ∞), default = 44100) :

    the analysis sampling rate of the audio signal [Hz]

  • chromaprintCompute (bool ∈ {true, false}, default = false) :

    compute the Chromaprint

  • chromaprintDuration (real ∈ [0, ∞), default = 0) :

    the amount of time from the beginning used to compute the Chromaprint. 0 to use the full audio length [s]

  • endTime (real ∈ [0, ∞), default = 1e+06) :

    the end time of the slice you want to extract [s]

  • gfccStats (vector_string, default = [“mean”, “cov”, “icov”]) :

    the statistics to compute for GFCC features

  • highlevel (vector_string) :

    list of high-level classifier models (gaia2 history filenames) to apply using extracted features. Skip classification if not specified (empty list)

  • loudnessFrameSize (integer ∈ (0, ∞), default = 88200) :

    the frame size for computing average loudness

  • loudnessHopSize (integer ∈ (0, ∞), default = 44100) :

    the hop size for computing average loudness

  • lowlevelFrameSize (integer ∈ (0, ∞), default = 2048) :

    the frame size for computing low-level features

  • lowlevelHopSize (integer ∈ (0, ∞), default = 1024) :

    the hop size for computing low-level features

  • lowlevelSilentFrames (string ∈ {drop, keep, noise}, default = noise) :

    whether to [keep/drop/add noise to] silent frames for computing low-level features

  • lowlevelStats (vector_string, default = [“mean”, “var”, “stdev”, “median”, “min”, “max”, “dmean”, “dmean2”, “dvar”, “dvar2”]) :

    the statistics to compute for low-level features

  • lowlevelWindowType (string ∈ {hamming, hann, triangular, square, blackmanharris62, blackmanharris70, blackmanharris74, blackmanharris92}, default = blackmanharris62) :

    the window type for computing low-level features

  • lowlevelZeroPadding (integer ∈ [0, ∞), default = 0) :

    zero padding factor for computing low-level features

  • mfccStats (vector_string, default = [“mean”, “cov”, “icov”]) :

    the statistics to compute for MFCC features

  • profile (string) :

    profile filename. If specified, default parameter values are overwritten by values in the profile yaml file. If not specified (empty string), use values configured by user like in other normal algorithms

  • requireMbid (bool ∈ {true, false}, default = false) :

    ignore audio files without musicbrainz recording id tag (throw exception)

  • rhythmMaxTempo (integer ∈ [60, 250], default = 208) :

    the fastest tempo to detect [bpm]

  • rhythmMethod (string ∈ {multifeature, degara}, default = degara) :

    the method used for beat tracking

  • rhythmMinTempo (integer ∈ [40, 180], default = 40) :

    the slowest tempo to detect [bpm]

  • rhythmStats (vector_string, default = [“mean”, “var”, “stdev”, “median”, “min”, “max”, “dmean”, “dmean2”, “dvar”, “dvar2”]) :

    the statistics to compute for rhythm features

  • startTime (real ∈ [0, ∞), default = 0) :

    the start time of the slice you want to extract [s]

  • tonalFrameSize (integer ∈ (0, ∞), default = 4096) :

    the frame size for computing tonal features

  • tonalHopSize (integer ∈ (0, ∞), default = 2048) :

    the hop size for computing tonal features

  • tonalSilentFrames (string ∈ {drop, keep, noise}, default = noise) :

    whether to [keep/drop/add noise to] silent frames for computing tonal features

  • tonalStats (vector_string, default = [“mean”, “var”, “stdev”, “median”, “min”, “max”, “dmean”, “dmean2”, “dvar”, “dvar2”]) :

    the statistics to compute for tonal features

  • tonalWindowType (string ∈ {hamming, hann, triangular, square, blackmanharris62, blackmanharris70, blackmanharris74, blackmanharris92}, default = blackmanharris62) :

    the window type for computing tonal features

  • tonalZeroPadding (integer ∈ [0, ∞), default = 0) :

    zero padding factor for computing tonal features

Description

This algorithm is a wrapper for Music Extractor. See documentation for ‘essentia_streaming_extractor_music’.

Source code

See also

Extractor (standard)