The Gaussianize analyzer fits each descriptor into a gaussian distribution. More...

#include <gaussianize.h>

Inheritance diagram for gaia2::Gaussianize:

Public Member Functions
	Gaussianize (const ParameterMap &params)

Transformation	analyze (const DataSet *data, const Region &region) const

Public Member Functions inherited from gaia2::Analyzer
	Analyzer (const ParameterMap &params)

virtual Transformation	analyze (const DataSet *dataset) const

void	checkDataSet (const DataSet *dataset) const
	Checks that the given dataset is valid. More...

void	checkMinPoints (const DataSet *dataset, int n) const
	Checks that the given dataset as at least the specified number of points. More...

const Region &	checkFixedLength (const Region &region, const PointLayout &layout) const
	Checks that the given Region only contains fixed-length descriptors and throws an exception if not. More...

Protected Attributes
int	_maxDistSize

Protected Attributes inherited from gaia2::Analyzer
ParameterMap	_params

QStringList	_descriptorNames

QStringList	_exclude

Additional Inherited Members
Public Attributes inherited from gaia2::Analyzer
QString	name
	Name for the algorithm, usually the key that was used to instantiate it from the factory.

QStringList	validParams
	List of valid parameters this analyzer accepts. More...

Detailed Description

The Gaussianize analyzer fits each descriptor into a gaussian distribution.

It does it by sorting all the values, and giving them new values as if they were forming a perfect gaussian distribution. In doing so it completely discards the value it had before (only keeps the rank), and as such may be not as correct or precise as a BoxCox transformation, but it is much faster to compute.

NB: This algorithm does not yet work with multi-segments points.

Parameters

distributionSize when analyzing the distribution of a lot of values, it is not necessary to keep all points as reference, but only a subset of them. This value represent the maximum number of reference values used to model the distribution. In most cases, 10'000 points should give a precise enough distribution while still being efficient to compute. (default: 10000).

The documentation for this class was generated from the following files:

src/algorithms/gaussianize.h
src/algorithms/gaussianize.cpp

Public Member Functions

Protected Attributes

Additional Inherited Members

Detailed Description