NcmStatsDist: NumCosmo Reference Manual

NcmStatsDist

NcmStatsDist — Abstract class for implementing N-dimensional probability distributions.

Functions

NcmStatsDist *	ncm_stats_dist_ref ()
void	ncm_stats_dist_free ()
void	ncm_stats_dist_clear ()
void	ncm_stats_dist_set_kernel ()
NcmStatsDistKernel *	ncm_stats_dist_peek_kernel ()
NcmStatsDistKernel *	ncm_stats_dist_get_kernel ()
guint	ncm_stats_dist_get_dim ()
guint	ncm_stats_dist_get_sample_size ()
guint	ncm_stats_dist_get_n_kernels ()
gdouble	ncm_stats_dist_get_href ()
void	ncm_stats_dist_set_over_smooth ()
gdouble	ncm_stats_dist_get_over_smooth ()
void	ncm_stats_dist_set_split_frac ()
gdouble	ncm_stats_dist_get_split_frac ()
void	ncm_stats_dist_set_print_fit ()
gboolean	ncm_stats_dist_get_print_fit ()
void	ncm_stats_dist_set_cv_type ()
NcmStatsDistCV	ncm_stats_dist_get_cv_type ()
void	ncm_stats_dist_set_use_threads ()
gboolean	ncm_stats_dist_get_use_threads ()
void	ncm_stats_dist_prepare_kernel ()
void	ncm_stats_dist_prepare ()
void	ncm_stats_dist_prepare_interp ()
gdouble	ncm_stats_dist_eval ()
gdouble	ncm_stats_dist_eval_m2lnp ()
guint	ncm_stats_dist_kernel_choose ()
void	ncm_stats_dist_sample ()
gdouble	ncm_stats_dist_get_rnorm ()
void	ncm_stats_dist_add_obs ()
GPtrArray *	ncm_stats_dist_peek_sample_array ()
NcmMatrix *	ncm_stats_dist_peek_cov_decomp ()
gdouble	ncm_stats_dist_get_lnnorm ()
NcmVector *	ncm_stats_dist_peek_weights ()
void	ncm_stats_dist_get_Ki ()
void	ncm_stats_dist_reset ()

Properties

NcmStatsDistCV	CV-type	Read / Write / Construct
guint	N	Read
NcmStatsDistKernel *	kernel	Read / Write / Construct Only
double	over-smooth	Read / Write / Construct
gboolean	print-fit	Read / Write / Construct
double	split-frac	Read / Write / Construct
gboolean	use-threads	Read / Write / Construct

Types and Values

#define	NCM_TYPE_STATS_DIST
struct	NcmStatsDistClass
enum	NcmStatsDistCV
	NcmStatsDist

Object Hierarchy

    GEnum
    ╰── NcmStatsDistCV
    GObject
    ╰── NcmStatsDist
        ╰── NcmStatsDistKDE

Description

Abstract class to reconstruct an arbitrary N-dimensional probability distribution. This class provides the tools to perform a radial basis interpolation of a multidimensional function using a radial basis function, and then to generate a new sample using the interpolation function as the kernel. The generated sample follows the original distribution but is simpler to obtain, since the kernels used are easy to sample from. For more information about radial basis interpolation, see [Radial Basis Function Interpolation, Wilna du Toit]. A brief description of the radial basis interpolation method follows.

Given a d-dimensional function $g(x): \mathbf{R}^d \rightarrow \mathbf{R}$, a radial basis function $\phi(x, \Sigma)$ is used such that \begin{align} \label{Interpolation_eq} s(x) = \sum_{i=1}^n \lambda_i \phi(|x-x_i|, \Sigma_i), \quad x \in \mathbf{R}^d . \end{align} The variables $\lambda_i$ represent the weights and are found such that \begin{align} \label{eqnnls1} s(x_i) = g(x_i) , \end{align} where $x_i$ are the sample points. The values generated by $\phi(|x-x_i|, \Sigma_i)$ are arranged in a symmetric $n \times n$ matrix $\Phi$. The function $\phi$ depends on the distance between the points and on the covariance matrix $\Sigma_i$ associated with each point. The weights $\lambda_i$ are also organised in a matrix representation such that equation \eqref{eqnnls1} becomes \begin{align} \label{eqnnls} G = \lambda \times \Phi ,\end{align} where $G$ is a matrix containing all the function values $g(x_i)$. Once $\lambda$ is found, one may use $s(x)$ to sample values from $g(x)$, which is easier to do since $s(x)$ is a weighted sum of simple kernels.

We want $s(x)$ to be a probability distribution so that we can sample from it. The weights $\lambda$ are therefore interpreted as probabilities: they must be non-negative and sum to one. To satisfy this constraint, equation \eqref{eqnnls} is solved for $\lambda$ as a least-squares problem using the non-negative least squares (NNLS) method, which can be found in the nnls.c file. The algorithm can then randomly choose a kernel $\phi(|x-x_i|, \Sigma_i)$ with the probability given by the corresponding entry of $\lambda$ and sample a point from it.
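The kernel-selection step can be sketched as inverse-CDF sampling over the weights. The C function below is an illustrative assumption, not the code behind ncm_stats_dist_kernel_choose(): the name kernel_choose and its signature are hypothetical, and it expects the weights to be non-negative and to sum to one, with u drawn uniformly from [0, 1).

```c
#include <stddef.h>

/* Returns kernel index i with probability weights[i], given a uniform
 * deviate u in [0, 1). Assumes non-negative weights that sum to one. */
size_t
kernel_choose (const double *weights, size_t n, double u)
{
  double cum = 0.0;
  size_t i;

  for (i = 0; i < n; i++)
  {
    cum += weights[i];

    if (u < cum)
      return i;
  }

  /* Guard against round-off when u is very close to one. */
  return n - 1;
}
```

A kernel with zero weight is never selected, which is why the non-negativity constraint imposed by NNLS matters: many weights end up exactly zero, and only the remaining kernels contribute to the sample.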

In this object, the radial basis interpolation function is not completely defined. One must choose one of the implementations of NcmStatsDistKernel: the NcmStatsDistKernelST object, which uses a multivariate Student's t function as the kernel, or the NcmStatsDistKernelGauss object, which uses a Gaussian function. After initializing the desired object for the interpolation function, one may use the methods of this file to generate the interpolation and to sample from the new interpolated function.

The user must provide the following input values: over_smooth - ncm_stats_dist_set_over_smooth(), split_frac - ncm_stats_dist_set_split_frac(), $v(x)$ - ncm_stats_dist_prepare_interp(). The other parameters must be provided when the NcmStatsDistKDE or the NcmStatsDistVKDE object is initialized. To perform a calculation with this class, one needs to instantiate one of its subclasses (NcmStatsDistKDE or NcmStatsDistVKDE), passing as input a child object of the class NcmStatsDistKernel (NcmStatsDistKernelGauss or NcmStatsDistKernelST). For more information about the algorithm, see the description below.

- Since this class does not define what type of kernel will be used in the calculation (the fixed kernel in the NcmStatsDistKDE class or the variable kernel in the NcmStatsDistVKDE class), one cannot compute the sample using this class alone. The function to be used as the kernel must also be provided; it is implemented in the children of the class NcmStatsDistKernel. When initializing the NcmStatsDistKDE or NcmStatsDistVKDE classes, the function to be used as the kernel is defined in the object initialization function.

- This class also needs a child object to compute the interpolation matrix $IM$ and the covariance matrices stored in cov_decomp to perform the interpolation. These are kernel dependent and are therefore also computed by the child objects of the class.

- Regarding the kernel types based on the radial basis function, $\phi(|x-x_i|)$, and how the sample points in ncm_stats_dist_sample() are generated, see the different implementations of NcmStatsDistKernel, e.g., NcmStatsDistKernelGauss and NcmStatsDistKernelST.

- Regarding how the functions ncm_stats_dist_eval() and ncm_stats_dist_eval_m2lnp() are implemented, see the different implementations of NcmStatsDist, i.e., NcmStatsDistKDE and NcmStatsDistVKDE. These objects also compute the covariance matrix of each sample point and the other objects needed for the least-squares problem when computing the weights matrix ($\lambda$).

Functions

ncm_stats_dist_ref ()

NcmStatsDist *
ncm_stats_dist_ref (NcmStatsDist *sd);

Increases the reference count of sd.

sd	a NcmStatsDist
sample_array	an array of NcmVector.	[element-type NcmVector]

sd	a NcmStatsDist
m2lnp	a NcmVector containing the distribution values that will be used to compute the interpolation function.

sd	a NcmStatsDist
i	kernel index
y_i	kernel location.	[out callee-allocates][transfer full]
cov_i	kernel covariance U.	[out callee-allocates][transfer full]
n_i	kernel normalization.	[out]
w_i	kernel weight.	[out]

NCM_STATS_DIST_CV_NONE	No cross validation
NCM_STATS_DIST_CV_SPLIT	Sample split cross validation
NCM_STATS_DIST_CV_SPLIT_NOFIT	Sample split cross validation without fitting
NCM_STATS_DIST_CV_LOO	Leave-one-out cross validation

sd	a NcmStatsDist
print_fit	a boolean

sd	a NcmStatsDist
use_threads	whether to use threads

sd	a NcmStatsDist
rng	a NcmRNG

sd	a NcmStatsDist
x	a NcmVector
rng	a NcmRNG
