Reminder of some Statistical Values

This page reviews the equations of some classical statistical values.

If N designates the total number of points, Zi stands for the value of the variable at a given point,

Arithmetic mean
Weighted arithmetic mean
Geometric mean

or

Weighted geometric mean

or

Harmonic mean
Weighted harmonic mean
Variance
Weighted variance
Skewness
Weighted skewness
Kurtosis
Weighted kurtosis
Correlation (Pearson correlation coefficient)
Correlation (Spearman correlation coefficient)

For a dataset of n samples, the n values Xi, Yi are converted to ranks R(Xi), R(Yi), and the rank correlation is computed as:

On the opposite to the commonly used Pearson correlation, which measures the linear correlation of a bivariate dataset, the Spearman correlation coefficient measures how well the rank of the samples regarding two variables are correlated. It assesses how well the relationship between two variables can be described using a monotonic function (always increase or decrease). For this reason, it is also known as the “rank correlation coefficient”. It is generally used to compare multivariate datasets with complex relationships, such as the input and output dataset of a PPMT workflow. It is less sensitive to outliers than classical correlation since only the rank of the sample is used, and not the measure value itself.

Weighted correlation

In the following equations, n designates the number of pairs of data separated by the considered distance, Zi and Zi+h stands for the value of the variable at two data points i and i+h constituting a pair.

Variogram
Weighted variogram