Summary

Data file information

High level summary of the data file and available variables.

value value
Size (KB) 6.3
Observations 47 Numeric Variables 6
Variables 6 Non-Numeric Variables 0

Note that numeric data may be any one of double, integer, complex, logical or numeric

Variable Details

Information on each available variable.

type mean sd missing data missing data (%)
Fertility double 70.143 12.492 0 0 %
Agriculture double 50.66 22.711 0 0 %
Examination integer 16.489 7.978 0 0 %
Education integer 10.979 9.615 0 0 %
Catholic double 41.144 41.705 0 0 %
Infant.Mortality double 19.943 2.913 0 0 %

Distributions of variables

Numeric distributions

Understanding the distribution of numeric data is useful for informing data cleaning and modelling. Numeric data is assumed to be continuous for the creation of these distributions.

Categorical distributions

Categorical data is explored through the frequencies of occurrence of each category.

## [1] "No categorical columns"

Correlations between variables

Correlation Matrix

Linear correlation between variables, yields values between 1, -1. 1 and -1 correspond to perfect positive and negative relationships respectively, while values close to zero suggest no relationship between the variable pair.

Pairwise Matrix

Scatter plots for each pair of variables.

Regressions

Scatter plots and regressions for the three strongest pairwise correlations.