Bayesian Statistics: How to Learn from Data, One Step at a Time

In previous articles, we’ve examined statistical inference from a precise and coherent perspective: formulate a hypothesis, collect data, calculate a p-value, construct a confidence interval. We’ve conducted hypothesis tests, compared variants with A/B testing, and seen with the Central Limit Theorem why all of this works even when data isn’t normal.

This approach—called frequentist—has a clear logic: the parameter we want to estimate is a fixed value (even if unknown), and we “chase” it with data. But there’s another way to think about uncertainty, one that allows us to update our beliefs as new data arrives. It’s called the Bayesian approach, and in this article we’ll build its foundations.

Let’s start with a concrete example. Imagine we’ve just launched an advertising campaign and we don’t know the true click rate. We have an initial opinion based on experience (“click rates usually fall between 0% and 20%”), and then data starts coming in. The Bayesian approach lets us combine our initial opinion with the observed data to get an updated estimate—and repeat this process every time new information arrives.
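The updating loop described above can be sketched with the classic Beta-Binomial conjugate pair: the prior belief about the click rate is a Beta distribution, and each batch of clicks/impressions updates its parameters. The specific numbers (prior, clicks, impressions) below are illustrative assumptions, not data from the article.

```python
# Beta-Binomial update sketch: prior Beta(alpha, beta) + observed data
# -> posterior Beta(alpha + clicks, beta + non-clicks).

def update_beta(alpha, beta, clicks, impressions):
    """Return the updated Beta parameters after observing new data."""
    return alpha + clicks, beta + (impressions - clicks)

# Weak prior roughly concentrated below 20% (prior mean = 2/20 = 10%)
alpha, beta = 2, 18

# First batch of data arrives: 30 clicks out of 400 impressions
alpha, beta = update_beta(alpha, beta, 30, 400)
posterior_mean = alpha / (alpha + beta)
print(round(posterior_mean, 4))  # posterior mean pulled toward the data
```

Calling `update_beta` again on the next batch repeats the process, which is exactly the "update as new information arrives" idea.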

Continue reading “Bayesian Statistics: How to Learn from Data, One Step at a Time”

How to Use Decision Trees to Classify Data

Decision Trees are a type of machine learning algorithm that uses a tree structure to split data according to logical rules and predict the class of new observations. They are easy to interpret and adaptable to different types of data, but can also suffer from problems such as overfitting, excessive complexity, and sensitivity to imbalanced classes.
Let’s understand a bit more about them and examine a simple example of use in R.
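The article's worked example is in R, but the core idea (splitting data with a logical rule chosen to minimize impurity) can be sketched in a few lines of self-contained Python with a hand-rolled one-level tree (a "decision stump"); the toy data below is made up for illustration.

```python
# Minimal decision stump: find the rule "x <= t" that best separates
# the two classes by minimizing weighted Gini impurity. Real libraries
# (rpart in R, scikit-learn in Python) grow deeper trees the same way.

def gini(labels):
    """Gini impurity of a list of binary class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    p1 = labels.count(1) / n
    return 1 - p1**2 - (1 - p1)**2

def best_threshold(xs, ys):
    """Pick the split x <= t that minimizes weighted Gini impurity."""
    best_t, best_score = None, float("inf")
    for t in sorted(set(xs)):
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(xs)
        if score < best_score:
            best_t, best_score = t, score
    return best_t

xs = [1.0, 2.0, 3.0, 7.0, 8.0, 9.0]
ys = [0, 0, 0, 1, 1, 1]
t = best_threshold(xs, ys)
print(t)  # the rule "x <= 3.0 -> class 0, else class 1" separates perfectly
```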

Continue reading “How to Use Decision Trees to Classify Data”

The Gradient Descent Algorithm Explained Clearly: From Intuition to Practice

A blindfolded person on a mountain

Imagine standing on a mountainous terrain, completely blindfolded. Your goal: reach the lowest point in the valley. You can’t see anything, but you can feel the slope of the ground beneath your feet. What do you do? You move in the direction where the ground goes down, one step at a time. If it slopes more steeply to the left, you go left. If it drops more to the right, you go right. With each step, you feel the slope again and redirect yourself.

This strategy, so simple and natural, is exactly what neural networks use to learn. Every time an AI model improves — learning to recognize a face, translate a sentence, or generate text — it does so by descending through a mathematical landscape, one step at a time, following the slope.

It’s called gradient descent, and it’s arguably the most important algorithm in modern machine learning.
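The "feel the slope, step downhill" loop can be sketched in a few lines for a one-dimensional landscape, here f(x) = (x − 3)², whose slope is 2(x − 3); the starting point, learning rate, and iteration count are illustrative choices.

```python
# Gradient descent on f(x) = (x - 3)^2: repeatedly step against the slope.

def grad(x):
    """Slope (derivative) of f(x) = (x - 3)^2 at x."""
    return 2 * (x - 3)

x = 0.0    # starting point on the "mountain"
lr = 0.1   # step size (learning rate)
for _ in range(100):
    x -= lr * grad(x)  # step in the downhill direction

print(round(x, 4))  # converges toward the minimum at x = 3
```

Neural networks do the same thing, except x is millions of weights and the slope is computed by backpropagation.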

Infographic: the blindfolded explorer metaphor for gradient descent, with three steps: Sensor, Action, Cycle
Continue reading “The Gradient Descent Algorithm Explained Clearly: From Intuition to Practice”

The Negative Binomial Distribution (or Pascal Distribution)

The negative binomial distribution describes the number of trials needed to achieve a certain number of successes in a series of independent trials. For example, it could be used to calculate the probability that the third head appears exactly on the fifth flip of a coin, assuming the coin is fair and therefore the probability of getting heads on each flip is 50%. (Note that this differs from the binomial distribution, which would give the probability of some number of heads in a fixed number of flips.)
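As a quick sketch, the probability that the r-th success occurs exactly on trial n, with success probability p per trial, is C(n−1, r−1) · pʳ · (1−p)ⁿ⁻ʳ, since the first n−1 trials must contain exactly r−1 successes and trial n must be a success:

```python
from math import comb

# Negative binomial pmf: probability that the r-th success occurs
# exactly on trial n, with success probability p per independent trial.
def neg_binom_pmf(n, r, p):
    return comb(n - 1, r - 1) * p**r * (1 - p) ** (n - r)

# Probability the 3rd head appears exactly on the 5th flip of a fair coin:
print(neg_binom_pmf(5, 3, 0.5))  # C(4, 2) * 0.5^5 = 6/32 = 0.1875
```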

The negative binomial distribution is useful in many fields, including statistics, economics, biology, and physics. And also in “our” SEO.

Continue reading “The Negative Binomial Distribution (or Pascal Distribution)”