Probably Physics

Alexei Gilchrist

3 Probability

Exploration of the role of probability in various areas of physics

❦

Probability permeates much of physics. It appears in quantifying errors in every measurement, in the dynamics of stochastic processes, in statistical mechanics as a way of coping with the vast amount of variables, and even intrinsically in quantum mechanics. At first glance the use of probability may seem natural and even `obvious’, but things get much more lively when you realise that it is still not settled what a probability is. Different interpretations of probability affect the meaning of all the areas that it touches.

This material supports a second and third year advanced physics unit at Macquarie University.

In these notes I will take the view that probabilities are a measure of plausibility and probability theory is the extension of deductive logic to incomplete information. This view follows Laplace, Jeffreys, Cox, and Jaynes.

Status: In development. These notes are very much a work in progress and an exploration of the topic. They may change radically in the future. Lecture notes are available as mindmaps here: /map/probably-physics/.

Marginalization2Probability Inference
Variables are introduced and then some consequences of the sum-rule are explored.
Bayes’ rule3Probability Inference
Some consequences of the product rule are explored including the famous Bayes’ rule.
Assigning numbers2Probability
How do you assign actual values to probabilities?
Probability and Frequency3Probability
Given that we are interpreting probability as a measure of plausibility, just what is the relationship of probabilities to frequencies?
Comparing Models3Probability Inference
Model comparison is one of the principal tasks on inference. Given some data, how does the plausibility of different models change? Does the data select out a particular model as being better?
Probability Notation2Probability
making statements about whole families of logical propositions
Parameter Estimation2Probability Inference
Another of the key tasks of inference is to determine the value of a parameter in a model on the basis of observed data
Priors2Probability Inference
A closer look at prior distributions
Maximum Entropy2Probability Maximum Entropy
An argument to maximising the entropy
Understanding Uncertainty2Probability Maximum Entropy
What is meant by “uncertainty”?
Maximum entropy with average constraints3Maximum Entropy
Find the probability distribution that maximises the entropy subject to requiring some averages to be fixed.
Statistical Mechanics2Maximum Entropy
Maximum entropy leads to some of the famous results of statistical mechanics.
Variable Independence2Bayesian Networks Probability
When we have several variables, the joint probability distribution over all the variables has all the information we need to calculate any simpler probability distribution. The joint probability distribution can be expressed efficiently if there are independence relationships between variables.
Bayesian Networks2Bayesian Networks
A factorisation of the joint probability distribution can be represented in a graph known as a Bayesian Network. These networks codify independencies between variables and information flow as variables are measured.
Information2Information Theory Probability
A definition of information is introduced which leads to yet another connection with entropy.
Data Compression3Information Theory
We examine the problem of optimally encoding a set of symbols in some alphabet to reduce the average length of the code.
Convexity2Convexity
Quick introduction to convex sets, convex functions and Jensen’s inequality
Joint Entropy2Information Theory
The joint entropy and its properties
Conditional Entropy2Information Theory
Conditional entropy.
Relative Entropy2Information Theory
The relative entropy, or Kullback-Leibler divergence is a measure of the difference of two distributions
Mutual Information2Information Theory
Mutual information.
Communication through noisy channels3Information Theory
Communication through memory-less static channels will be examined, and in particular using repetition codes to counteract the errors introduced by the channel.