probly.dev

Examples

How it works

Probly is a Python-like mini-language for probabilistic estimation. It's based on Starlark and implemented in Go.

Probly syntax

You may use any Starlark syntax. There are only the following differences to Starlark:

A variable may follow a probability distribution, in addition to usual types like numbers or dictionaries
The Starlark math module is imported by default, so you can directly use it e.g. math.sqrt(2). Probly also has a built-in sum function not available in Starlark.

This page will show you the probability distributions of all the numeric (scalar or distribution) global variables in your program (except those starting with an underscore). The values are taken at the end of the program's execution.

Probability distributions

Name						Quantiles	Notes
`Normal`	`mean`	`sd`				2
`Normal` examples Parameters `Normal(1, 2) Normal(mean=1, sd=2)` Quantiles `Normal(p12=-1, p34=0) Normal(quantiles={0.123:-1, 0.456:0})` `to` operator (10th to 90th percentile) `Normal(-1 to 0)` `pm` operator `Normal(-1 pm 1)` `m pm x` makes `m` the median and `m-x` and `m+x` the 10th and 90th percentiles respectively. This works because the Normal distribution is symmetric around its median.
`LogNormal`	`mu`	`sigma`				2	Alternatively: `mean`, `sd`
`LogNormal` examples Parameters Parametrization 1 `LogNormal(1, 2) LogNormal(mu=1, sigma=2)` Parametrization 2 `LogNormal(mean=3, sd=4)` Quantiles `LogNormal(p12=1, p34=2) LogNormal(quantiles={0.123:1, 0.456:2})` `to` operator (10th to 90th percentile) `LogNormal(1 to 2)` `td` operator `LogNormal(1 td 2)` `m td x` makes `m` the median and `m/x` and `m*x` the 10th and 90th percentiles respectively. This works because the LogNormal distribution is symmetric in multiplicative space around its median.
`Beta`	`alpha`	`beta`
`Beta` examples `Beta(1, 2) Beta(alpha=1, beta=2)`
`PERT`	`min`	`mode`	`max`	`[lambd]`			Like the triangular, but smoother (Wikipedia)
`PERT` The optional parameter `lambd` (default: 4) controls the weight given to the mode. Values of `lambd < 4` have the effect of flattening the density curve. Examples Parametrization 1 `PERT(0, 1, 3) PERT(min=0, mode=1, max=3)` Parametrization 2 `PERT(min=0, mode=1, max=3, lambd=4)`
`Uniform`	`a`	`b`				2	`a` need not be less than `b`
`Uniform` examples Parameters `Uniform(1, 2) Uniform(a=1, b=2)` Quantiles `Uniform(p12=-1, p34=0) Uniform(quantiles={0.123:-1, 0.456:0})` `to` operator (10th to 90th percentile) `Uniform(-1 to 0)` `pm` operator `Uniform(-1 pm 1)` `m pm x` makes `m` the median and `m-x` and `m+x` the 10th and 90th percentiles respectively. This works because the Uniform distribution is symmetric around its median.
`LogUniform`	`a`	`b`				2	`a` need not be less than `b`
`LogUniform` examples Parameters `LogUniform(1, 2) LogUniform(a=1, b=2)` Quantiles `LogUniform(p12=1, p34=2) LogUniform(quantiles={0.123:1, 0.456:2})` `to` operator (10th to 90th percentile) `LogUniform(1 to 2)` `td` operator `LogUniform(1 td 2)` `m td x` makes `m` the median and `m/x` and `m*x` the 10th and 90th percentiles respectively. This works because the LogUniform distribution is symmetric in multiplicative space around its median.
`Bernoulli`	`p`
`Bernoulli` examples `Bernoulli(0.5) Bernoulli(p=0.5)`
`Binomial`	`n`	`p`
`Binomial` examples `Binomial(1, 0.5) Binomial(n=1, p=0.5)`
`Discrete`	`x_1`	`p_1`	`x_2`	`p_2`	...		Generic discrete distribution over any finite set of values
`Discrete` examples `Discrete(1, 0.4, 2, 0.6) Discrete(x_1=1, p_1=0.4, x_2=2, p_2=0.6)`

math

These mathematical functions and constants are available in the math module:

pow(x, y) - Returns x raised to the power of y
exp(x)
sqrt(x)
log(x, [base]) - Natural logarithm by default if base is not specified
e
pi
Ceil, floor, and sign manipulation:
- ceil(x)
- floor(x)
- fabs(x) - Returns the absolute value of x as float
- copysign(x, y) - Returns a value with the magnitude of x and the sign of y
mod(x, y) - Returns x modulo y
remainder(x, y)
round(x) - Returns the nearest integer, rounding half away from zero
Trigonometry (in radians unless otherwise specified):
- acos(x)
- asin(x)
- atan(x)
- atan2(y, x) - Returns atan(y / x). The result is between -pi and pi
- cos(x)
- sin(x)
- tan(x)
- degrees(x) - Converts angle x from radians to degrees
- radians(x) - Converts angle x from degrees to radians
- acosh(x)
- asinh(x)
- atanh(x)
- cosh(x)
- sinh(x)
- tanh(x)
hypot(x, y) - Returns the Euclidean norm, sqrt(x^2 + y^2); the distance from the origin to (x, y)
gamma(x) - Returns the Gamma function at x

Starlark syntax

This code provides an example of the syntax of Starlark:

# Define a number
number = 18

# Define a list
numbers = [1, 2, 3, 4, 5]

# List comprehension
halves = [n / 2 for n in numbers]

# Define a function
def is_even(n):
    """Return True if n is even."""
    return n % 2 == 0

# Define a dictionary
people = {
    "Alice": 22,
    "Bob": 40,
    "Charlie": 55,
    "Dave": 14,
}

names = ", ".join(people.keys())  # Alice, Bob, Charlie, Dave

# Modify a variable in a loop
sum_even_ages = 0
for age in people.values():
    if is_even(age):
        sum_even_ages += age

# Append to a list in a loop
over_30_names = []
for name, age in people.items():
    if age > 30:
        over_30_names.append(name)

If you've ever used Python, this should look very familiar. In fact, the code above is also valid Python code. Still, this short example shows most of the language. Starlark is a very small language that implements a limited subset of Python.

For our purposes, one notable difference to Python is that the exponentiation operator ** is not supported. You have to use math.pow.

You can also look at the Starlark language specification.

Speed

Though not designed for speed, Probly is fast enough for practical purposes: around 10 milliseconds for 3,000 samples, for most examples on this page. This is due to being implemented in Go.

The time taken to return results on this page is spent overwhelmingly in web application code, not in Probly evaluation.

Interestingly, Probly is still slower than Python code that uses entirely numpy array operations, which are very well optimised. This should only begin to matter at very large scales, or if latency is critical.

Limitations

It's not currently possible to obtain and manipulate properties of a distribution within an Probly program, like so:

x = Normal(1 to 10)
y = x.std()  # Not possible

Supporting this would require some fundamental changes to the implementation of Probly, which is currently very simplistic.

Prior work

The to binary operator was inspired by Squiggle.

Example Monty Hall problem

In the famous Monty Hall problem, you are given the choice of three doors. Behind one door is a car; behind the others, goats. You pick a door, say No. 1, and the host, who knows what's behind the doors, opens another door, say No. 3, which has a goat. He then says to you, "Do you want to pick door No. 2?" Is it to your advantage to switch your choice?

The answer is that you should always switch. This example demonstrates it by simulation.

In our model, a payoff of 0 represents a goat and 1 represents the car. The switch payoff distribution has a 2/3 probability of winning the car (vs 1/3 without switching). Its expectation is about 0.66.

Distribution details

switch_payoff


Mean	0.654
Std. dev.	0.476
Variance	0.226

Quantile
0.05	0
0.25	0
0.50	1.00
0.75	1.00
0.95	1.00

Log Scale

p (X <) =

our_door


Mean	2.01
Std. dev.	0.821
Variance	0.674

Quantile
0.05	1.00
0.25	1.00
0.50	2.00
0.75	3.00
0.95	3.00

Log Scale

p (X <) =

car_door


Mean	2.01
Std. dev.	0.815
Variance	0.664

Quantile
0.05	1.00
0.25	1.00
0.50	2.00
0.75	3.00
0.95	3.00

Log Scale

p (X <) =

Simulation data

CSV

Download CSV

Preview

	car_door	our_door	switch_payoff
0	3.00	3.00	0
1	3.00	2.00	1.00
2	3.00	2.00	1.00
...	...	...	...
2997	1.00	1.00	0
2998	1.00	2.00	1.00
2999	1.00	1.00	0

API

Get the simulation data (and more) in a machine-readable format: /api/sim/n5tzoecsQoXCEmsTtTKn5M/

Tip: you can share your results with anyone using the current url: https://probly.dev/sim/n5tzoecsQoXCEmsTtTKn5M/?is_example=monty. (No guarantees. If your results are important, please save them another way too).

Probly syntax

Probability distributions

Normal examples

Parameters

Quantiles

to operator (10th to 90th percentile)

pm operator

LogNormal examples

Parameters

Parametrization 1

Parametrization 2

Quantiles

to operator (10th to 90th percentile)

td operator

Beta examples

PERT

Examples

Parametrization 1

Parametrization 2

Uniform examples

Parameters

Quantiles

to operator (10th to 90th percentile)

pm operator

LogUniform examples

Parameters

Quantiles

to operator (10th to 90th percentile)

td operator

Bernoulli examples

Binomial examples

Discrete examples

math

Starlark syntax

Speed

Limitations

Prior work

Distribution details

switch_payoff

our_door

car_door

Simulation data

CSV

Preview

API

`Normal` examples

`to` operator (10th to 90th percentile)

`pm` operator

`LogNormal` examples

`to` operator (10th to 90th percentile)

`td` operator

`Beta` examples

`PERT`

`Uniform` examples

`to` operator (10th to 90th percentile)

`pm` operator

`LogUniform` examples

`to` operator (10th to 90th percentile)

`td` operator

`Bernoulli` examples

`Binomial` examples

`Discrete` examples