Skip to content

Inference of relative ability from winning probabilities

License

Notifications You must be signed in to change notification settings

microprediction/winning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

test-38 test-39 test-310 test-311 test-312 test-pandas

A fast, scalable numerical algorithm for inferring relative ability from multi-entrant contest winning probabilities.

An earlier version of this algorithm was published in SIAM Journal on Quantitative Finance (pdf).

See www.quinellacalculator.com for a demonstration and this comment on Thurston models for choice.

Install

pip install winning
pip install scipy

You can avoid installing scipy if you have Python 3.8 and above and you don't wish to use the copula functionality.

Usage

We choose a performance density

density = centered_std_density()

We set 'dividends', which are for all intents and purposes inverse probabilities

dividends = [2,6,np.nan, 3]

The algorithm implies relative ability (i.e. how much to translate the performance distributions to match the winning probabilities).

abilities = dividend_implied_ability(dividends=dividends,density=density, nan_value=2000)

Horses with no bid are assigned odds of nan_value ... or you can leave them out.

Examples

See example_basic

Ideas beyond horse-racing

See the notebook for examples of the use of this ability transform.

See the paper for why this is useful in lots of places, according to a wise man. For instance, the algorithm may also find use anywhere winning probabilities or frequencies are apparent, such as with e-commerce product placement, in web search, or, as is shown in the paper: addressing a fundamental problem of trade.

Generality

All performance distributions.

Visual example:

Use densitiesPlot to show the results

L = 600
unit = 0.01
density = centered_std_density(L=L, unit=unit)
dividends = [2,6,np.nan, 3]
abilities = dividend_implied_ability(dividends=dividends,density=density, nan_value=2000, unit=unit)
densities = [skew_normal_density(L=L, unit=unit, loc=a, a=0, scale=1.0) for a in abilities]
legend = [ str(d) for d in dividends ]
densitiesPlot(densities=densities, unit=unit, legend=legend)
plt.show()

Pricing show and place from win prices:

See the basic examples.

from winning.lattice_pricing import skew_normal_simulation
from pprint import pprint
dividends = [2.0,3.0,12.0,12.0,24.0,24.0]
pricing = skew_normal_simulation(dividends=dividends,longshot_expon=1.15,skew_parameter=1.0,nSamples=1000)
pprint(pricing)

Cite

    @article{doi:10.1137/19M1276261,
    author = {Cotton, Peter},
    title = {Inferring Relative Ability from Winning Probability in Multientrant Contests},
    journal = {SIAM Journal on Financial Mathematics},
    volume = {12},
    number = {1},
    pages = {295-317},
    year = {2021},
    doi = {10.1137/19M1276261},
    URL = { 
            https://doi.org/10.1137/19M1276261
    },
    eprint = { 
            https://doi.org/10.1137/19M1276261
    }
    }

Nomenclature, assumptions, limitations

The lattice_calibration module allows the user to infer relative abilities from state prices in a multi-entrant contest. At the racetrack, this would mean looking at the win odds and interpreting them as a relative ability.

The assumption made is that the performance distribution of one competitor is a translation of the performance distribution of another, and they are indepedent. The algorithm is:

  • Fast
  • Scalable ... to contests with hundreds of thousands of entrants.
  • General ... as noted it works for any performance distribution.

Here's a quick glossary to help with reading the code. It's a mix of financial and racetrack lingo.

  • State prices The expectation of an investment that has a payoff equal to 1 if there is only one winner, 1/2 if two are tied, 1/3 if three are tied and so forth. State prices are synomymous with winning probability, except for dead heats. However in the code a lattice is used so dead-heats must be accomodated and the distinction is important.

  • (Relative) ability refers to how much one performance distribution needs to be translated in order to match another. Implied abilities are vectors of relative abilities consistent with a collection of state prices.

  • Dividends are the inverse of state prices. This is Australian tote vernacular. Dividends are called 'decimal odds' in the UK and that's probably a better name. A dividend of 9.0 corresponds to a state price of 1/9.0, and a bookmaker quote of 8/1. Don't ask me to translate to American odds conventions because they are so utterly ridiculous!

The core algorithm is entirely ambivalent to the choice of performance distribution, and that certainly need not correspond to some analytic distribution with known properties. But normal and skew-normal are simple choices. See the basic examples

Performance

For smallish races, we are talking a few hundred milliseconds.

For large races we are talking 25 seconds for a 100,000 horse race.

Like parimutuels?

Support my open source work. Buy Microprediction: Building An Open AI Network published by MIT Press.