TrueSkill

Rating system supporting games with more than 2 players

TrueSkill is a skill-based ranking system developed by Microsoft for use with video game matchmaking on the Xbox network. Unlike the popular Elo rating system, which was initially designed for chess, TrueSkill is designed to support games with more than two players.[1][2] In 2018, Microsoft published details about an extended version of TrueSkill, named TrueSkill2. [3]

Calculation

A player's skill is represented as a normal distribution N {\displaystyle {\mathcal {N}}} characterized by a mean value of μ {\displaystyle \mu } (mu, representing perceived skill) and a variance of σ {\displaystyle \sigma } (sigma, representing how "unconfident" the system is in the player's μ {\displaystyle \mu } value).[1][2] As such N ( x ) {\displaystyle {\mathcal {N}}(x)} can be interpreted as the probability that the player's "true" skill is x {\displaystyle x} .[1][2]

On Xbox Live, players start with μ = 25 {\displaystyle \mu =25} and σ = 25 / 3 {\displaystyle \sigma =25/3} ; μ {\displaystyle \mu } always increases after a win and always decreases after a loss. The extent of actual updates depends on each player's σ {\displaystyle \sigma } and on how "surprising" the outcome is to the system. Unbalanced games, for example, result in either negligible updates when the favorite wins, or huge updates when the favorite loses surprisingly.

Factor graphs and expectation propagation via moment matching are used to compute the message passing equations which in turn compute the skills for the players.[1][2]

Player ranks are displayed as the conservative estimate of their skill, R = μ 3 × σ {\displaystyle R=\mu -3\times \sigma } . This is conservative, because the system is 99% sure that the player's skill is actually higher than what is displayed as their rank.

The system can be used with arbitrary scales, but Microsoft uses a scale from 0 to 50 for Xbox Live. Hence, players start with a rank of R = 25 3 25 3 = 0 {\displaystyle R=25-3\cdot {\frac {25}{3}}=0} . This means that a new player's defeat results in a large sigma loss, which partially or completely compensates their mu loss. This explains why people may gain ranks from losses.

Use in other projects

TrueSkill is patented,[4] and the name is trademarked,[5] so it is limited to Microsoft projects and commercial projects that obtain a license to use the algorithm.

See also

  • Software patents

References

  1. ^ a b c d Murphy, Kevin (2012). Machine Learning: A Probabilistic Perspective. MIT Press. ISBN 978-0262018029.
  2. ^ a b c d Herbrich, Ralf; Minka, Tom; Graepel, Thore (2007), Schölkopf, B.; Platt, J. C.; Hoffman, T. (eds.), "TrueSkill : A Bayesian Skill Rating System" (PDF), Advances in Neural Information Processing Systems 19, MIT Press, pp. 569–576, retrieved 2018-10-11
  3. ^ Minka, Tom; Cleven, Ryan; Zaykov, Yordan (2018-03-22). "TrueSkill 2: An improved Bayesian skill rating system". {{cite journal}}: Cite journal requires |journal= (help)
  4. ^ "United States Patent Application 20090227313: Determining Relative Skills of Players". USPTO. Retrieved 2014-02-16.
  5. ^ "Trademark Electronic Search System (TESS)". tmsearch.uspto.gov. Retrieved 2020-01-16.

External links

  • Microsoft Research's TrueSkill homepage
  • Microsoft Research's TrueSkill paper
  • In-depth explanation of the mathematical background
  • v
  • t
  • e
Sports rating systems
Concepts
  • Home advantage
  • Sabermetrics
  • Strength of schedule
  • Win probability
Methods and computer models
Elo family
Polls and opinionPeople
  • v
  • t
  • e
Main
projects
Languages, compilers
Distributedgrid computing
Internet, networking
Other projects
Operating systems
  • Barrelfish
  • HomeOS
  • Midori
  • Singularity
  • Verve
APIs
Launched as products
MSR Labs
applied
research
Live Labs
Current
Discontinued
FUSE Labs
Other labs
Category