SIGNAL LABS

What is Surface Elo?

Separate Elo ratings for the same player by playing surface — hard, clay, or grass in tennis; turf or grass in soccer.

TL;DR
Surface Elo maintains independent ratings per playing surface for the same player or team, capturing real differences in performance across courts or fields.

Full explanation

A single Elo rating averages a player's results across all conditions. For chess this is fine — the board doesn't change. For tennis it isn't. Rafael Nadal on clay is a different player from Rafael Nadal on grass, and a single overall Elo will systematically misprice his matches at the French Open and Wimbledon. Surface Elo solves this by maintaining separate ratings per surface.

The mechanics are identical to standard Elo, except every match updates only the rating for the surface it was played on. A player's hard-court Elo updates after hard-court matches; the clay rating sits frozen until the next clay event. The result is three or four parallel ratings per player that capture surface-specific true talent.
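The surface-only update can be sketched in a few lines. This is a minimal illustration, not the implementation behind any particular model: the K-factor of 32, the starting rating of 1500, and the player names are all assumptions chosen for the example.

```python
from collections import defaultdict

K = 32          # assumed K-factor; real tennis models often vary K with matches played
START = 1500.0  # assumed starting rating for an unseen player

# ratings[surface][player] -> rating; unseen players default to START
ratings = defaultdict(lambda: defaultdict(lambda: START))

def expected_score(r_a, r_b):
    """Standard Elo logistic: probability that A beats B."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))

def update(surface, winner, loser):
    """Update only the ratings for the surface the match was played on."""
    table = ratings[surface]
    e_w = expected_score(table[winner], table[loser])
    delta = K * (1.0 - e_w)
    table[winner] += delta
    table[loser] -= delta

# A clay match moves the clay ratings and leaves every other surface untouched.
update("clay", "Player A", "Player B")
print(ratings["clay"]["Player A"])  # 1516.0
print(ratings["hard"]["Player A"])  # 1500.0 (never updated)
```

Keeping one table per surface makes the "frozen until the next event on that surface" behavior automatic: a rating that is never looked up for a given surface is never changed.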

Predictions use the surface of the upcoming match. The expected score function is unchanged — the logistic on the rating gap — but the gap is computed using the surface-specific ratings. For a clay-court major, the Nadal vs. Djokovic projection uses clay Elos; for Wimbledon, it uses grass.
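To illustrate how much the choice of surface can move a forecast, the sketch below runs the same matchup on two surfaces. The ratings and player names are hypothetical values invented for this example, not real player data.

```python
def expected_score(r_a, r_b):
    """Standard Elo logistic: probability that A beats B."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))

# Hypothetical surface ratings, for illustration only.
surface_elo = {
    "clay":  {"Player A": 2200.0, "Player B": 2100.0},
    "grass": {"Player A": 2000.0, "Player B": 2100.0},
}

def win_probability(surface, a, b):
    """Probability that `a` beats `b`, using ratings for the match surface."""
    table = surface_elo[surface]
    return expected_score(table[a], table[b])

print(round(win_probability("clay", "Player A", "Player B"), 3))   # 0.64
print(round(win_probability("grass", "Player A", "Player B"), 3))  # 0.36
```

With these made-up numbers, a 100-point edge on clay and a 100-point deficit on grass turn the same player from a 64% favorite into a 36% underdog. A single blended rating would sit somewhere in between and be wrong on both surfaces.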

Surface Elos have been the public standard in tennis modeling for over a decade. They correlate well with bookmaker prices and produce calibrated forecasts when tested against public match data such as tennis-data.co.uk and Jeff Sackmann's Tennis Abstract. The same idea generalizes to other surface-sensitive sports: golf has variants that condition on course type, and soccer has versions that separate turf from grass. The principle is always the same: when conditions reliably change outcomes, ratings should be conditional on those conditions.

Formula

Identical to standard Elo, but separate R values are maintained per surface. Updates only modify the surface-specific rating; predictions use the surface of the upcoming match.
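Written out, with a superscript s for the surface of the match, the standard Elo equations become the following. The base 10 and the 400-point scale are the conventional Elo choices; the K-factor is a tuning parameter that this page does not specify, so treat it as an assumption.

```latex
% Expected score for player A against player B on surface s
E_A^{(s)} = \frac{1}{1 + 10^{\left(R_B^{(s)} - R_A^{(s)}\right)/400}}

% Update after a match on surface s, where S_A = 1 if A won and 0 otherwise
R_A^{(s)} \leftarrow R_A^{(s)} + K \left(S_A - E_A^{(s)}\right)

% Ratings for every other surface s' \neq s are left unchanged
R_A^{(s')} \leftarrow R_A^{(s')}
```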

Why it matters in our model

Our tennis model uses surface Elos seeded from Jeff Sackmann's published files and updated nightly from tour results. Surface-specific ratings produce visibly better calibration on clay events than blended ratings.

Frequently asked

Why don't all sports use surface Elo?
Most sports don't have meaningful surface variance. Where they do — tennis, golf, soccer — surface-conditional ratings outperform single ratings.

How long does it take for surface Elo to stabilize?
Roughly 20-30 matches per surface for most players. Until then, surface-specific ratings should be regressed toward the player's overall Elo.

Where do surface Elos start?
Most public implementations seed surface Elos at the player's overall Elo and let them drift apart with match results.
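
The last two answers, seeding at the overall Elo and regressing thin surface ratings toward it, can be sketched together. The weighting scheme below is one simple possibility, not a description of any specific public implementation: the function names are hypothetical, and the `prior_matches` value of 25 is an assumption taken from the middle of the 20-30 match range above.

```python
def seed_surface_ratings(overall_elo, surfaces=("hard", "clay", "grass")):
    """Start every surface rating at the player's overall Elo."""
    return {surface: overall_elo for surface in surfaces}

def blended_rating(surface_elo, overall_elo, n_surface_matches, prior_matches=25):
    """Shrink a surface rating toward the overall rating when data is thin.

    The weight on the surface rating grows with the number of matches
    played on that surface; prior_matches (assumed) sets how quickly.
    """
    w = n_surface_matches / (n_surface_matches + prior_matches)
    return w * surface_elo + (1.0 - w) * overall_elo

ratings = seed_surface_ratings(1800.0)
print(ratings)                               # all three surfaces start at 1800.0
print(blended_rating(1900.0, 1800.0, 0))     # 1800.0: no surface data, use overall
print(blended_rating(1900.0, 1800.0, 25))    # 1850.0: halfway between the two
```

With zero matches on a surface the blend falls back entirely to the overall rating, and the surface rating takes over gradually as matches accumulate.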
