Measuring players’ importance in team sports to help coaches and staff with the aim of winning the game is gaining relevance, mainly because of the advent of new data and advanced technologies. In this paper we evaluate each player’s importance - for the first time in basketball - as his/her average marginal contribution to the utility of an ordered subset of players, through a generalized version of the Shapley value, where the value assumed by the generalized characteristic function of the generalized coalitional game is expressed in terms of the probability a certain lineup has to win the game. In turn, such probability is estimated by applying a logistic regression model in which the response is represented by the game outcome and the Dean’s factors are used as explanatory features. Then, we estimate the generalized Shapley values of the players, with associated bootstrap confidence intervals. A novelty, allowed by explicitly considering single lineups, is represented by the possibility of forming best lineups based on players’ estimated generalized Shapley values conditional on specific constraints, such as an injury or an “a-priori” coach’s decision. A comparison of our proposed approach with industry-standard counterparts shows a strong linear relation. We show the application of our proposed method to seventeen full NBA seasons (from 2004/2005 to 2020/21). We eventually estimate generalized Shapley values for Utah Jazz players and we show how our method is allowed to be used to form best lineups.

Measuring players' importance in basketball using the generalized Shapley value

Giorgio Gnecco
2023-01-01

Abstract

Measuring players’ importance in team sports to help coaches and staff with the aim of winning the game is gaining relevance, mainly because of the advent of new data and advanced technologies. In this paper we evaluate each player’s importance - for the first time in basketball - as his/her average marginal contribution to the utility of an ordered subset of players, through a generalized version of the Shapley value, where the value assumed by the generalized characteristic function of the generalized coalitional game is expressed in terms of the probability a certain lineup has to win the game. In turn, such probability is estimated by applying a logistic regression model in which the response is represented by the game outcome and the Dean’s factors are used as explanatory features. Then, we estimate the generalized Shapley values of the players, with associated bootstrap confidence intervals. A novelty, allowed by explicitly considering single lineups, is represented by the possibility of forming best lineups based on players’ estimated generalized Shapley values conditional on specific constraints, such as an injury or an “a-priori” coach’s decision. A comparison of our proposed approach with industry-standard counterparts shows a strong linear relation. We show the application of our proposed method to seventeen full NBA seasons (from 2004/2005 to 2020/21). We eventually estimate generalized Shapley values for Utah Jazz players and we show how our method is allowed to be used to form best lineups.
2023
Players’ performance, Sports analytics, Logistic regression, Cooperative game theory, National Basketball Association
File in questo prodotto:
File Dimensione Formato  
Metulini-Gnecco2022_Article_MeasuringPlayersImportanceInBa.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 1.21 MB
Formato Adobe PDF
1.21 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11771/21078
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
social impact