Bradley–Terry SBM • BTSBM: Bradley–Terry Stochastic Block Model

Overview

BTSBM implements Bayesian inference for the Bradley–Terry Stochastic Block Model (BT–SBM),
combining pairwise comparison data with latent block clustering of items.

The package provides:

⚙️ Gibbs-type MCMC samplers for posterior inference
🎯 Posterior relabeling, point estimates and uncertainty quantification
🌐 Visualization tools for clusters and block interactions

Applications include sports analytics, psychometrics, and ranking problems with hidden group structure.

Quick Links

For a complete overview of all functions, see the 👉 Function Reference
For a step-by-step example using tennis data, visit the 👉 Getting Started Vignette

When to Use BTSBM

Use this model whenever outcomes can be summarized as pairwise preferences (i beats j):

Domain	Example
⚽ Sports & games	Player `i` beats player `j`
🔍 Information retrieval	Document `i` preferred to `j`
🧠 Psychometrics	Stimulus `i` chosen over `j`
🧪 A/B testing	Variant `i` performs better than `j`

Only a binary outcome per pair (“i over j”) is required — possibly aggregated into counts.

From now on, we will adopt a sport-related jargon for simplicity, but this framework well adapts to the aforementioned contexts as well.

Required Inputs

To fit the model, you need one object:

`w`: pairwise interaction matrix

It is as a directed weighted/binary adjacency matrix.

Square matrix (n × n), where w[i, j] = number of times player i beats j
Diagonal must be 0 (w[i, i] = 0)
For each unordered pair {i,j}:
( n_{ij} = w_{ij} + w_{ji} )

Examples

(A) Aggregated counts

# items: A, B, C, D
# w[i, j] = number of times i beat j
w <- matrix(
c( 0, 3, 0, 2,
1, 0, 4, 0,
2, 0, 0, 1,
0, 1, 3, 0 ),
nrow = 4, byrow = TRUE,
dimnames = list(c("A","B","C","D"), c("A","B","C","D"))
)

(B) Binary outcomes

# one comparison per pair observed (0/1 wins)
# w[i, j] is binary
w_bin <- matrix(
c( 0, 1, 0, 1,
0, 0, 1, 0,
1, 0, 0, 0,
0, 1, 1, 0 ),
nrow = 4, byrow = TRUE,
dimnames = list(c("A","B","C","D"), c("A","B","C","D"))
)

⚠️ The matrix must not be symmetric!

Installation

Install the development version from GitHub:

# install.packages("devtools")
devtools::install_github("laposanti/BTSBM")
library(BTSBM)

Minimal Example

Let’s fit the BT–SBM with a Gnedin prior on 2017 ATP season data:

# choosing the 2017 season
w_ij <- ATP_2000_2022$`2017`$Y_ij

# fit the model
fit <- gibbs_bt_sbm(
w_ij,
a = 4,
prior = "GN",
n_iter = 500,
burnin = 250,
verbose = FALSE
)

# relabel output
post <- BTSBM::relabel_by_lambda(fit$x_samples, fit$lambda_samples)

# plot adjacency matrix
plot_block_adjacency(fit = post, w_ij = w_ij)

Reordered Adjacency matrix

Workflow at a Glance

Prepare input matrix w
Inspect prior using gnedin_K_mean() and gnedin_K_var()
Fit the model via gibbs_bt_sbm()
Relabel samples with relabel_by_lambda()

5.️ Visualize clusters using plot_block_adjacency()

Learn More

👉 Function Reference

👉 Getting Started Vignette

Citation

Santi, L., & Friel, N. (2025). The Bradley–Terry Stochastic Block Model. Working paper, University College Dublin.