
Bundesliga 2015/16 match file appears to contain only 34 matches instead of full season #59

Description

@RamNair01

Hi StatsBomb team,

I noticed a possible issue with the open-data match metadata for the 2015/16 Bundesliga season.

The 2015/16 Big 5 release identifies Bundesliga as:

  • competition_id = 9
  • season_id = 27

However, when loading the current master version of:

data/matches/9/27.json

I only get 34 matches.

For context, the other 2015/16 Big 5 leagues return the expected match counts:

League          Matches
Premier League      380
La Liga             380
Serie A             380
Ligue 1             377
Bundesliga           34
Total              1551

I would have expected Bundesliga 2015/16 to contain 306 matches, giving a Big 5 total of 1,823 matches with available match metadata.
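The arithmetic behind these figures can be sketched as follows. The team count (18 clubs in the 2015/16 Bundesliga) and the double round-robin format are background assumptions rather than something read from the data, and the helper name `double_round_robin_matches` is purely illustrative. The observed counts are the ones listed above.

```python
def double_round_robin_matches(n_teams):
    """Each team hosts every other team once: n * (n - 1) fixtures."""
    return n_teams * (n_teams - 1)

# Match counts currently returned per league (taken from the list above).
observed = {
    "Premier League": 380,
    "La Liga": 380,
    "Serie A": 380,
    "Ligue 1": 377,
    "Bundesliga": 34,
}

# Assumption: 18 clubs played the 2015/16 Bundesliga season.
expected_bundesliga = double_round_robin_matches(18)

# Replace only the Bundesliga figure; the other leagues stay as observed.
corrected = {**observed, "Bundesliga": expected_bundesliga}

observed_total = sum(observed.values())
corrected_total = sum(corrected.values())

print(expected_bundesliga, observed_total, corrected_total)
```

The corrected total keeps Ligue 1 at the 377 matches the repository currently returns, which is why it is lower than a full schedule for all five leagues would be.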

Minimal reproduction:

import requests
import pandas as pd

BASE_URL = "https://raw.githubusercontent.com/statsbomb/open-data/master/data"

def get_match_data(competition_id, season_id):
    # Fetch the match metadata file for one competition/season pair.
    response = requests.get(
        f"{BASE_URL}/matches/{competition_id}/{season_id}.json",
        timeout=30,  # avoid hanging indefinitely on a stalled connection
    )
    response.raise_for_status()
    return pd.DataFrame(response.json())

# Bundesliga 2015/16: competition_id = 9, season_id = 27
bundesliga_1516 = get_match_data(9, 27)
print(len(bundesliga_1516))

Current output:

34

Expected output:

306
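To narrow down which part of the season is missing, a small summary of the returned records may help. The sketch below is a rough diagnostic, not a definitive check: it assumes each match record carries `match_week`, `home_team.home_team_name` and `away_team.away_team_name` fields, and the helper `summarise_matches` and the sample records are hypothetical.

```python
def summarise_matches(matches):
    """Count matches, distinct teams, and distinct match weeks in a list of match records."""
    teams = set()
    weeks = set()
    for match in matches:
        # Assumed field names; adjust if the file uses a different schema.
        teams.add(match["home_team"]["home_team_name"])
        teams.add(match["away_team"]["away_team_name"])
        weeks.add(match["match_week"])
    return {
        "matches": len(matches),
        "teams": len(teams),
        "match_weeks": sorted(weeks),
    }

# Hypothetical records, used only to show the expected input shape.
sample = [
    {
        "match_week": 1,
        "home_team": {"home_team_name": "Team A"},
        "away_team": {"away_team_name": "Team B"},
    },
    {
        "match_week": 2,
        "home_team": {"home_team_name": "Team B"},
        "away_team": {"away_team_name": "Team C"},
    },
]

summary = summarise_matches(sample)
print(summary)
```

With the real file, the raw JSON list (for example `response.json()` from the reproduction) would be passed in place of `sample`. Since 34 is also the number of league matches a single club plays in an 18-team double round-robin, it may be worth checking whether one team appears in every record.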

This may be related to a recent update to the open-data repository. I am using the data for academic work, so I wanted to flag it in case the full Bundesliga 2015/16 match file was accidentally overwritten or reduced.
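One way to check whether a recent update touched the file is to list the latest commits for that path through the GitHub REST API. This is a minimal sketch under assumptions: it uses the public "list commits" endpoint with its `path`, `sha` and `per_page` query parameters, makes unauthenticated requests (which are rate limited), and the helper names `commits_url` and `recent_commits` are illustrative only.

```python
import json
import urllib.request
from urllib.parse import urlencode

API_ROOT = "https://api.github.com/repos/statsbomb/open-data/commits"

def commits_url(path, branch="master", per_page=5):
    """Build the URL that lists recent commits touching `path` on `branch`."""
    query = urlencode({"path": path, "sha": branch, "per_page": per_page})
    return f"{API_ROOT}?{query}"

def recent_commits(path, branch="master", per_page=5):
    """Return (short sha, author date, first line of message) for recent commits."""
    with urllib.request.urlopen(commits_url(path, branch, per_page), timeout=30) as resp:
        payload = json.load(resp)
    return [
        (
            entry["sha"][:7],
            entry["commit"]["author"]["date"],
            entry["commit"]["message"].split("\n", 1)[0],
        )
        for entry in payload
    ]

# Building the URL needs no network access; recent_commits() performs the request.
url = commits_url("data/matches/9/27.json")
print(url)
```

Calling `recent_commits("data/matches/9/27.json")` would then show when the file was last changed, which could be compared against the date the match count dropped.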

Thanks for maintaining this dataset. It is extremely useful for research and football analytics.
