Skip to content

Downloads fetching incomplete files without reporting failure #377

@david-cortes-intel

Description

@david-cortes-intel

Some datasets, particularly this one:
https://openml.org/data/v1/download/16797613/CIFAR_10.arff

Have lately been yielding incomplete files when downloading them, whether through a web browser or through API using scikit-learn's fetch_openml.

For example, back a few months ago, this dataset was yielding a 600mb+ file like this:
Image

But if I download the same dataset through a web browser today, I get files of different sizes:
Image
Image

The download reports completing successfully on both web browsers and scikit-learn's fetch_openml, despite not fetching the full file.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions