WindsorML Dataset

The WindsorML dataset is a publicly available dataset in Hugging Face licensed as CC BY-SA 4.0 and distributed separately to MLSimKit. It is a collection of high-fidelity CFD simulations showing different geometric variants of a body for automotive aerodynamics modeling. Please see the Hugging Face dataset for additional details.

Downloading the training data (170G)

You can download the data required for the training tutorials using Python with the huggingface_hub package:

pip install huggingface_hub

Then use this Python code to download the dataset:

from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="neashton/windsorml",
    repo_type="dataset",
    local_dir="/path/to/dataset",
    allow_patterns=[
        "run_*/windsor_*.stl",
        "run_*/images/*",
        "run_*/boundary*.vtu",
        "run_*/force_mom_*.csv"
    ]
)

Replace /path/to/dataset to your own directory.

The Slices and KPI tutorials have steps to download only the necessary data for each. The entire dataset is ~8TB and includes high-resolution meshes not required for training.

Understanding the dataset

The dataset contains 355 cases, each representing a unique geometry variant. The cases are organized into separate folders named run_0 to run_354, one for each simulation case. The variability in geometry across the 355 cases leads to a diverse set of flow physics, making this dataset well-suited for machine learning model development.

Within each run folder is a standard set of files:

run_0/
├── boundary_0.vtu
├── force_mom_0.csv
├── force_mom_varref_0.csv
├── geo_parameters_0.csv
├── images
│   ├── pressureavg
│      ├── *.png
│   ├── rstress_xx
│      ├── *.png
│   ├── rstress_yy
│      ├── *.png
│   ├── rstress_zz
│      ├── *.png
│   ├── velocityxavg
│      ├── *.png
│   └── windsor_0.png
├── volume_0.vtu
├── windsor_0.stl
└── windsor_0.stp
  • windsor_<run #>.stl - The surface geometry definition in STL format

  • boundary_<run #>.vtu - Simulation results on the surface

  • volume_<run #>.vtu - Volumetric simulation outputs

  • force_mom_<run #>.csv - Time-averaged force and moment coefficients

  • force_mom_varref_<run #>.csv - Time-averaged force and moment coefficients using unique reference area per geometry

  • images/ - Folder containing slice images through the volume

  • windsor_<run #>.png - Image of the windsor body (see below)

Figure 1. An example Windsor Body

The slice images show simulation variables like pressure and velocity captured on 2D planes along the X, Y and Z axes. Multiple views are available.

Slice Images Views

The images folder within each case contains subfolders organizing the slice images by simulation output variable.

Within each variable folder (e.g. pressureavg, velocityxavg), there are multiple image sets showing different sliced views through the volume.

The main image sets are:

Z-Axis Slices

Filename pattern: view1_constz_*.png

Slices along the Z-axis, showing the XY-plane at different Z positions. Useful for visualizing the flow as it passes over the geometry from front to back.

Figure 2. An example Z-Axis slice

X-Axis Slices

Filename pattern: view2_constx_*.png

These are slices along the X-axis, showing the YZ-plane at different X positions.

Figure 3. An example X-Axis slice

Y-Axis Slices

Filename pattern: view3_consty_*.png

Slices along the Y-axis, showing the XZ-plane. Gives a top-down view at different heights.

Figure 4. An example Y-Axis slice