Inventory Overview

This page visualises metadata about the data files stored at https://data.wa.aemo.com.au/public/

Data is gathered every 6 hours, though no listed data files are downloaded in the process.

Number of files modified by day

The following gives an indication of last time of update for all files found; you can filter the starting year using the dropdown box below.

Loading...

Most Common File Types

Looking at file extensions simply by type alone can be misleading. In the following bar chart, CSV file types appear dominant; in reality, a small number of services regularly update (and seldom archive) CSV files.

Loading...

However, if we review the file sizes in aggregate, it becomes clear the majority of file size is attributable to zip files.

Typically, AEMO archives 1 day of JSON files for selected data models, each day.

Loading...

How many files and directories are there?

See how many directories and files are being updated in the pre-canned periods


Total Directories

171

Total Files

75,087.0

Total Size (Gb)

431

Directories Active in Period

Loading...

Files Active in Period

Loading...

Which files are updated repeatedly?

It is useful to know which files are updating most regularly. This is calculated by identifying files that:

  • Are listed in every inventory run
  • Have been updated within a threshold of the inventory run commencement time

This manages to avoid including files that are recently updated on an inventory run but are later archived into a daily zip file.

Click on any of the table entries above to download the listed file.

File Growth

The following Bubble Chart shows the periods of high file size drops.

The most recent time periods (to the right of the graph) will always show large bubbles because of the temporary presence (typically 24 hrs) of very large JSON files per dispatch interval.

Loading...

File Size Distribution

To get an idea of where file volume is hierarchically, a Sankey Diagram is useful to descend between paths with a visual indicator as to their aggregate size volume.

Below you can see that the recent WEM Dispatch Engine covers a large proportion of the overall data.

Loading...

File Count Distribution

Let's repeat the same process but for number of files, instead of storage volume

Loading...

This space intentionally left blank