Glossary#

run_start#

Forecasts with run times at or after this datetime are queried.

run_end#

Forecasts with run times before or at this datetime are queried.

run time#

run times#

run_time#

The time at which a forecast is nominally run.

P5MIN: Every 5 minutes beginning on the hour
PREDISPATCH/PDPASA: Every 30 minutes beginning on the hour
STPASA: On the hour, either every hour or every two hours
- Frequency of runs was increased in 2021
MTPASA: Run every week on Tuesdays, datetime of run will vary

forecasted_start#

Forecasts pertaining to times at or after this datetime are retained.

forecasted_end#

Forecasts pertaining to times before or at this datetime are retained.

forecasted time#

forecasted times#

forecasted_time#

The time to which a forecast’s outputs pertain.

forecast type#

forecast types#

The term used in nemseer to refer to AEMO’s ahead processes (as outlined in pre-dispatch and PASA).

actual run time#

The actual time at which the forecast run is executed/published. This is often reported in the LASTCHANGED column.

MTPASA#

STPASA#

PDPASA#

PASA#

Projected Assessment of System Adequacy. PASA processes are focused on assessing reliability/resource adequacy.

PDPASA is run with a similar frequency and horizon to (30-minute) pre-dispatch, and STPASA is run at least every two hours (it is currently run every hour) following the horizon covered by PDPASA. Both attempt to maximise generation reserves available to the system given forecasts for demand and variable renewable energy generation, a simplified set of forecasted network constraints and participant-submitted resource availabilities and energy constraints. Together, they assess reliability for the next 7 trading days[1]. There are 3 run types for PDPASA and STPASA (N.B. the descriptions below are based on an interpretation of AEMO documentation as it does not explicitly describe the run types that appear in PDPASA/STPASA tables):
1. RELIABILITY_LRC likely corresponds to the Capacity Adequacy (CA) run. In the CA run, 10% probability of exceedance regional demand traces are used to calculate the surplus reserve concurrently available to each region. This is then used to assess whether a Low Reserve Condition (LRC) may arise[2].
2. OUTAGE_LRC. Whilst RELIABILITY_LRC models full interconnector availability, OUTAGE_LRC is believed to model interconnector outages.
3. LOR. This almost certainly corresponds to the Maximum Surplus Capacity (MSC) run. In the MSC run, 50% probability of exceedance regional demand traces are used to calculate the maximum spare capacity available in each region[2]. Lack of Reserve conditions (LOR) are then determined based on the regional spare capacity available relative to 3 thresholds (LOR1 being the least severe and LOR3 being the most), each of which is set by one or more generation contingencies or the Forecast Uncertainty Measure[3].
Along with pre-dispatch processes, PASA processes are used to identify LOR conditions. In the event of projected supply scarcity (i.e. forecasted LOR2 or LOR3), AEMO will estimate a latest time to intervene. If AEMO deems the market response to be insufficient by this time, it can exercise the Reliability and Emergency Reserve Trader (RERT), issue directions or issue instructions (i.e. instruct network service providers to commence load shedding)[4].
Using participant-submitted resource availabilities, forecasted network constraints and resource short-run marginal costs (SRMC), MTPASA outputs are reported for each day following aggregation of the results of a market simulation run at half-hourly resolution. These outputs consist of system reliability forecasts (i.e. reporting unserved energy from the Reliability Run and loss of load probability from the Loss of Load Probability Run) that extend out for the next 24 months:
1. In the Reliability Run, forecast uncertainty is addressed by using a range of reference weather years (at least 8). For each of these reference weather years, AEMO uses at least two percentiles (i.e. probabilities of exceedence) of demand traces that are based on historical data and adjusted for future trends (e.g. accounting for growth in energy consumption or rooftop solar photovoltaics), as well as historically observed generation traces for wind and solar. Then, each of these trace combinations are run using at least 100 random forced outage patterns[5].
2. In the Loss of Load Probability Run, traces for “abstract” days are constructed based on monthly high demand and low variable renewable energy generation conditions observed over the different reference weather years. The Loss of Load Probability Run is used to determine the days that have a higher risk of load shedding[5].

If the expected annual unserved energy, averaged across simulations in the Reliability Run, exceeds the maximum level specified by the reliability standard, a Low Reserve Condition (LRC) is identified. In response to LRCs, AEMO can direct generators to reschedule outages or contract for longer notice RERT[4].

PD#

5MPD#

PREDISPATCH#

P5MIN#

5-minute pre-dispatch#

pre-dispatch#

Pre-dispatch processes consists of (30-minute) pre-dispatch (PREDISPATCH) and 5-minute pre-dispatch (5MPD or P5MIN). To add to any confusion, when people or documents refer to “pre-dispatch”, they are often referring to PREDISPATCH. The use of submitted participant offers distinguishes pre-dispatch processes from PASA processes. These are used alongside forecasts for constraints, demand and variable renewable energy generation to forecast dispatch conditions and regional prices for energy and FCAS. Along with PDPASA and STPASA, pre-dispatch processes are used to identify Lack of Reserve (LOR) conditions. If AEMO deems the market response to be insufficient by this time, it can exercise the Reliability and Emergency Reserve Trader (RERT), issue directions or issue instructions (i.e. instruct network service providers to commence load shedding)[4].

PREDISPATCH forecasts are generated every half hour at half-hourly resolution until the end of the last trading day for which bid band price submission has closed (this occurs at 1230 EST)[6].
P5MIN is run for every dispatch interval for the next hour.
For both P5MIN and PREDISPATCH, the impact of demand forecast error on regional energy prices and interconnector flows are explored through a sensitivity analysis[7]. Only sensitivites for PREDISPATCH are available via the MMSDM Historical Data SQLLOader.

market day#

trading day#

From 0400 (exclusive) to 0400 (inclusive) on the next day (i.e. (0400 Day 1, 0400 Day 2]).

MMSDM Historical Data SQLLOader#

SQLLoader#

An archive of historical market data used by nemseer for historical forecast data queries. Data is organised by year and month. The month corresponds to the month in which the forecast was run (i.e. if the month lies between run_start and run_end).

Within each month, there exists a directory for most of the data queried by nemseer (DATA), including pre-dispatch data with the most recent forecast run, and directories for “complete” pre-dispatch and 5-minute pre-dispatch data (PREDISP_ALL_DATA and P5MIN_ALL_DATA, respectively).
- For pre-dispatch, the complete directory contains data with all forecast runs pertaining to a particular time.
- As the data in the complete P5MIN directory appears to be the same as that in DATA, nemseer does not use this directory.

raw_cache#

Directory to which nemseer downloads cleaned raw data. Cleaning by nemseer includes:

Removing file metadata from the start and end of the file
Parsing datetimes, including parsing PREDISPATCHSEQNO (in :term:PREDISPATCH tables) into a new datetime column
Caching raw data in a parquet format, which enables column-based queries and uses less disk space than CSV An invalid/corrupted files list (.invalid_aemo_files.txt) is also maintained in this directory if an invalid/corrupted zip is queried via nemseer. This prevents nemseer from downloading/compiling invalid/corrupted data from AEMO’s database.

processed_cache#

Directory to which nemseer queries can be saved to (if provided to the relevant functions). The data for each table in a query (which is described by a set of run times and forecasted times) is either saved to a parquet file if the user specifies they want pandas DataFrame structures, or a netCDF file if the user specifies they want xarray Dataset structures. Each of these files is also saved with query metadata, which nemseer will check during subsequent queries. If the query metadata of any files in the processed cache corresponds to that of the current query, data will be loaded from that file in the processed cache. Note that as the metadata is saved within the file, files in the processed cache can be renamed without affecting this functionality (so long as file extensions are preserved).