Glossary#
run_start#Forecasts with run times at or after this datetime are queried.
run_end#Forecasts with run times before or at this datetime are queried.
run time#run times#run_time#The time at which a forecast is nominally run.
P5MIN: Every 5 minutes beginning on the hour
PREDISPATCH/PDPASA: Every 30 minutes beginning on the hour
STPASA: On the hour, either every hour or every two hours
Frequency of runs was increased in 2021
MTPASA: Run every week on Tuesdays, datetime of run will vary
forecasted_start#Forecasts pertaining to times at or after this datetime are retained.
forecasted_end#Forecasts pertaining to times before or at this datetime are retained.
forecasted time#forecasted times#forecasted_time#The time to which a forecast’s outputs pertain.
forecast type#forecast types#The term used in
nemseerto refer to AEMO’s ahead processes (as outlined in pre-dispatch and PASA).actual run time#The actual time at which the forecast run is executed/published. This is often reported in the
LASTCHANGEDcolumn.MTPASA#STPASA#PDPASA#PASA#Projected Assessment of System Adequacy. PASA processes are focused on assessing reliability/resource adequacy.
PDPASAis run with a similar frequency and horizon to (30-minute) pre-dispatch, andSTPASAis run at least every two hours (it is currently run every hour) following the horizon covered byPDPASA. Both attempt to maximise generation reserves available to the system given forecasts for demand and variable renewable energy generation, a simplified set of forecasted network constraints and participant-submitted resource availabilities and energy constraints. Together, they assess reliability for the next 7 trading days[1]. There are 3 run types forPDPASAandSTPASA(N.B. the descriptions below are based on an interpretation of AEMO documentation as it does not explicitly describe the run types that appear inPDPASA/STPASAtables):RELIABILITY_LRClikely corresponds to the Capacity Adequacy (CA) run. In the CA run, 10% probability of exceedance regional demand traces are used to calculate the surplus reserve concurrently available to each region. This is then used to assess whether a Low Reserve Condition (LRC) may arise[2].OUTAGE_LRC. WhilstRELIABILITY_LRCmodels full interconnector availability,OUTAGE_LRCis believed to model interconnector outages.LOR. This almost certainly corresponds to the Maximum Surplus Capacity (MSC) run. In the MSC run, 50% probability of exceedance regional demand traces are used to calculate the maximum spare capacity available in each region[2]. Lack of Reserve conditions (LOR) are then determined based on the regional spare capacity available relative to 3 thresholds (LOR1 being the least severe and LOR3 being the most), each of which is set by one or more generation contingencies or the Forecast Uncertainty Measure[3].
Along with pre-dispatch processes, PASA processes are used to identify LOR conditions. In the event of projected supply scarcity (i.e. forecasted LOR2 or LOR3), AEMO will estimate a latest time to intervene. If AEMO deems the market response to be insufficient by this time, it can exercise the Reliability and Emergency Reserve Trader (RERT), issue directions or issue instructions (i.e. instruct network service providers to commence load shedding)[4].
Using participant-submitted resource availabilities, forecasted network constraints and resource short-run marginal costs (SRMC),
MTPASAoutputs are reported for each day following aggregation of the results of a market simulation run at half-hourly resolution. These outputs consist of system reliability forecasts (i.e. reporting unserved energy from the Reliability Run and loss of load probability from the Loss of Load Probability Run) that extend out for the next 24 months:In the Reliability Run, forecast uncertainty is addressed by using a range of reference weather years (at least 8). For each of these reference weather years, AEMO uses at least two percentiles (i.e. probabilities of exceedence) of demand traces that are based on historical data and adjusted for future trends (e.g. accounting for growth in energy consumption or rooftop solar photovoltaics), as well as historically observed generation traces for wind and solar. Then, each of these trace combinations are run using at least 100 random forced outage patterns[5].
In the Loss of Load Probability Run, traces for “abstract” days are constructed based on monthly high demand and low variable renewable energy generation conditions observed over the different reference weather years. The Loss of Load Probability Run is used to determine the days that have a higher risk of load shedding[5].
If the expected annual unserved energy, averaged across simulations in the Reliability Run, exceeds the maximum level specified by the reliability standard, a Low Reserve Condition (LRC) is identified. In response to LRCs, AEMO can direct generators to reschedule outages or contract for longer notice RERT[4].
PD#5MPD#PREDISPATCH#P5MIN#5-minute pre-dispatch#pre-dispatch#Pre-dispatch processes consists of (30-minute) pre-dispatch (
PREDISPATCH) and 5-minute pre-dispatch (5MPD orP5MIN). To add to any confusion, when people or documents refer to “pre-dispatch”, they are often referring toPREDISPATCH. The use of submitted participant offers distinguishes pre-dispatch processes from PASA processes. These are used alongside forecasts for constraints, demand and variable renewable energy generation to forecast dispatch conditions and regional prices for energy and FCAS. Along with PDPASA and STPASA, pre-dispatch processes are used to identify Lack of Reserve (LOR) conditions. If AEMO deems the market response to be insufficient by this time, it can exercise the Reliability and Emergency Reserve Trader (RERT), issue directions or issue instructions (i.e. instruct network service providers to commence load shedding)[4].PREDISPATCHforecasts are generated every half hour at half-hourly resolution until the end of the last trading day for which bid band price submission has closed (this occurs at 1230 EST)[6].P5MINis run for every dispatch interval for the next hour.For both
P5MINandPREDISPATCH, the impact of demand forecast error on regional energy prices and interconnector flows are explored through a sensitivity analysis[7]. Only sensitivites forPREDISPATCHare available via the MMSDM Historical Data SQLLOader.
market day#trading day#From 0400 (exclusive) to 0400 (inclusive) on the next day (i.e.
(0400 Day 1, 0400 Day 2]).MMSDM Historical Data SQLLOader#SQLLoader#An archive of historical market data used by
nemseerfor historical forecast data queries. Data is organised by year and month. The month corresponds to the month in which the forecast was run (i.e. if the month lies between run_start and run_end).Within each month, there exists a directory for most of the data queried by
nemseer(DATA), including pre-dispatch data with the most recent forecast run, and directories for “complete” pre-dispatch and 5-minute pre-dispatch data (PREDISP_ALL_DATAandP5MIN_ALL_DATA, respectively).For pre-dispatch, the complete directory contains data with all forecast runs pertaining to a particular time.
As the data in the complete
P5MINdirectory appears to be the same as that inDATA,nemseerdoes not use this directory.
raw_cache#Directory to which
nemseerdownloads cleaned raw data. Cleaning bynemseerincludes:Removing file metadata from the start and end of the file
Parsing datetimes, including parsing
PREDISPATCHSEQNO(in :term:PREDISPATCHtables) into a new datetime columnCaching raw data in a parquet format, which enables column-based queries and uses less disk space than CSV An invalid/corrupted files list (
.invalid_aemo_files.txt) is also maintained in this directory if an invalid/corrupted zip is queried vianemseer. This preventsnemseerfrom downloading/compiling invalid/corrupted data from AEMO’s database.
processed_cache#Directory to which
nemseerqueries can be saved to (if provided to the relevant functions). The data for each table in a query (which is described by a set of run times and forecasted times) is either saved to a parquet file if the user specifies they want pandas DataFrame structures, or a netCDF file if the user specifies they want xarray Dataset structures. Each of these files is also saved with query metadata, whichnemseerwill check during subsequent queries. If the query metadata of any files in the processed cache corresponds to that of the current query, data will be loaded from that file in the processed cache. Note that as the metadata is saved within the file, files in the processed cache can be renamed without affecting this functionality (so long as file extensions are preserved).