4. VALIDATION OF SYNTHETIC GRIDS
4.4 Complex network analysis
To begin the complex network analysis of the real and synthetic datasets, a critical first decision is how the systems are viewed as a graph. This subtle choice may contribute significantly to the disagreements in literature over complex network properties. While the electrical circuit is a graph-based model, with circuit nodes (buses) connected by branch elements (edges), there are multiple ways to model the same system which may be electrically equivalent but are not topologically identical and thus may have different metric properties. It is common in planning cases, for example, to have a single substation bus for each voltage level, grouping all the substation connections together. But for operational cases, all elements might be modelled, with 20 times as many nodes and lots of low-impedance edges. If a branch
Metric EI Actual Systems WECC ERCOT 70K Synthetic Systems 20K 5000
π 36,187 9398 3827 34,999 9524 2941
πΜ 2.61 2.58 2.61 2.74 2.67 2.71
πΜ 0.044 0.058 0.032 0.048 0.034 0.031
βΜ 29.2 18.9 14.2 36.7 20.3 13.8
πΜ 0.083 0.21 0.40 0.11 0.22 0.50
Table 13. Summary of complex network properties. Each power flow dataset was pre- processed to identify the substations as a single vertices, connected by transmission lines. π is the number of substation vertices. The average vertex degree πΜ can be calculated as πΜ = 2π/π, where π is the number of transmission line edges. The Watts-Strogatz clustering coefficient πΜ is calculated by averaging, for each vertex, the fraction of possible connections between neighbors that actually exist [5]. Vertices with degree 1 are ignored for the purpose of calculating πΜ , since there are no possible interconnections between pairs of neighbors. The average shortest path length βΜ is the average number of hops between any two pairs of vertices. The average betweenness centrality quantifies what percentage of these shortest paths pass through the average vertex.
is divided into four segments, explicitly including elements such as breakers and switches, there will be many more edges with degree two, for example. Another important consideration is whether generators are modeled at the transmission bus or behind their own step-up transformer, which would add many radial vertices.
To minimize these concerns and get to the core of the power system structure, we consider each substation as an combined vertex, with edges being actual transmission lines that connect two substations in a single section. For the synthetic Figure 24. Degree distribution for real and synthetic grids. The graph shows the probability distribution function for the number of transmission line edges connecting to each substation vertex. Color indicates the size of the case; the solid lines are for actual grids and the dashed ones are for synthetic.
considered in building the synthetic grids, for the purpose of this dissertationβs analysis we focus only on the graph topology.
Summary metrics are given in Table 13, for the three actual system datasets and the corresponding synthetic datasets. The systemsβ substation vertices correspond to about two buses each on average. The average degree πΜ is in effect a design parameter, since it relates the number of lines to the number of substations. But the actual systems show remarkable consistency, within 1% of an average degree of 2.6 for all three North American grids. This value fits comfortably in the range reported by literature, and validates a design choice for the synthetic grids: how many transmission lines can be placed to meet the other objectives.
The degree distribution has been frequently discussed, and we concur with some that an exponential distribution fits the data well, as shown in Figure 24, with the exception that there are fewer vertices with degree one (radial substations) than an exact exponential distribution would predict. While some have claimed that there is a similar scarcity of degree two vertices, these results show degree two vertices to be the most common kind. The results here confirm prior analysis that rejects the scale-free model for actual power grids, and show the similarity of synthetic grid data to the data for the actual grid.
The Watt-Strogatz small-world model focuses on the combination of the clustering coefficient and the shortest path length metrics. As Table 13 shows, the average clustering coefficients for all six systems are in the same neighborhood of 0.03 to 0.06. It is unclear the exact usefulness of this metric for power systems: for most
nodes the clustering coefficient is zero, with the exact magnitude depending largely on the few nodes whose neighbors are interconnected.
Shortest path length does scale with system size, but sub-linearly. The average shortest path length as shown in Table 13 remains very low even for the large EI case, qualitatively fitting the small-world idea. Figure 25 shows the distribution of shortest path length for nodes in the real and synthetic systems. The 5000 bus Texas model seems to match the distribution for ERCOT almost exactly, and both the other synthetic cases have slightly larger means than the corresponding real case. But the overall shape of all three distributions is similar. These differences in mean can be Figure 25. Distribution of substation average shortest path to other substations. The plots show the probability distribution function of the average shortest path between substation vertices, traveling along the combined substation graph. Color indicates the size of the case; the solid lines are for actual grids and the dashed ones are synthetic.
to the 70K synthetic case, the EI case has part of Canada modeled and has several areas with only the highest voltage grids modeled.
Betweenness centrality measures how central each node is by the shortest paths passing through it. There are π2 possible combinations of nodes, and a nodeβs
betweenness centrality measures the percentage of these paths it includes. The distribution of this parameter shows the fraction of nodes that have various levels of centrality, as shown in Figure 26. These log-log plots show significant similarity between the synthetic and actual cases of each size. Power systems tend to have a small Figure 26. Distribution of betweenness centrality. The plots show the probability distribution function of the betweenness centrality metric. This metric is defined by the percentage of the shortest paths between each pair of nodes which passes through a given node, traveling along the combined substation graph. Color indicates the size of the case; the solid lines are for actual grids and the dashed ones are synthetic.
number of substations which are very central topologically, while many nodes have much lower betweenness.
The analysis of this subsection shows that the complex network properties of degree distribution, cluster coefficient, average shortest path length, and betweenness centrality indicate strong similarities topologically between the synthetic 5000, 20K, and 70K cases and the actual cases with which they share a size and geographic footprint.