Dataset statistics
| Number of variables | 37 |
|---|---|
| Number of observations | 5000 |
| Missing cells | 29983 |
| Missing cells (%) | 16.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.4 MiB |
| Average record size in memory | 296.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 26 |
| Unsupported | 2 |
nameplate-capacity-mw-units has constant value "MW" | Constant |
net-summer-capacity-mw-units has constant value "MW" | Constant |
net-winter-capacity-mw-units has constant value "MW" | Constant |
planned-derate-summer-cap-mw-units has constant value "MW" | Constant |
planned-uprate-summer-cap-mw-units has constant value "MW" | Constant |
stateid has a high cardinality: 51 distinct values | High cardinality |
stateName has a high cardinality: 51 distinct values | High cardinality |
entityName has a high cardinality: 985 distinct values | High cardinality |
plantName has a high cardinality: 2011 distinct values | High cardinality |
generatorid has a high cardinality: 1541 distinct values | High cardinality |
unit has a high cardinality: 80 distinct values | High cardinality |
balancing-authority-name has a high cardinality: 52 distinct values | High cardinality |
operating-year-month has a high cardinality: 915 distinct values | High cardinality |
county has a high cardinality: 714 distinct values | High cardinality |
entityid is highly correlated with Unnamed: 0 and 17 other fields | High correlation |
plantid is highly correlated with stateid and 14 other fields | High correlation |
nameplate-capacity-mw is highly correlated with unit and 9 other fields | High correlation |
net-summer-capacity-mw is highly correlated with unit and 9 other fields | High correlation |
net-winter-capacity-mw is highly correlated with unit and 9 other fields | High correlation |
planned-uprate-summer-cap-mw is highly correlated with Unnamed: 0 and 17 other fields | High correlation |
planned-uprate-year-month is highly correlated with period and 17 other fields | High correlation |
energy_source_code is highly correlated with stateid and 21 other fields | High correlation |
planned-derate-summer-cap-mw-units is highly correlated with planned-uprate-year-month and 19 other fields | High correlation |
stateid is highly correlated with Unnamed: 0 and 18 other fields | High correlation |
stateName is highly correlated with Unnamed: 0 and 18 other fields | High correlation |
technology is highly correlated with period and 21 other fields | High correlation |
sector is highly correlated with stateid and 13 other fields | High correlation |
balancing-authority-name is highly correlated with Unnamed: 0 and 18 other fields | High correlation |
sectorName is highly correlated with stateid and 13 other fields | High correlation |
statusDescription is highly correlated with technology and 4 other fields | High correlation |
net-summer-capacity-mw-units is highly correlated with planned-uprate-year-month and 19 other fields | High correlation |
prime_mover_code is highly correlated with stateid and 17 other fields | High correlation |
planned-retirement-year-month is highly correlated with Unnamed: 0 and 20 other fields | High correlation |
period is highly correlated with Unnamed: 0 and 11 other fields | High correlation |
unit is highly correlated with Unnamed: 0 and 18 other fields | High correlation |
nameplate-capacity-mw-units is highly correlated with planned-uprate-year-month and 19 other fields | High correlation |
energy-source-desc is highly correlated with stateid and 21 other fields | High correlation |
planned-uprate-summer-cap-mw-units is highly correlated with planned-uprate-year-month and 19 other fields | High correlation |
net-winter-capacity-mw-units is highly correlated with planned-uprate-year-month and 19 other fields | High correlation |
status is highly correlated with technology and 4 other fields | High correlation |
balancing_authority_code is highly correlated with Unnamed: 0 and 18 other fields | High correlation |
Unnamed: 0 is highly correlated with period and 8 other fields | High correlation |
longitude is highly correlated with period and 11 other fields | High correlation |
latitude is highly correlated with stateid and 11 other fields | High correlation |
unit has 4542 (90.8%) missing values | Missing |
balancing_authority_code has 285 (5.7%) missing values | Missing |
balancing-authority-name has 277 (5.5%) missing values | Missing |
planned-retirement-year-month has 4882 (97.6%) missing values | Missing |
planned-derate-year-month has 5000 (100.0%) missing values | Missing |
planned-derate-summer-cap-mw has 5000 (100.0%) missing values | Missing |
planned-uprate-year-month has 4974 (99.5%) missing values | Missing |
planned-uprate-summer-cap-mw has 4974 (99.5%) missing values | Missing |
Unnamed: 0 is uniformly distributed | Uniform |
planned-uprate-year-month is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
planned-derate-year-month is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
planned-derate-summer-cap-mw is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2022-11-17 22:35:04.700309 |
|---|---|
| Analysis finished | 2022-11-17 22:35:24.750036 |
| Duration | 20.05 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 5000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2499.5 |
| Minimum | 0 |
|---|---|
| Maximum | 4999 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 249.95 |
| Q1 | 1249.75 |
| median | 2499.5 |
| Q3 | 3749.25 |
| 95-th percentile | 4749.05 |
| Maximum | 4999 |
| Range | 4999 |
| Interquartile range (IQR) | 2499.5 |
Descriptive statistics
| Standard deviation | 1443.520003 |
|---|---|
| Coefficient of variation (CV) | 0.577523506 |
| Kurtosis | -1.2 |
| Mean | 2499.5 |
| Median Absolute Deviation (MAD) | 1250 |
| Skewness | 0 |
| Sum | 12497500 |
| Variance | 2083750 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 3330 | 1 | < 0.1% |
| 3337 | 1 | < 0.1% |
| 3336 | 1 | < 0.1% |
| 3335 | 1 | < 0.1% |
| 3334 | 1 | < 0.1% |
| 3333 | 1 | < 0.1% |
| 3332 | 1 | < 0.1% |
| 3331 | 1 | < 0.1% |
| 3329 | 1 | < 0.1% |
| Other values (4990) | 4990 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 4999 | 1 | |
| 4998 | 1 | |
| 4997 | 1 | |
| 4996 | 1 | |
| 4995 | 1 | |
| 4994 | 1 | |
| 4993 | 1 | |
| 4992 | 1 | |
| 4991 | 1 | |
| 4990 | 1 |
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| 2017-02 | |
|---|---|
| 2017-03 | |
| 2017-09 | |
| 2017-01 | |
| 2020-06 | |
| Other values (13) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 35000 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2020-09 |
|---|---|
| 2nd row | 2020-09 |
| 3rd row | 2020-09 |
| 4th row | 2020-09 |
| 5th row | 2020-09 |
Common Values
| Value | Count | Frequency (%) |
| 2017-02 | 1791 | |
| 2017-03 | 981 | |
| 2017-09 | 429 | 8.6% |
| 2017-01 | 388 | 7.8% |
| 2020-06 | 369 | 7.4% |
| 2020-03 | 233 | 4.7% |
| 2020-05 | 196 | 3.9% |
| 2020-09 | 179 | 3.6% |
| 2020-04 | 154 | 3.1% |
| 2020-11 | 112 | 2.2% |
| Other values (8) | 168 | 3.4% |
Length
| Value | Count | Frequency (%) |
| 2017-02 | 1791 | |
| 2017-03 | 981 | |
| 2017-09 | 429 | 8.6% |
| 2017-01 | 388 | 7.8% |
| 2020-06 | 369 | 7.4% |
| 2020-03 | 233 | 4.7% |
| 2020-05 | 196 | 3.9% |
| 2020-09 | 179 | 3.6% |
| 2020-04 | 154 | 3.1% |
| 2020-11 | 112 | 2.2% |
| Other values (8) | 168 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 11165 | |
| 2 | 8097 | |
| - | 5000 | |
| 1 | 4366 | 12.5% |
| 7 | 3664 | 10.5% |
| 3 | 1243 | 3.6% |
| 9 | 634 | 1.8% |
| 6 | 442 | 1.3% |
| 5 | 196 | 0.6% |
| 4 | 154 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 30000 | |
| Dash Punctuation | 5000 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11165 | |
| 2 | 8097 | |
| 1 | 4366 | 14.6% |
| 7 | 3664 | 12.2% |
| 3 | 1243 | 4.1% |
| 9 | 634 | 2.1% |
| 6 | 442 | 1.5% |
| 5 | 196 | 0.7% |
| 4 | 154 | 0.5% |
| 8 | 39 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 35000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 11165 | |
| 2 | 8097 | |
| - | 5000 | |
| 1 | 4366 | 12.5% |
| 7 | 3664 | 10.5% |
| 3 | 1243 | 3.6% |
| 9 | 634 | 1.8% |
| 6 | 442 | 1.3% |
| 5 | 196 | 0.6% |
| 4 | 154 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 11165 | |
| 2 | 8097 | |
| - | 5000 | |
| 1 | 4366 | 12.5% |
| 7 | 3664 | 10.5% |
| 3 | 1243 | 3.6% |
| 9 | 634 | 1.8% |
| 6 | 442 | 1.3% |
| 5 | 196 | 0.6% |
| 4 | 154 | 0.4% |
| Distinct | 51 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| CA | |
|---|---|
| NY | |
| TX | 266 |
| NC | 225 |
| AK | 221 |
| Other values (46) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | TX |
|---|---|
| 2nd row | TX |
| 3rd row | TX |
| 4th row | TX |
| 5th row | NJ |
Common Values
| Value | Count | Frequency (%) |
| CA | 741 | 14.8% |
| NY | 336 | 6.7% |
| TX | 266 | 5.3% |
| NC | 225 | 4.5% |
| AK | 221 | 4.4% |
| MN | 204 | 4.1% |
| MI | 196 | 3.9% |
| SC | 180 | 3.6% |
| KS | 150 | 3.0% |
| VA | 139 | 2.8% |
| Other values (41) | 2342 |
Length
| Value | Count | Frequency (%) |
| ca | 741 | 14.8% |
| ny | 336 | 6.7% |
| tx | 266 | 5.3% |
| nc | 225 | 4.5% |
| ak | 221 | 4.4% |
| mn | 204 | 4.1% |
| mi | 196 | 3.9% |
| sc | 180 | 3.6% |
| ks | 150 | 3.0% |
| va | 139 | 2.8% |
| Other values (41) | 2342 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1894 | |
| C | 1238 | |
| N | 1168 | |
| M | 792 | 7.9% |
| I | 680 | 6.8% |
| T | 529 | 5.3% |
| K | 447 | 4.5% |
| S | 379 | 3.8% |
| O | 379 | 3.8% |
| Y | 355 | 3.5% |
| Other values (14) | 2139 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1894 | |
| C | 1238 | |
| N | 1168 | |
| M | 792 | 7.9% |
| I | 680 | 6.8% |
| T | 529 | 5.3% |
| K | 447 | 4.5% |
| S | 379 | 3.8% |
| O | 379 | 3.8% |
| Y | 355 | 3.5% |
| Other values (14) | 2139 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1894 | |
| C | 1238 | |
| N | 1168 | |
| M | 792 | 7.9% |
| I | 680 | 6.8% |
| T | 529 | 5.3% |
| K | 447 | 4.5% |
| S | 379 | 3.8% |
| O | 379 | 3.8% |
| Y | 355 | 3.5% |
| Other values (14) | 2139 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1894 | |
| C | 1238 | |
| N | 1168 | |
| M | 792 | 7.9% |
| I | 680 | 6.8% |
| T | 529 | 5.3% |
| K | 447 | 4.5% |
| S | 379 | 3.8% |
| O | 379 | 3.8% |
| Y | 355 | 3.5% |
| Other values (14) | 2139 |
| Distinct | 51 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| California | |
|---|---|
| New York | |
| Texas | 266 |
| North Carolina | 225 |
| Alaska | 221 |
| Other values (46) |
Length
| Max length | 20 |
|---|---|
| Median length | 13 |
| Mean length | 8.57 |
| Min length | 4 |
Characters and Unicode
| Total characters | 42850 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Texas |
|---|---|
| 2nd row | Texas |
| 3rd row | Texas |
| 4th row | Texas |
| 5th row | New Jersey |
Common Values
| Value | Count | Frequency (%) |
| California | 741 | 14.8% |
| New York | 336 | 6.7% |
| Texas | 266 | 5.3% |
| North Carolina | 225 | 4.5% |
| Alaska | 221 | 4.4% |
| Minnesota | 204 | 4.1% |
| Michigan | 196 | 3.9% |
| South Carolina | 180 | 3.6% |
| Kansas | 150 | 3.0% |
| Virginia | 139 | 2.8% |
| Other values (41) | 2342 |
Length
| Value | Count | Frequency (%) |
| california | 741 | 12.4% |
| new | 479 | 8.0% |
| carolina | 405 | 6.8% |
| york | 336 | 5.6% |
| texas | 266 | 4.4% |
| north | 253 | 4.2% |
| alaska | 221 | 3.7% |
| south | 205 | 3.4% |
| minnesota | 204 | 3.4% |
| michigan | 196 | 3.3% |
| Other values (45) | 2677 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6103 | |
| i | 4714 | 11.0% |
| n | 3750 | 8.8% |
| o | 3672 | 8.6% |
| s | 2900 | 6.8% |
| r | 2710 | 6.3% |
| e | 2375 | 5.5% |
| l | 1993 | 4.7% |
| t | 1244 | 2.9% |
| C | 1238 | 2.9% |
| Other values (36) | 12151 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35885 | |
| Uppercase Letter | 5982 | 14.0% |
| Space Separator | 983 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6103 | |
| i | 4714 | |
| n | 3750 | |
| o | 3672 | |
| s | 2900 | |
| r | 2710 | |
| e | 2375 | 6.6% |
| l | 1993 | 5.6% |
| t | 1244 | 3.5% |
| h | 1145 | 3.2% |
| Other values (14) | 5279 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1238 | |
| N | 792 | |
| M | 792 | |
| A | 403 | 6.7% |
| T | 395 | 6.6% |
| Y | 336 | 5.6% |
| I | 325 | 5.4% |
| W | 252 | 4.2% |
| V | 217 | 3.6% |
| O | 215 | 3.6% |
| Other values (11) | 1017 |
Space Separator
| Value | Count | Frequency (%) |
| 983 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41867 | |
| Common | 983 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6103 | |
| i | 4714 | |
| n | 3750 | 9.0% |
| o | 3672 | 8.8% |
| s | 2900 | 6.9% |
| r | 2710 | 6.5% |
| e | 2375 | 5.7% |
| l | 1993 | 4.8% |
| t | 1244 | 3.0% |
| C | 1238 | 3.0% |
| Other values (35) | 11168 |
Common
| Value | Count | Frequency (%) |
| 983 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42850 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6103 | |
| i | 4714 | 11.0% |
| n | 3750 | 8.8% |
| o | 3672 | 8.6% |
| s | 2900 | 6.8% |
| r | 2710 | 6.3% |
| e | 2375 | 5.5% |
| l | 1993 | 4.7% |
| t | 1244 | 2.9% |
| C | 1238 | 2.9% |
| Other values (36) | 12151 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| electric-utility | |
|---|---|
| ipp-non-chp | |
| industrial-chp | |
| commercial-chp | 135 |
| commercial-non-chp | 124 |
| Other values (2) | 139 |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 13.7766 |
| Min length | 7 |
Characters and Unicode
| Total characters | 68883 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ipp-non-chp |
|---|---|
| 2nd row | ipp-non-chp |
| 3rd row | ipp-non-chp |
| 4th row | ipp-non-chp |
| 5th row | ipp-non-chp |
Common Values
| Value | Count | Frequency (%) |
| electric-utility | 2419 | |
| ipp-non-chp | 1925 | |
| industrial-chp | 258 | 5.2% |
| commercial-chp | 135 | 2.7% |
| commercial-non-chp | 124 | 2.5% |
| ipp-chp | 112 | 2.2% |
| industrial-non-chp | 27 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| electric-utility | 2419 | |
| ipp-non-chp | 1925 | |
| industrial-chp | 258 | 5.2% |
| commercial-chp | 135 | 2.7% |
| commercial-non-chp | 124 | 2.5% |
| ipp-chp | 112 | 2.2% |
| industrial-non-chp | 27 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 10123 | |
| c | 7937 | |
| t | 7542 | |
| - | 7076 | |
| p | 6655 | |
| l | 5382 | |
| e | 5097 | |
| n | 4437 | |
| r | 2963 | 4.3% |
| u | 2704 | 3.9% |
| Other values (7) | 8967 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 61807 | |
| Dash Punctuation | 7076 | 10.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 10123 | |
| c | 7937 | |
| t | 7542 | |
| p | 6655 | |
| l | 5382 | |
| e | 5097 | |
| n | 4437 | |
| r | 2963 | 4.8% |
| u | 2704 | 4.4% |
| h | 2581 | 4.2% |
| Other values (6) | 6386 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7076 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 61807 | |
| Common | 7076 | 10.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 10123 | |
| c | 7937 | |
| t | 7542 | |
| p | 6655 | |
| l | 5382 | |
| e | 5097 | |
| n | 4437 | |
| r | 2963 | 4.8% |
| u | 2704 | 4.4% |
| h | 2581 | 4.2% |
| Other values (6) | 6386 |
Common
| Value | Count | Frequency (%) |
| - | 7076 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 10123 | |
| c | 7937 | |
| t | 7542 | |
| - | 7076 | |
| p | 6655 | |
| l | 5382 | |
| e | 5097 | |
| n | 4437 | |
| r | 2963 | 4.3% |
| u | 2704 | 3.9% |
| Other values (7) | 8967 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Electric Utility | |
|---|---|
| IPP Non-CHP | |
| Industrial CHP | |
| Commercial CHP | 135 |
| Commercial Non-CHP | 124 |
| Other values (2) | 139 |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 13.7766 |
| Min length | 7 |
Characters and Unicode
| Total characters | 68883 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | IPP Non-CHP |
|---|---|
| 2nd row | IPP Non-CHP |
| 3rd row | IPP Non-CHP |
| 4th row | IPP Non-CHP |
| 5th row | IPP Non-CHP |
Common Values
| Value | Count | Frequency (%) |
| Electric Utility | 2419 | |
| IPP Non-CHP | 1925 | |
| Industrial CHP | 258 | 5.2% |
| Commercial CHP | 135 | 2.7% |
| Commercial Non-CHP | 124 | 2.5% |
| IPP CHP | 112 | 2.2% |
| Industrial Non-CHP | 27 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| electric | 2419 | |
| utility | 2419 | |
| non-chp | 2076 | |
| ipp | 2037 | |
| chp | 505 | 5.1% |
| industrial | 285 | 2.9% |
| commercial | 259 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 7801 | 11.3% |
| t | 7542 | 10.9% |
| P | 6655 | 9.7% |
| l | 5382 | 7.8% |
| c | 5097 | 7.4% |
| 5000 | 7.3% | |
| r | 2963 | 4.3% |
| C | 2840 | 4.1% |
| e | 2678 | 3.9% |
| H | 2581 | 3.7% |
| Other values (13) | 20344 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40495 | |
| Uppercase Letter | 21312 | |
| Space Separator | 5000 | 7.3% |
| Dash Punctuation | 2076 | 3.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 7801 | |
| t | 7542 | |
| l | 5382 | |
| c | 5097 | |
| r | 2963 | 7.3% |
| e | 2678 | 6.6% |
| y | 2419 | 6.0% |
| n | 2361 | 5.8% |
| o | 2335 | 5.8% |
| a | 544 | 1.3% |
| Other values (4) | 1373 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 6655 | |
| C | 2840 | |
| H | 2581 | 12.1% |
| E | 2419 | 11.4% |
| U | 2419 | 11.4% |
| I | 2322 | 10.9% |
| N | 2076 | 9.7% |
Space Separator
| Value | Count | Frequency (%) |
| 5000 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2076 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 61807 | |
| Common | 7076 | 10.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 7801 | |
| t | 7542 | |
| P | 6655 | |
| l | 5382 | 8.7% |
| c | 5097 | 8.2% |
| r | 2963 | 4.8% |
| C | 2840 | 4.6% |
| e | 2678 | 4.3% |
| H | 2581 | 4.2% |
| E | 2419 | 3.9% |
| Other values (11) | 15849 |
Common
| Value | Count | Frequency (%) |
| 5000 | ||
| - | 2076 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 7801 | 11.3% |
| t | 7542 | 10.9% |
| P | 6655 | 9.7% |
| l | 5382 | 7.8% |
| c | 5097 | 7.4% |
| 5000 | 7.3% | |
| r | 2963 | 4.3% |
| C | 2840 | 4.1% |
| e | 2678 | 3.9% |
| H | 2581 | 3.7% |
| Other values (13) | 20344 |
| Distinct | 984 |
|---|---|
| Distinct (%) | 19.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29407.4486 |
| Minimum | 34 |
|---|---|
| Maximum | 64137 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 34 |
|---|---|
| 5-th percentile | 1752 |
| Q1 | 9990.25 |
| median | 17886 |
| Q3 | 56814 |
| 95-th percentile | 60971.95 |
| Maximum | 64137 |
| Range | 64103 |
| Interquartile range (IQR) | 46823.75 |
Descriptive statistics
| Standard deviation | 22797.80787 |
|---|---|
| Coefficient of variation (CV) | 0.7752392319 |
| Kurtosis | -1.612175848 |
| Mean | 29407.4486 |
| Median Absolute Deviation (MAD) | 14621 |
| Skewness | 0.3191801074 |
| Sum | 147037243 |
| Variance | 519740043.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18642 | 167 | 3.3% |
| 17609 | 138 | 2.8% |
| 17650 | 85 | 1.7% |
| 13781 | 83 | 1.7% |
| 7140 | 74 | 1.5% |
| 17539 | 70 | 1.4% |
| 4254 | 70 | 1.4% |
| 58661 | 63 | 1.3% |
| 40577 | 61 | 1.2% |
| 17543 | 48 | 1.0% |
| Other values (974) | 4141 |
| Value | Count | Frequency (%) |
| 34 | 1 | < 0.1% |
| 213 | 15 | 0.3% |
| 219 | 40 | |
| 221 | 44 | |
| 429 | 1 | < 0.1% |
| 503 | 1 | < 0.1% |
| 733 | 35 | |
| 765 | 9 | 0.2% |
| 768 | 2 | < 0.1% |
| 792 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 64137 | 1 | < 0.1% |
| 64045 | 1 | < 0.1% |
| 64025 | 4 | |
| 63841 | 1 | < 0.1% |
| 63822 | 1 | < 0.1% |
| 63705 | 3 | |
| 63534 | 1 | < 0.1% |
| 63471 | 4 | |
| 63201 | 1 | < 0.1% |
| 63181 | 4 |
| Distinct | 985 |
|---|---|
| Distinct (%) | 19.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Tennessee Valley Authority | 167 |
|---|---|
| Southern California Edison Co | 138 |
| Southern Power Co | 85 |
| Northern States Power Co - Minnesota | 83 |
| Georgia Power Co | 74 |
| Other values (980) |
Length
| Max length | 49 |
|---|---|
| Median length | 39 |
| Mean length | 25.1944 |
| Min length | 5 |
Characters and Unicode
| Total characters | 125972 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 390 ? |
|---|---|
| Unique (%) | 7.8% |
Sample
| 1st row | Peaker Power, LLC |
|---|---|
| 2nd row | Peaker Power, LLC |
| 3rd row | Pecos Wind I LP |
| 4th row | Pecos Wind II LP |
| 5th row | Pedricktown Cogeneration Company LP |
Common Values
| Value | Count | Frequency (%) |
| Tennessee Valley Authority | 167 | 3.3% |
| Southern California Edison Co | 138 | 2.8% |
| Southern Power Co | 85 | 1.7% |
| Northern States Power Co - Minnesota | 83 | 1.7% |
| Georgia Power Co | 74 | 1.5% |
| Consumers Energy Co | 70 | 1.4% |
| Sustainable Power Group, LLC | 63 | 1.3% |
| American Mun Power-Ohio, Inc | 61 | 1.2% |
| South Carolina Electric&Gas Company | 61 | 1.2% |
| South Carolina Public Service Authority | 48 | 1.0% |
| Other values (975) | 4150 |
Length
| Value | Count | Frequency (%) |
| llc | 1412 | 6.9% |
| power | 1027 | 5.0% |
| 789 | 3.8% | |
| co | 774 | 3.8% |
| of | 697 | 3.4% |
| inc | 599 | 2.9% |
| energy | 591 | 2.9% |
| city | 591 | 2.9% |
| authority | 289 | 1.4% |
| electric | 263 | 1.3% |
| Other values (1222) | 13497 |
Most occurring characters
| Value | Count | Frequency (%) |
| 15529 | 12.3% | |
| e | 9975 | 7.9% |
| o | 9139 | 7.3% |
| r | 7948 | 6.3% |
| n | 7396 | 5.9% |
| a | 6743 | 5.4% |
| i | 6363 | 5.1% |
| t | 5888 | 4.7% |
| C | 5039 | 4.0% |
| l | 4279 | 3.4% |
| Other values (63) | 47673 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 82020 | |
| Uppercase Letter | 24807 | 19.7% |
| Space Separator | 15529 | 12.3% |
| Other Punctuation | 1296 | 1.0% |
| Dash Punctuation | 815 | 0.6% |
| Open Punctuation | 521 | 0.4% |
| Close Punctuation | 521 | 0.4% |
| Decimal Number | 431 | 0.3% |
| Control | 32 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9975 | |
| o | 9139 | |
| r | 7948 | |
| n | 7396 | |
| a | 6743 | 8.2% |
| i | 6363 | 7.8% |
| t | 5888 | 7.2% |
| l | 4279 | 5.2% |
| s | 3574 | 4.4% |
| y | 2640 | 3.2% |
| Other values (16) | 18075 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 5039 | |
| L | 3389 | |
| S | 2142 | |
| P | 2069 | 8.3% |
| E | 1720 | 6.9% |
| A | 1259 | 5.1% |
| I | 1158 | 4.7% |
| N | 999 | 4.0% |
| M | 889 | 3.6% |
| G | 884 | 3.6% |
| Other values (16) | 5259 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 98 | |
| 1 | 89 | |
| 2 | 88 | |
| 3 | 42 | |
| 5 | 40 | |
| 9 | 32 | 7.4% |
| 6 | 22 | 5.1% |
| 7 | 7 | 1.6% |
| 8 | 7 | 1.6% |
| 4 | 6 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 794 | |
| & | 252 | 19.4% |
| . | 211 | 16.3% |
| / | 36 | 2.8% |
| # | 2 | 0.2% |
| ? | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 15529 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 815 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 521 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 521 |
Control
| Value | Count | Frequency (%) |
| 32 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 106827 | |
| Common | 19145 | 15.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9975 | 9.3% |
| o | 9139 | 8.6% |
| r | 7948 | 7.4% |
| n | 7396 | 6.9% |
| a | 6743 | 6.3% |
| i | 6363 | 6.0% |
| t | 5888 | 5.5% |
| C | 5039 | 4.7% |
| l | 4279 | 4.0% |
| s | 3574 | 3.3% |
| Other values (42) | 40483 |
Common
| Value | Count | Frequency (%) |
| 15529 | ||
| - | 815 | 4.3% |
| , | 794 | 4.1% |
| ( | 521 | 2.7% |
| ) | 521 | 2.7% |
| & | 252 | 1.3% |
| . | 211 | 1.1% |
| 0 | 98 | 0.5% |
| 1 | 89 | 0.5% |
| 2 | 88 | 0.5% |
| Other values (11) | 227 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 125972 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 15529 | 12.3% | |
| e | 9975 | 7.9% |
| o | 9139 | 7.3% |
| r | 7948 | 6.3% |
| n | 7396 | 5.9% |
| a | 6743 | 5.4% |
| i | 6363 | 5.1% |
| t | 5888 | 4.7% |
| C | 5039 | 4.0% |
| l | 4279 | 3.4% |
| Other values (63) | 47673 |
| Distinct | 2008 |
|---|---|
| Distinct (%) | 40.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31686.292 |
| Minimum | 46 |
|---|---|
| Maximum | 64422 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 46 |
|---|---|
| 5-th percentile | 719 |
| Q1 | 3397 |
| median | 50385 |
| Q3 | 57482 |
| 95-th percentile | 60924.15 |
| Maximum | 64422 |
| Range | 64376 |
| Interquartile range (IQR) | 54085 |
Descriptive statistics
| Standard deviation | 26625.32909 |
|---|---|
| Coefficient of variation (CV) | 0.8402791051 |
| Kurtosis | -1.943962495 |
| Mean | 31686.292 |
| Median Absolute Deviation (MAD) | 11943.5 |
| Skewness | -0.07555309445 |
| Sum | 158431460 |
| Variance | 708908148.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3630 | 27 | 0.5% |
| 54766 | 27 | 0.5% |
| 3406 | 24 | 0.5% |
| 3393 | 22 | 0.4% |
| 10279 | 20 | 0.4% |
| 1166 | 20 | 0.4% |
| 54782 | 18 | 0.4% |
| 3295 | 17 | 0.3% |
| 3403 | 17 | 0.3% |
| 55380 | 17 | 0.3% |
| Other values (1998) | 4791 |
| Value | Count | Frequency (%) |
| 46 | 3 | 0.1% |
| 47 | 9 | |
| 48 | 4 | 0.1% |
| 51 | 1 | < 0.1% |
| 64 | 11 | |
| 65 | 1 | < 0.1% |
| 66 | 8 | |
| 69 | 4 | 0.1% |
| 70 | 2 | < 0.1% |
| 78 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 64422 | 4 | |
| 64232 | 1 | < 0.1% |
| 64231 | 1 | < 0.1% |
| 63986 | 1 | < 0.1% |
| 63857 | 1 | < 0.1% |
| 63837 | 1 | < 0.1% |
| 63836 | 1 | < 0.1% |
| 63835 | 1 | < 0.1% |
| 63753 | 1 | < 0.1% |
| 63681 | 2 |
| Distinct | 2011 |
|---|---|
| Distinct (%) | 40.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Pearsall | 27 |
|---|---|
| Boydton Plank Road Cogen Plant | 27 |
| Johnsonville | 24 |
| Allen | 22 |
| Mt Pleasant | 20 |
| Other values (2006) |
Length
| Max length | 45 |
|---|---|
| Median length | 33 |
| Mean length | 17.7466 |
| Min length | 3 |
Characters and Unicode
| Total characters | 88733 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1029 ? |
|---|---|
| Unique (%) | 20.6% |
Sample
| 1st row | Port Comfort Power LLC |
|---|---|
| 2nd row | Port Comfort Power LLC |
| 3rd row | Woodward Mountain I |
| 4th row | Woodward Mountain II |
| 5th row | Pedricktown Cogeneration Company LP |
Common Values
| Value | Count | Frequency (%) |
| Pearsall | 27 | 0.5% |
| Boydton Plank Road Cogen Plant | 27 | 0.5% |
| Johnsonville | 24 | 0.5% |
| Allen | 22 | 0.4% |
| Mt Pleasant | 20 | 0.4% |
| Kansas River Project | 20 | 0.4% |
| Seneca Energy | 18 | 0.4% |
| Urquhart | 17 | 0.3% |
| T H Wharton | 17 | 0.3% |
| Union Power Station | 17 | 0.3% |
| Other values (2001) | 4791 |
Length
| Value | Count | Frequency (%) |
| solar | 555 | 4.0% |
| energy | 365 | 2.6% |
| llc | 362 | 2.6% |
| plant | 284 | 2.0% |
| project | 266 | 1.9% |
| power | 234 | 1.7% |
| station | 198 | 1.4% |
| center | 182 | 1.3% |
| facility | 178 | 1.3% |
| wind | 154 | 1.1% |
| Other values (2170) | 11209 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8988 | 10.1% | |
| e | 7301 | 8.2% |
| a | 6573 | 7.4% |
| o | 5734 | 6.5% |
| r | 5718 | 6.4% |
| n | 5583 | 6.3% |
| t | 5213 | 5.9% |
| l | 4643 | 5.2% |
| i | 4612 | 5.2% |
| C | 2167 | 2.4% |
| Other values (63) | 32201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 61953 | |
| Uppercase Letter | 15858 | 17.9% |
| Space Separator | 8988 | 10.1% |
| Decimal Number | 944 | 1.1% |
| Other Punctuation | 459 | 0.5% |
| Open Punctuation | 177 | 0.2% |
| Close Punctuation | 177 | 0.2% |
| Dash Punctuation | 174 | 0.2% |
| Control | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7301 | |
| a | 6573 | |
| o | 5734 | |
| r | 5718 | |
| n | 5583 | |
| t | 5213 | |
| l | 4643 | 7.5% |
| i | 4612 | 7.4% |
| s | 2165 | 3.5% |
| y | 1691 | 2.7% |
| Other values (16) | 12720 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2167 | |
| S | 1959 | |
| P | 1690 | 10.7% |
| L | 1314 | 8.3% |
| E | 740 | 4.7% |
| M | 732 | 4.6% |
| G | 725 | 4.6% |
| B | 690 | 4.4% |
| H | 674 | 4.3% |
| F | 669 | 4.2% |
| Other values (16) | 4498 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 216 | |
| 2 | 197 | |
| 0 | 111 | |
| 3 | 93 | |
| 4 | 80 | 8.5% |
| 5 | 64 | 6.8% |
| 6 | 62 | 6.6% |
| 8 | 46 | 4.9% |
| 9 | 42 | 4.4% |
| 7 | 33 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 192 | |
| # | 141 | |
| . | 91 | |
| & | 18 | 3.9% |
| / | 10 | 2.2% |
| ' | 7 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 8988 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 177 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 177 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 174 |
Control
| Value | Count | Frequency (%) |
| 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 77811 | |
| Common | 10922 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 7301 | 9.4% |
| a | 6573 | 8.4% |
| o | 5734 | 7.4% |
| r | 5718 | 7.3% |
| n | 5583 | 7.2% |
| t | 5213 | 6.7% |
| l | 4643 | 6.0% |
| i | 4612 | 5.9% |
| C | 2167 | 2.8% |
| s | 2165 | 2.8% |
| Other values (42) | 28102 |
Common
| Value | Count | Frequency (%) |
| 8988 | ||
| 1 | 216 | 2.0% |
| 2 | 197 | 1.8% |
| , | 192 | 1.8% |
| ( | 177 | 1.6% |
| ) | 177 | 1.6% |
| - | 174 | 1.6% |
| # | 141 | 1.3% |
| 0 | 111 | 1.0% |
| 3 | 93 | 0.9% |
| Other values (11) | 456 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 88733 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8988 | 10.1% | |
| e | 7301 | 8.2% |
| a | 6573 | 7.4% |
| o | 5734 | 6.5% |
| r | 5718 | 6.4% |
| n | 5583 | 6.3% |
| t | 5213 | 5.9% |
| l | 4643 | 5.2% |
| i | 4612 | 5.2% |
| C | 2167 | 2.4% |
| Other values (63) | 32201 |
| Distinct | 1541 |
|---|---|
| Distinct (%) | 30.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| 1 | |
|---|---|
| 2 | |
| 3 | 274 |
| 4 | 196 |
| GEN1 | 190 |
| Other values (1536) |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 2.5848 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12924 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1251 ? |
|---|---|
| Unique (%) | 25.0% |
Sample
| 1st row | PC1 |
|---|---|
| 2nd row | PC2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | GEN1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 694 | 13.9% |
| 2 | 405 | 8.1% |
| 3 | 274 | 5.5% |
| 4 | 196 | 3.9% |
| GEN1 | 190 | 3.8% |
| 5 | 134 | 2.7% |
| 6 | 112 | 2.2% |
| GEN2 | 95 | 1.9% |
| PV1 | 87 | 1.7% |
| 7 | 83 | 1.7% |
| Other values (1531) | 2730 |
Length
| Value | Count | Frequency (%) |
| 1 | 712 | 14.1% |
| 2 | 412 | 8.2% |
| 3 | 276 | 5.5% |
| 4 | 198 | 3.9% |
| gen1 | 190 | 3.8% |
| 5 | 137 | 2.7% |
| 6 | 114 | 2.3% |
| gen2 | 95 | 1.9% |
| pv1 | 87 | 1.7% |
| 7 | 84 | 1.7% |
| Other values (1516) | 2734 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2058 | |
| G | 1173 | 9.1% |
| 2 | 1102 | 8.5% |
| N | 742 | 5.7% |
| E | 718 | 5.6% |
| T | 709 | 5.5% |
| 3 | 681 | 5.3% |
| S | 601 | 4.7% |
| C | 541 | 4.2% |
| 4 | 510 | 3.9% |
| Other values (31) | 4089 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6854 | |
| Decimal Number | 5920 | |
| Dash Punctuation | 103 | 0.8% |
| Space Separator | 39 | 0.3% |
| Other Punctuation | 7 | 0.1% |
| Lowercase Letter | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1173 | |
| N | 742 | |
| E | 718 | |
| T | 709 | |
| S | 601 | |
| C | 541 | 7.9% |
| A | 282 | 4.1% |
| P | 258 | 3.8% |
| I | 217 | 3.2% |
| B | 193 | 2.8% |
| Other values (16) | 1420 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2058 | |
| 2 | 1102 | |
| 3 | 681 | 11.5% |
| 4 | 510 | 8.6% |
| 0 | 382 | 6.5% |
| 5 | 361 | 6.1% |
| 6 | 274 | 4.6% |
| 7 | 240 | 4.1% |
| 8 | 184 | 3.1% |
| 9 | 128 | 2.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 6 | |
| . | 1 | 14.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 103 |
Space Separator
| Value | Count | Frequency (%) |
| 39 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6855 | |
| Common | 6069 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 1173 | |
| N | 742 | |
| E | 718 | |
| T | 709 | |
| S | 601 | |
| C | 541 | 7.9% |
| A | 282 | 4.1% |
| P | 258 | 3.8% |
| I | 217 | 3.2% |
| B | 193 | 2.8% |
| Other values (17) | 1421 |
Common
| Value | Count | Frequency (%) |
| 1 | 2058 | |
| 2 | 1102 | |
| 3 | 681 | 11.2% |
| 4 | 510 | 8.4% |
| 0 | 382 | 6.3% |
| 5 | 361 | 5.9% |
| 6 | 274 | 4.5% |
| 7 | 240 | 4.0% |
| 8 | 184 | 3.0% |
| 9 | 128 | 2.1% |
| Other values (4) | 149 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12924 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2058 | |
| G | 1173 | 9.1% |
| 2 | 1102 | 8.5% |
| N | 742 | 5.7% |
| E | 718 | 5.6% |
| T | 709 | 5.5% |
| 3 | 681 | 5.3% |
| S | 601 | 4.7% |
| C | 541 | 4.2% |
| 4 | 510 | 3.9% |
| Other values (31) | 4089 |
| Distinct | 80 |
|---|---|
| Distinct (%) | 17.5% |
| Missing | 4542 |
| Missing (%) | 90.8% |
| Memory size | 39.2 KiB |
| CC1 | |
|---|---|
| 1 | 15 |
| CHP1 | 10 |
| CC2 | 10 |
| BLK2 | 10 |
| Other values (75) |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.43231441 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1572 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | CC1 |
|---|---|
| 2nd row | CC1 |
| 3rd row | CC1 |
| 4th row | CC1 |
| 5th row | G104 |
Common Values
| Value | Count | Frequency (%) |
| CC1 | 175 | 3.5% |
| 1 | 15 | 0.3% |
| CHP1 | 10 | 0.2% |
| CC2 | 10 | 0.2% |
| BLK2 | 10 | 0.2% |
| CC01 | 10 | 0.2% |
| PB01 | 9 | 0.2% |
| BLK1 | 9 | 0.2% |
| STG1 | 6 | 0.1% |
| PLTB | 6 | 0.1% |
| Other values (70) | 198 | 4.0% |
| (Missing) | 4542 |
Length
| Value | Count | Frequency (%) |
| cc1 | 175 | |
| 1 | 15 | 3.3% |
| chp1 | 10 | 2.2% |
| cc01 | 10 | 2.2% |
| cc2 | 10 | 2.2% |
| blk2 | 10 | 2.2% |
| pb01 | 9 | 2.0% |
| blk1 | 9 | 2.0% |
| stg1 | 6 | 1.3% |
| pltb | 6 | 1.3% |
| Other values (70) | 198 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 506 | |
| 1 | 344 | |
| 0 | 109 | 6.9% |
| B | 78 | 5.0% |
| G | 65 | 4.1% |
| 2 | 57 | 3.6% |
| P | 47 | 3.0% |
| L | 46 | 2.9% |
| T | 40 | 2.5% |
| S | 38 | 2.4% |
| Other values (21) | 242 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 964 | |
| Decimal Number | 608 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 506 | |
| B | 78 | 8.1% |
| G | 65 | 6.7% |
| P | 47 | 4.9% |
| L | 46 | 4.8% |
| T | 40 | 4.1% |
| S | 38 | 3.9% |
| H | 29 | 3.0% |
| K | 26 | 2.7% |
| U | 17 | 1.8% |
| Other values (12) | 72 | 7.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 344 | |
| 0 | 109 | 17.9% |
| 2 | 57 | 9.4% |
| 3 | 31 | 5.1% |
| 4 | 21 | 3.5% |
| 6 | 14 | 2.3% |
| 5 | 13 | 2.1% |
| 8 | 10 | 1.6% |
| 7 | 9 | 1.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 964 | |
| Common | 608 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 506 | |
| B | 78 | 8.1% |
| G | 65 | 6.7% |
| P | 47 | 4.9% |
| L | 46 | 4.8% |
| T | 40 | 4.1% |
| S | 38 | 3.9% |
| H | 29 | 3.0% |
| K | 26 | 2.7% |
| U | 17 | 1.8% |
| Other values (12) | 72 | 7.5% |
Common
| Value | Count | Frequency (%) |
| 1 | 344 | |
| 0 | 109 | 17.9% |
| 2 | 57 | 9.4% |
| 3 | 31 | 5.1% |
| 4 | 21 | 3.5% |
| 6 | 14 | 2.3% |
| 5 | 13 | 2.1% |
| 8 | 10 | 1.6% |
| 7 | 9 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1572 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 506 | |
| 1 | 344 | |
| 0 | 109 | 6.9% |
| B | 78 | 5.0% |
| G | 65 | 4.1% |
| 2 | 57 | 3.6% |
| P | 47 | 3.0% |
| L | 46 | 2.9% |
| T | 40 | 2.5% |
| S | 38 | 2.4% |
| Other values (21) | 242 |
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Petroleum Liquids | |
|---|---|
| Conventional Hydroelectric | |
| Solar Photovoltaic | |
| Natural Gas Fired Combustion Turbine | |
| Natural Gas Fired Combined Cycle | |
| Other values (19) |
Length
| Max length | 38 |
|---|---|
| Median length | 33 |
| Mean length | 23.5074 |
| Min length | 7 |
Characters and Unicode
| Total characters | 117537 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Natural Gas Fired Combustion Turbine |
|---|---|
| 2nd row | Natural Gas Fired Combustion Turbine |
| 3rd row | Onshore Wind Turbine |
| 4th row | Onshore Wind Turbine |
| 5th row | Natural Gas Fired Combined Cycle |
Common Values
| Value | Count | Frequency (%) |
| Petroleum Liquids | 928 | |
| Conventional Hydroelectric | 879 | |
| Solar Photovoltaic | 863 | |
| Natural Gas Fired Combustion Turbine | 500 | |
| Natural Gas Fired Combined Cycle | 450 | |
| Natural Gas Internal Combustion Engine | 243 | 4.9% |
| Onshore Wind Turbine | 242 | 4.8% |
| Landfill Gas | 225 | 4.5% |
| Conventional Steam Coal | 188 | 3.8% |
| Natural Gas Steam Turbine | 135 | 2.7% |
| Other values (14) | 347 | 6.9% |
Length
| Value | Count | Frequency (%) |
| gas | 1568 | 10.9% |
| natural | 1343 | 9.3% |
| conventional | 1067 | 7.4% |
| fired | 950 | 6.6% |
| hydroelectric | 934 | 6.5% |
| petroleum | 932 | 6.4% |
| liquids | 928 | 6.4% |
| turbine | 877 | 6.1% |
| solar | 872 | 6.0% |
| photovoltaic | 863 | 6.0% |
| Other values (29) | 4117 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 10446 | 8.9% |
| 9451 | 8.0% | |
| e | 9132 | 7.8% |
| i | 8650 | 7.4% |
| a | 8509 | 7.2% |
| t | 7730 | 6.6% |
| r | 7592 | 6.5% |
| l | 7473 | 6.4% |
| n | 6970 | 5.9% |
| u | 4911 | 4.2% |
| Other values (31) | 36673 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 93474 | |
| Uppercase Letter | 14527 | 12.4% |
| Space Separator | 9451 | 8.0% |
| Other Punctuation | 85 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 10446 | |
| e | 9132 | |
| i | 8650 | |
| a | 8509 | |
| t | 7730 | |
| r | 7592 | |
| l | 7473 | |
| n | 6970 | 7.5% |
| u | 4911 | 5.3% |
| s | 3966 | 4.2% |
| Other values (13) | 18095 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2902 | |
| P | 1850 | |
| G | 1622 | |
| N | 1360 | |
| S | 1268 | |
| L | 1153 | 7.9% |
| F | 952 | 6.6% |
| H | 934 | 6.4% |
| T | 886 | 6.1% |
| W | 569 | 3.9% |
| Other values (6) | 1031 | 7.1% |
Space Separator
| Value | Count | Frequency (%) |
| 9451 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 85 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 108001 | |
| Common | 9536 | 8.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 10446 | 9.7% |
| e | 9132 | 8.5% |
| i | 8650 | 8.0% |
| a | 8509 | 7.9% |
| t | 7730 | 7.2% |
| r | 7592 | 7.0% |
| l | 7473 | 6.9% |
| n | 6970 | 6.5% |
| u | 4911 | 4.5% |
| s | 3966 | 3.7% |
| Other values (29) | 32622 |
Common
| Value | Count | Frequency (%) |
| 9451 | ||
| / | 85 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 117537 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 10446 | 8.9% |
| 9451 | 8.0% | |
| e | 9132 | 7.8% |
| i | 8650 | 7.4% |
| a | 8509 | 7.2% |
| t | 7730 | 6.6% |
| r | 7592 | 6.5% |
| l | 7473 | 6.4% |
| n | 6970 | 5.9% |
| u | 4911 | 4.2% |
| Other values (31) | 36673 |
| Distinct | 28 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| NG | |
|---|---|
| WAT | |
| DFO | |
| SUN | |
| WND | |
| Other values (23) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.7184 |
| Min length | 2 |
Characters and Unicode
| Total characters | 13592 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NG |
|---|---|
| 2nd row | NG |
| 3rd row | WND |
| 4th row | WND |
| 5th row | NG |
Common Values
| Value | Count | Frequency (%) |
| NG | 1343 | |
| WAT | 934 | |
| DFO | 890 | |
| SUN | 872 | |
| WND | 242 | 4.8% |
| LFG | 225 | 4.5% |
| BIT | 81 | 1.6% |
| SUB | 77 | 1.5% |
| OBG | 61 | 1.2% |
| GEO | 47 | 0.9% |
| Other values (18) | 228 | 4.6% |
Length
| Value | Count | Frequency (%) |
| ng | 1343 | |
| wat | 934 | |
| dfo | 890 | |
| sun | 872 | |
| wnd | 242 | 4.8% |
| lfg | 225 | 4.5% |
| bit | 81 | 1.6% |
| sub | 77 | 1.5% |
| obg | 61 | 1.2% |
| geo | 47 | 0.9% |
| Other values (18) | 228 | 4.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2474 | |
| G | 1685 | |
| W | 1274 | |
| D | 1177 | |
| F | 1132 | |
| O | 1030 | |
| T | 1016 | |
| S | 1003 | |
| U | 968 | 7.1% |
| A | 936 | 6.9% |
| Other values (12) | 897 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 13592 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2474 | |
| G | 1685 | |
| W | 1274 | |
| D | 1177 | |
| F | 1132 | |
| O | 1030 | |
| T | 1016 | |
| S | 1003 | |
| U | 968 | 7.1% |
| A | 936 | 6.9% |
| Other values (12) | 897 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13592 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 2474 | |
| G | 1685 | |
| W | 1274 | |
| D | 1177 | |
| F | 1132 | |
| O | 1030 | |
| T | 1016 | |
| S | 1003 | |
| U | 968 | 7.1% |
| A | 936 | 6.9% |
| Other values (12) | 897 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13592 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 2474 | |
| G | 1685 | |
| W | 1274 | |
| D | 1177 | |
| F | 1132 | |
| O | 1030 | |
| T | 1016 | |
| S | 1003 | |
| U | 968 | 7.1% |
| A | 936 | 6.9% |
| Other values (12) | 897 | 6.6% |
| Distinct | 28 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Natural Gas | |
|---|---|
| Water | |
| Disillate Fuel Oil | |
| Solar | |
| Wind | |
| Other values (23) |
Length
| Max length | 35 |
|---|---|
| Median length | 27 |
| Mean length | 10.2334 |
| Min length | 4 |
Characters and Unicode
| Total characters | 51167 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Natural Gas |
|---|---|
| 2nd row | Natural Gas |
| 3rd row | Wind |
| 4th row | Wind |
| 5th row | Natural Gas |
Common Values
| Value | Count | Frequency (%) |
| Natural Gas | 1343 | |
| Water | 934 | |
| Disillate Fuel Oil | 890 | |
| Solar | 872 | |
| Wind | 242 | 4.8% |
| Landfill Gas | 225 | 4.5% |
| Bituminous Coal | 81 | 1.6% |
| Subbituminous Coal | 77 | 1.5% |
| Other Biomass Gases | 61 | 1.2% |
| Geothermal | 47 | 0.9% |
| Other values (18) | 228 | 4.6% |
Length
| Value | Count | Frequency (%) |
| gas | 1575 | |
| natural | 1343 | |
| water | 934 | |
| oil | 914 | |
| fuel | 907 | |
| disillate | 890 | |
| solar | 872 | |
| wind | 242 | 2.7% |
| landfill | 225 | 2.5% |
| coal | 186 | 2.1% |
| Other values (32) | 868 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7736 | |
| l | 6677 | |
| 4017 | 7.9% | |
| i | 3733 | 7.3% |
| t | 3603 | 7.0% |
| r | 3419 | 6.7% |
| e | 3283 | 6.4% |
| s | 3061 | 6.0% |
| u | 2755 | 5.4% |
| G | 1683 | 3.3% |
| Other values (32) | 11200 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 38248 | |
| Uppercase Letter | 8884 | 17.4% |
| Space Separator | 4017 | 7.9% |
| Open Punctuation | 9 | < 0.1% |
| Close Punctuation | 9 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7736 | |
| l | 6677 | |
| i | 3733 | |
| t | 3603 | |
| r | 3419 | |
| e | 3283 | |
| s | 3061 | 8.0% |
| u | 2755 | 7.2% |
| o | 1567 | 4.1% |
| n | 692 | 1.8% |
| Other values (11) | 1722 | 4.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1683 | |
| N | 1360 | |
| W | 1301 | |
| S | 1005 | |
| O | 983 | |
| F | 907 | |
| D | 890 | |
| L | 267 | 3.0% |
| C | 190 | 2.1% |
| B | 184 | 2.1% |
| Other values (8) | 114 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 4017 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 9 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 9 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 47132 | |
| Common | 4035 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7736 | |
| l | 6677 | |
| i | 3733 | 7.9% |
| t | 3603 | 7.6% |
| r | 3419 | 7.3% |
| e | 3283 | 7.0% |
| s | 3061 | 6.5% |
| u | 2755 | 5.8% |
| G | 1683 | 3.6% |
| o | 1567 | 3.3% |
| Other values (29) | 9615 |
Common
| Value | Count | Frequency (%) |
| 4017 | ||
| ( | 9 | 0.2% |
| ) | 9 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51167 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7736 | |
| l | 6677 | |
| 4017 | 7.9% | |
| i | 3733 | 7.3% |
| t | 3603 | 7.0% |
| r | 3419 | 6.7% |
| e | 3283 | 6.4% |
| s | 3061 | 6.0% |
| u | 2755 | 5.4% |
| G | 1683 | 3.3% |
| Other values (32) | 11200 |
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| IC | |
|---|---|
| HY | |
| PV | |
| GT | |
| ST | |
| Other values (11) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | GT |
|---|---|
| 2nd row | GT |
| 3rd row | WT |
| 4th row | WT |
| 5th row | CT |
Common Values
| Value | Count | Frequency (%) |
| IC | 1270 | |
| HY | 879 | |
| PV | 863 | |
| GT | 632 | |
| ST | 492 | 9.8% |
| CT | 300 | 6.0% |
| WT | 242 | 4.8% |
| CA | 152 | 3.0% |
| PS | 55 | 1.1% |
| FC | 35 | 0.7% |
| Other values (6) | 80 | 1.6% |
Length
| Value | Count | Frequency (%) |
| ic | 1270 | |
| hy | 879 | |
| pv | 863 | |
| gt | 632 | |
| st | 492 | 9.8% |
| ct | 300 | 6.0% |
| wt | 242 | 4.8% |
| ca | 152 | 3.0% |
| ps | 55 | 1.1% |
| fc | 35 | 0.7% |
| Other values (6) | 80 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1778 | |
| T | 1707 | |
| I | 1270 | |
| P | 920 | |
| H | 879 | |
| Y | 879 | |
| V | 863 | |
| G | 632 | 6.3% |
| S | 566 | 5.7% |
| W | 244 | 2.4% |
| Other values (4) | 262 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1778 | |
| T | 1707 | |
| I | 1270 | |
| P | 920 | |
| H | 879 | |
| Y | 879 | |
| V | 863 | |
| G | 632 | 6.3% |
| S | 566 | 5.7% |
| W | 244 | 2.4% |
| Other values (4) | 262 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 1778 | |
| T | 1707 | |
| I | 1270 | |
| P | 920 | |
| H | 879 | |
| Y | 879 | |
| V | 863 | |
| G | 632 | 6.3% |
| S | 566 | 5.7% |
| W | 244 | 2.4% |
| Other values (4) | 262 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1778 | |
| T | 1707 | |
| I | 1270 | |
| P | 920 | |
| H | 879 | |
| Y | 879 | |
| V | 863 | |
| G | 632 | 6.3% |
| S | 566 | 5.7% |
| W | 244 | 2.4% |
| Other values (4) | 262 | 2.6% |
| Distinct | 50 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 285 |
| Missing (%) | 5.7% |
| Memory size | 39.2 KiB |
| MISO | |
|---|---|
| CISO | |
| PJM | |
| SWPP | |
| NYIS | |
| Other values (45) |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.746129374 |
| Min length | 2 |
Characters and Unicode
| Total characters | 17663 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | ERCO |
|---|---|
| 2nd row | ERCO |
| 3rd row | ERCO |
| 4th row | ERCO |
| 5th row | PJM |
Common Values
| Value | Count | Frequency (%) |
| MISO | 889 | |
| CISO | 701 | |
| PJM | 619 | |
| SWPP | 344 | 6.9% |
| NYIS | 337 | 6.7% |
| ISNE | 264 | 5.3% |
| ERCO | 234 | 4.7% |
| TVA | 178 | 3.6% |
| SOCO | 161 | 3.2% |
| DUK | 132 | 2.6% |
| Other values (40) | 856 | |
| (Missing) | 285 | 5.7% |
Length
| Value | Count | Frequency (%) |
| miso | 889 | |
| ciso | 701 | |
| pjm | 619 | |
| swpp | 344 | 7.3% |
| nyis | 337 | 7.1% |
| isne | 264 | 5.6% |
| erco | 234 | 5.0% |
| tva | 178 | 3.8% |
| soco | 161 | 3.4% |
| duk | 132 | 2.8% |
| Other values (40) | 856 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2916 | |
| I | 2306 | |
| O | 2226 | |
| P | 1825 | |
| C | 1628 | |
| M | 1599 | |
| E | 868 | 4.9% |
| N | 662 | 3.7% |
| J | 625 | 3.5% |
| A | 566 | 3.2% |
| Other values (14) | 2442 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 17663 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2916 | |
| I | 2306 | |
| O | 2226 | |
| P | 1825 | |
| C | 1628 | |
| M | 1599 | |
| E | 868 | 4.9% |
| N | 662 | 3.7% |
| J | 625 | 3.5% |
| A | 566 | 3.2% |
| Other values (14) | 2442 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17663 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2916 | |
| I | 2306 | |
| O | 2226 | |
| P | 1825 | |
| C | 1628 | |
| M | 1599 | |
| E | 868 | 4.9% |
| N | 662 | 3.7% |
| J | 625 | 3.5% |
| A | 566 | 3.2% |
| Other values (14) | 2442 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 2916 | |
| I | 2306 | |
| O | 2226 | |
| P | 1825 | |
| C | 1628 | |
| M | 1599 | |
| E | 868 | 4.9% |
| N | 662 | 3.7% |
| J | 625 | 3.5% |
| A | 566 | 3.2% |
| Other values (14) | 2442 |
| Distinct | 52 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 277 |
| Missing (%) | 5.5% |
| Memory size | 39.2 KiB |
| Midcontinent Independent Transmission System Operator, Inc.. | |
|---|---|
| California Independent System Operator | |
| PJM Interconnection, LLC | |
| Southwest Power Pool | |
| New York Independent System Operator | |
| Other values (47) |
Length
| Max length | 100 |
|---|---|
| Median length | 57 |
| Mean length | 35.90239255 |
| Min length | 3 |
Characters and Unicode
| Total characters | 169567 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Electric Reliability Council of Texas, Inc. |
|---|---|
| 2nd row | Electric Reliability Council of Texas, Inc. |
| 3rd row | Electric Reliability Council of Texas, Inc. |
| 4th row | Electric Reliability Council of Texas, Inc. |
| 5th row | PJM Interconnection, LLC |
Common Values
| Value | Count | Frequency (%) |
| Midcontinent Independent Transmission System Operator, Inc.. | 889 | |
| California Independent System Operator | 701 | |
| PJM Interconnection, LLC | 619 | |
| Southwest Power Pool | 344 | 6.9% |
| New York Independent System Operator | 337 | 6.7% |
| ISO New England Inc. | 264 | 5.3% |
| Electric Reliability Council of Texas, Inc. | 234 | 4.7% |
| Tennessee Valley Authority | 178 | 3.6% |
| Southern Company Services, Inc. - Trans | 161 | 3.2% |
| Duke Energy Carolinas | 132 | 2.6% |
| Other values (42) | 864 | |
| (Missing) | 277 | 5.5% |
Length
| Value | Count | Frequency (%) |
| independent | 1927 | 9.2% |
| operator | 1927 | 9.2% |
| system | 1927 | 9.2% |
| inc | 1597 | 7.7% |
| midcontinent | 889 | 4.3% |
| transmission | 889 | 4.3% |
| california | 704 | 3.4% |
| llc | 629 | 3.0% |
| new | 625 | 3.0% |
| pjm | 619 | 3.0% |
| Other values (104) | 9102 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 18370 | 10.8% |
| e | 17277 | 10.2% |
| 16112 | 9.5% | |
| t | 11771 | 6.9% |
| o | 10754 | 6.3% |
| r | 9942 | 5.9% |
| i | 9109 | 5.4% |
| a | 7488 | 4.4% |
| s | 6986 | 4.1% |
| c | 5404 | 3.2% |
| Other values (48) | 56354 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 125065 | |
| Uppercase Letter | 23441 | 13.8% |
| Space Separator | 16112 | 9.5% |
| Other Punctuation | 4558 | 2.7% |
| Dash Punctuation | 361 | 0.2% |
| Open Punctuation | 10 | < 0.1% |
| Close Punctuation | 10 | < 0.1% |
| Decimal Number | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 18370 | |
| e | 17277 | |
| t | 11771 | |
| o | 10754 | |
| r | 9942 | |
| i | 9109 | |
| a | 7488 | 6.0% |
| s | 6986 | 5.6% |
| c | 5404 | 4.3% |
| d | 5375 | 4.3% |
| Other values (16) | 22589 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 4474 | |
| S | 3238 | |
| C | 2571 | |
| O | 2191 | |
| P | 1929 | |
| M | 1590 | 6.8% |
| T | 1563 | 6.7% |
| L | 1296 | 5.5% |
| E | 1122 | 4.8% |
| N | 696 | 3.0% |
| Other values (13) | 2771 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2493 | |
| , | 1960 | |
| & | 105 | 2.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 2 | 1 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| 16112 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 361 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 148506 | |
| Common | 21061 | 12.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 18370 | 12.4% |
| e | 17277 | 11.6% |
| t | 11771 | 7.9% |
| o | 10754 | 7.2% |
| r | 9942 | 6.7% |
| i | 9109 | 6.1% |
| a | 7488 | 5.0% |
| s | 6986 | 4.7% |
| c | 5404 | 3.6% |
| d | 5375 | 3.6% |
| Other values (39) | 46030 |
Common
| Value | Count | Frequency (%) |
| 16112 | ||
| . | 2493 | 11.8% |
| , | 1960 | 9.3% |
| - | 361 | 1.7% |
| & | 105 | 0.5% |
| ( | 10 | < 0.1% |
| ) | 10 | < 0.1% |
| 1 | 9 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 169567 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 18370 | 10.8% |
| e | 17277 | 10.2% |
| 16112 | 9.5% | |
| t | 11771 | 6.9% |
| o | 10754 | 6.3% |
| r | 9942 | 5.9% |
| i | 9109 | 5.4% |
| a | 7488 | 4.4% |
| s | 6986 | 4.1% |
| c | 5404 | 3.2% |
| Other values (48) | 56354 |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| OP | |
|---|---|
| SB | 386 |
| OS | 80 |
| OA | 35 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OP |
|---|---|
| 2nd row | OP |
| 3rd row | OP |
| 4th row | OP |
| 5th row | OP |
Common Values
| Value | Count | Frequency (%) |
| OP | 4499 | |
| SB | 386 | 7.7% |
| OS | 80 | 1.6% |
| OA | 35 | 0.7% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| op | 4499 | |
| sb | 386 | 7.7% |
| os | 80 | 1.6% |
| oa | 35 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 4614 | |
| P | 4499 | |
| S | 466 | 4.7% |
| B | 386 | 3.9% |
| A | 35 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 4614 | |
| P | 4499 | |
| S | 466 | 4.7% |
| B | 386 | 3.9% |
| A | 35 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 4614 | |
| P | 4499 | |
| S | 466 | 4.7% |
| B | 386 | 3.9% |
| A | 35 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 4614 | |
| P | 4499 | |
| S | 466 | 4.7% |
| B | 386 | 3.9% |
| A | 35 | 0.4% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Operating | |
|---|---|
| Standby/Backup: available for service but not normally used | 386 |
| Out of service and NOT expected to return to service in next calendar year | 80 |
| Out of service but expected to return to service in next calendar year | 35 |
Length
| Max length | 74 |
|---|---|
| Median length | 9 |
| Mean length | 14.327 |
| Min length | 9 |
Characters and Unicode
| Total characters | 71635 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Operating |
|---|---|
| 2nd row | Operating |
| 3rd row | Operating |
| 4th row | Operating |
| 5th row | Operating |
Common Values
| Value | Count | Frequency (%) |
| Operating | 4499 | |
| Standby/Backup: available for service but not normally used | 386 | 7.7% |
| Out of service and NOT expected to return to service in next calendar year | 80 | 1.6% |
| Out of service but expected to return to service in next calendar year | 35 | 0.7% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| operating | 4499 | |
| service | 616 | 6.7% |
| not | 466 | 5.1% |
| but | 421 | 4.6% |
| available | 386 | 4.2% |
| for | 386 | 4.2% |
| normally | 386 | 4.2% |
| used | 386 | 4.2% |
| standby/backup | 386 | 4.2% |
| to | 230 | 2.5% |
| Other values (9) | 1000 | 10.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 7308 | |
| a | 7240 | |
| t | 6382 | |
| r | 6347 | |
| n | 6197 | |
| i | 5616 | 7.8% |
| p | 5000 | 7.0% |
| O | 4694 | 6.6% |
| g | 4499 | 6.3% |
| 4162 | 5.8% | |
| Other values (19) | 14190 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 61075 | |
| Uppercase Letter | 5626 | 7.9% |
| Space Separator | 4162 | 5.8% |
| Other Punctuation | 772 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7308 | |
| a | 7240 | |
| t | 6382 | |
| r | 6347 | |
| n | 6197 | |
| i | 5616 | |
| p | 5000 | |
| g | 4499 | |
| l | 1659 | 2.7% |
| o | 1503 | 2.5% |
| Other values (11) | 9324 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 4694 | |
| B | 386 | 6.9% |
| S | 386 | 6.9% |
| N | 80 | 1.4% |
| T | 80 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 386 | |
| / | 386 |
Space Separator
| Value | Count | Frequency (%) |
| 4162 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 66701 | |
| Common | 4934 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 7308 | |
| a | 7240 | |
| t | 6382 | |
| r | 6347 | |
| n | 6197 | |
| i | 5616 | |
| p | 5000 | |
| O | 4694 | 7.0% |
| g | 4499 | 6.7% |
| l | 1659 | 2.5% |
| Other values (16) | 11759 |
Common
| Value | Count | Frequency (%) |
| 4162 | ||
| : | 386 | 7.8% |
| / | 386 | 7.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 71635 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 7308 | |
| a | 7240 | |
| t | 6382 | |
| r | 6347 | |
| n | 6197 | |
| i | 5616 | 7.8% |
| p | 5000 | 7.0% |
| O | 4694 | 6.6% |
| g | 4499 | 6.3% |
| 4162 | 5.8% | |
| Other values (19) | 14190 |
| Distinct | 748 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 56.86852 |
| Minimum | 0.1 |
|---|---|
| Maximum | 1300 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.4 |
| Q1 | 1.6 |
| median | 5 |
| Q3 | 55 |
| 95-th percentile | 245 |
| Maximum | 1300 |
| Range | 1299.9 |
| Interquartile range (IQR) | 53.4 |
Descriptive statistics
| Standard deviation | 133.6035466 |
|---|---|
| Coefficient of variation (CV) | 2.349341016 |
| Kurtosis | 29.72833774 |
| Mean | 56.86852 |
| Median Absolute Deviation (MAD) | 4.5 |
| Skewness | 4.826863195 |
| Sum | 284342.6 |
| Variance | 17849.90766 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 255 | 5.1% |
| 1 | 200 | 4.0% |
| 0.5 | 188 | 3.8% |
| 1.6 | 119 | 2.4% |
| 0.3 | 112 | 2.2% |
| 1.8 | 108 | 2.2% |
| 3 | 107 | 2.1% |
| 5 | 99 | 2.0% |
| 1.5 | 99 | 2.0% |
| 0.8 | 95 | 1.9% |
| Other values (738) | 3618 |
| Value | Count | Frequency (%) |
| 0.1 | 28 | 0.6% |
| 0.2 | 55 | 1.1% |
| 0.3 | 112 | |
| 0.4 | 64 | 1.3% |
| 0.5 | 188 | |
| 0.6 | 73 | 1.5% |
| 0.7 | 44 | 0.9% |
| 0.8 | 95 | |
| 0.9 | 52 | 1.0% |
| 1 | 200 |
| Value | Count | Frequency (%) |
| 1300 | 6 | |
| 1245.6 | 2 | < 0.1% |
| 1242 | 1 | < 0.1% |
| 1205.1 | 2 | < 0.1% |
| 1190 | 1 | < 0.1% |
| 1152 | 2 | < 0.1% |
| 1029.6 | 1 | < 0.1% |
| 1008 | 1 | < 0.1% |
| 956.8 | 1 | < 0.1% |
| 952 | 2 | < 0.1% |
| Distinct | 827 |
|---|---|
| Distinct (%) | 16.6% |
| Missing | 18 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.1248896 |
| Minimum | 0 |
|---|---|
| Maximum | 1300 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.4 |
| Q1 | 1.5 |
| median | 4.5 |
| Q3 | 46.275 |
| 95-th percentile | 225 |
| Maximum | 1300 |
| Range | 1300 |
| Interquartile range (IQR) | 44.775 |
Descriptive statistics
| Standard deviation | 125.8161606 |
|---|---|
| Coefficient of variation (CV) | 2.413744404 |
| Kurtosis | 31.10802866 |
| Mean | 52.1248896 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 4.944792423 |
| Sum | 259686.2 |
| Variance | 15829.70627 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 252 | 5.0% |
| 0.5 | 194 | 3.9% |
| 1 | 175 | 3.5% |
| 1.5 | 132 | 2.6% |
| 0.3 | 121 | 2.4% |
| 1.8 | 117 | 2.3% |
| 3 | 96 | 1.9% |
| 1.6 | 96 | 1.9% |
| 0.8 | 90 | 1.8% |
| 0.6 | 84 | 1.7% |
| Other values (817) | 3625 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 0.1 | 38 | 0.8% |
| 0.2 | 54 | 1.1% |
| 0.3 | 121 | |
| 0.4 | 73 | 1.5% |
| 0.5 | 194 | |
| 0.6 | 84 | |
| 0.7 | 61 | 1.2% |
| 0.8 | 90 | |
| 0.9 | 81 |
| Value | Count | Frequency (%) |
| 1300 | 1 | |
| 1299 | 1 | |
| 1249.1 | 1 | |
| 1239 | 2 | |
| 1231 | 2 | |
| 1160.1 | 1 | |
| 1150.1 | 1 | |
| 1110 | 2 | |
| 1104.9 | 1 | |
| 1103.7 | 1 |
| Distinct | 809 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 27 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53.79525437 |
| Minimum | 0 |
|---|---|
| Maximum | 1299 |
| Zeros | 14 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.4 |
| Q1 | 1.5 |
| median | 4.6 |
| Q3 | 49.5 |
| 95-th percentile | 230 |
| Maximum | 1299 |
| Range | 1299 |
| Interquartile range (IQR) | 48 |
Descriptive statistics
| Standard deviation | 128.2640542 |
|---|---|
| Coefficient of variation (CV) | 2.384300543 |
| Kurtosis | 30.27219714 |
| Mean | 53.79525437 |
| Median Absolute Deviation (MAD) | 4.1 |
| Skewness | 4.85992631 |
| Sum | 267523.8 |
| Variance | 16451.6676 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 249 | 5.0% |
| 0.5 | 184 | 3.7% |
| 1 | 170 | 3.4% |
| 1.5 | 126 | 2.5% |
| 0.3 | 123 | 2.5% |
| 1.8 | 116 | 2.3% |
| 1.6 | 105 | 2.1% |
| 0.8 | 103 | 2.1% |
| 5 | 100 | 2.0% |
| 3 | 87 | 1.7% |
| Other values (799) | 3610 |
| Value | Count | Frequency (%) |
| 0 | 14 | 0.3% |
| 0.1 | 42 | 0.8% |
| 0.2 | 56 | 1.1% |
| 0.3 | 123 | |
| 0.4 | 69 | 1.4% |
| 0.5 | 184 | |
| 0.6 | 80 | |
| 0.7 | 67 | 1.3% |
| 0.8 | 103 | |
| 0.9 | 74 |
| Value | Count | Frequency (%) |
| 1299 | 2 | |
| 1265 | 2 | |
| 1257 | 2 | |
| 1249.1 | 1 | |
| 1198.7 | 1 | |
| 1179.8 | 1 | |
| 1135.2 | 1 | |
| 1134.2 | 1 | |
| 1131.7 | 1 | |
| 1110 | 2 |
| Distinct | 915 |
|---|---|
| Distinct (%) | 18.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| 2016-12 | 83 |
|---|---|
| 2015-02 | 74 |
| 2001-06 | 53 |
| 2012-12 | 51 |
| 2014-12 | 41 |
| Other values (910) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 35000 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 203 ? |
|---|---|
| Unique (%) | 4.1% |
Sample
| 1st row | 2017-07 |
|---|---|
| 2nd row | 2017-07 |
| 3rd row | 2001-07 |
| 4th row | 2001-07 |
| 5th row | 1992-03 |
Common Values
| Value | Count | Frequency (%) |
| 2016-12 | 83 | 1.7% |
| 2015-02 | 74 | 1.5% |
| 2001-06 | 53 | 1.1% |
| 2012-12 | 51 | 1.0% |
| 2014-12 | 41 | 0.8% |
| 2013-12 | 41 | 0.8% |
| 2015-12 | 36 | 0.7% |
| 2003-06 | 35 | 0.7% |
| 2000-06 | 35 | 0.7% |
| 2002-07 | 34 | 0.7% |
| Other values (905) | 4517 |
Length
| Value | Count | Frequency (%) |
| 2016-12 | 83 | 1.7% |
| 2015-02 | 74 | 1.5% |
| 2001-06 | 53 | 1.1% |
| 2012-12 | 51 | 1.0% |
| 2014-12 | 41 | 0.8% |
| 2013-12 | 41 | 0.8% |
| 2015-12 | 36 | 0.7% |
| 2003-06 | 35 | 0.7% |
| 2000-06 | 35 | 0.7% |
| 2002-07 | 34 | 0.7% |
| Other values (905) | 4517 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8053 | |
| 1 | 6842 | |
| - | 5000 | |
| 2 | 4333 | |
| 9 | 3663 | |
| 6 | 1488 | 4.3% |
| 8 | 1312 | 3.7% |
| 7 | 1298 | 3.7% |
| 5 | 1201 | 3.4% |
| 4 | 908 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 30000 | |
| Dash Punctuation | 5000 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8053 | |
| 1 | 6842 | |
| 2 | 4333 | |
| 9 | 3663 | |
| 6 | 1488 | 5.0% |
| 8 | 1312 | 4.4% |
| 7 | 1298 | 4.3% |
| 5 | 1201 | 4.0% |
| 4 | 908 | 3.0% |
| 3 | 902 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 35000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8053 | |
| 1 | 6842 | |
| - | 5000 | |
| 2 | 4333 | |
| 9 | 3663 | |
| 6 | 1488 | 4.3% |
| 8 | 1312 | 3.7% |
| 7 | 1298 | 3.7% |
| 5 | 1201 | 3.4% |
| 4 | 908 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8053 | |
| 1 | 6842 | |
| - | 5000 | |
| 2 | 4333 | |
| 9 | 3663 | |
| 6 | 1488 | 4.3% |
| 8 | 1312 | 3.7% |
| 7 | 1298 | 3.7% |
| 5 | 1201 | 3.4% |
| 4 | 908 | 2.6% |
| Distinct | 36 |
|---|---|
| Distinct (%) | 30.5% |
| Missing | 4882 |
| Missing (%) | 97.6% |
| Memory size | 39.2 KiB |
| 2017-12 | |
|---|---|
| 2020-11 | |
| 2017-08 | |
| 2023-12 | |
| 2024-12 | 6 |
| Other values (31) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 826 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | 10.2% |
Sample
| 1st row | 2026-12 |
|---|---|
| 2nd row | 2021-05 |
| 3rd row | 2021-07 |
| 4th row | 2021-06 |
| 5th row | 2021-08 |
Common Values
| Value | Count | Frequency (%) |
| 2017-12 | 18 | 0.4% |
| 2020-11 | 10 | 0.2% |
| 2017-08 | 8 | 0.2% |
| 2023-12 | 7 | 0.1% |
| 2024-12 | 6 | 0.1% |
| 2020-12 | 5 | 0.1% |
| 2031-12 | 5 | 0.1% |
| 2021-06 | 5 | 0.1% |
| 2025-12 | 4 | 0.1% |
| 2022-11 | 4 | 0.1% |
| Other values (26) | 46 | 0.9% |
| (Missing) | 4882 |
Length
| Value | Count | Frequency (%) |
| 2017-12 | 18 | 15.3% |
| 2020-11 | 10 | 8.5% |
| 2017-08 | 8 | 6.8% |
| 2023-12 | 7 | 5.9% |
| 2024-12 | 6 | 5.1% |
| 2020-12 | 5 | 4.2% |
| 2031-12 | 5 | 4.2% |
| 2021-06 | 5 | 4.2% |
| 2026-12 | 4 | 3.4% |
| 2025-12 | 4 | 3.4% |
| Other values (26) | 46 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 240 | |
| 0 | 180 | |
| 1 | 162 | |
| - | 118 | |
| 7 | 38 | 4.6% |
| 8 | 21 | 2.5% |
| 6 | 19 | 2.3% |
| 3 | 18 | 2.2% |
| 9 | 13 | 1.6% |
| 4 | 11 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 708 | |
| Dash Punctuation | 118 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 240 | |
| 0 | 180 | |
| 1 | 162 | |
| 7 | 38 | 5.4% |
| 8 | 21 | 3.0% |
| 6 | 19 | 2.7% |
| 3 | 18 | 2.5% |
| 9 | 13 | 1.8% |
| 4 | 11 | 1.6% |
| 5 | 6 | 0.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 118 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 826 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 240 | |
| 0 | 180 | |
| 1 | 162 | |
| - | 118 | |
| 7 | 38 | 4.6% |
| 8 | 21 | 2.5% |
| 6 | 19 | 2.3% |
| 3 | 18 | 2.2% |
| 9 | 13 | 1.6% |
| 4 | 11 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 826 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 240 | |
| 0 | 180 | |
| 1 | 162 | |
| - | 118 | |
| 7 | 38 | 4.6% |
| 8 | 21 | 2.5% |
| 6 | 19 | 2.3% |
| 3 | 18 | 2.2% |
| 9 | 13 | 1.6% |
| 4 | 11 | 1.3% |
| Distinct | 23 |
|---|---|
| Distinct (%) | 88.5% |
| Missing | 4974 |
| Missing (%) | 99.5% |
| Memory size | 39.2 KiB |
| 2021-05 | |
|---|---|
| 2018-05 | |
| 2022-12 | |
| 2018-01 | 1 |
| 2019-12 | 1 |
| Other values (18) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 182 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | 76.9% |
Sample
| 1st row | 2022-12 |
|---|---|
| 2nd row | 2022-12 |
| 3rd row | 2023-12 |
| 4th row | 2024-12 |
| 5th row | 2025-12 |
Common Values
| Value | Count | Frequency (%) |
| 2021-05 | 2 | < 0.1% |
| 2018-05 | 2 | < 0.1% |
| 2022-12 | 2 | < 0.1% |
| 2018-01 | 1 | < 0.1% |
| 2019-12 | 1 | < 0.1% |
| 2018-12 | 1 | < 0.1% |
| 2018-09 | 1 | < 0.1% |
| 2019-04 | 1 | < 0.1% |
| 2018-11 | 1 | < 0.1% |
| 2018-02 | 1 | < 0.1% |
| Other values (13) | 13 | 0.3% |
| (Missing) | 4974 |
Length
| Value | Count | Frequency (%) |
| 2021-05 | 2 | 7.7% |
| 2022-12 | 2 | 7.7% |
| 2018-05 | 2 | 7.7% |
| 2022-06 | 1 | 3.8% |
| 2023-12 | 1 | 3.8% |
| 2024-12 | 1 | 3.8% |
| 2025-12 | 1 | 3.8% |
| 2026-12 | 1 | 3.8% |
| 2027-12 | 1 | 3.8% |
| 2023-06 | 1 | 3.8% |
| Other values (13) | 13 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 54 | |
| 0 | 44 | |
| 1 | 27 | |
| - | 26 | |
| 8 | 8 | 4.4% |
| 5 | 6 | 3.3% |
| 9 | 5 | 2.7% |
| 6 | 5 | 2.7% |
| 4 | 3 | 1.6% |
| 3 | 3 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 156 | |
| Dash Punctuation | 26 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 54 | |
| 0 | 44 | |
| 1 | 27 | |
| 8 | 8 | 5.1% |
| 5 | 6 | 3.8% |
| 9 | 5 | 3.2% |
| 6 | 5 | 3.2% |
| 4 | 3 | 1.9% |
| 3 | 3 | 1.9% |
| 7 | 1 | 0.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 26 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 182 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 54 | |
| 0 | 44 | |
| 1 | 27 | |
| - | 26 | |
| 8 | 8 | 4.4% |
| 5 | 6 | 3.3% |
| 9 | 5 | 2.7% |
| 6 | 5 | 2.7% |
| 4 | 3 | 1.6% |
| 3 | 3 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 182 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 54 | |
| 0 | 44 | |
| 1 | 27 | |
| - | 26 | |
| 8 | 8 | 4.4% |
| 5 | 6 | 3.3% |
| 9 | 5 | 2.7% |
| 6 | 5 | 2.7% |
| 4 | 3 | 1.6% |
| 3 | 3 | 1.6% |
| Distinct | 12 |
|---|---|
| Distinct (%) | 46.2% |
| Missing | 4974 |
| Missing (%) | 99.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.31153846 |
| Minimum | 0.5 |
|---|---|
| Maximum | 155 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0.5 |
|---|---|
| 5-th percentile | 0.825 |
| Q1 | 6 |
| median | 6.4 |
| Q3 | 63 |
| 95-th percentile | 155 |
| Maximum | 155 |
| Range | 154.5 |
| Interquartile range (IQR) | 57 |
Descriptive statistics
| Standard deviation | 49.62692275 |
|---|---|
| Coefficient of variation (CV) | 1.295351864 |
| Kurtosis | 1.44674801 |
| Mean | 38.31153846 |
| Median Absolute Deviation (MAD) | 5.9 |
| Skewness | 1.524962208 |
| Sum | 996.1 |
| Variance | 2462.831462 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 6 | 0.1% |
| 65 | 4 | 0.1% |
| 57 | 3 | 0.1% |
| 155 | 3 | 0.1% |
| 6.4 | 2 | < 0.1% |
| 0.5 | 2 | < 0.1% |
| 3.5 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
| (Missing) | 4974 |
| Value | Count | Frequency (%) |
| 0.5 | 2 | < 0.1% |
| 1.8 | 1 | < 0.1% |
| 3.5 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 6 | |
| 6.4 | 2 | < 0.1% |
| 16 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 57 | 3 |
| Value | Count | Frequency (%) |
| 155 | 3 | |
| 65 | 4 | |
| 57 | 3 | |
| 20 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 6.4 | 2 | < 0.1% |
| 6 | 6 | |
| 5 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 3.5 | 1 | < 0.1% |
| Distinct | 714 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 4 |
| Missing (%) | 0.1% |
| Memory size | 39.2 KiB |
| San Bernardino | 150 |
|---|---|
| Los Angeles | 89 |
| Kern | 78 |
| Riverside | 70 |
| Jackson | 55 |
| Other values (709) |
Length
| Max length | 25 |
|---|---|
| Median length | 19 |
| Mean length | 7.685348279 |
| Min length | 3 |
Characters and Unicode
| Total characters | 38396 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 159 ? |
|---|---|
| Unique (%) | 3.2% |
Sample
| 1st row | Calhoun |
|---|---|
| 2nd row | Calhoun |
| 3rd row | Pecos |
| 4th row | Pecos |
| 5th row | Salem |
Common Values
| Value | Count | Frequency (%) |
| San Bernardino | 150 | 3.0% |
| Los Angeles | 89 | 1.8% |
| Kern | 78 | 1.6% |
| Riverside | 70 | 1.4% |
| Jackson | 55 | 1.1% |
| Harris | 54 | 1.1% |
| Douglas | 50 | 1.0% |
| Orange | 49 | 1.0% |
| Franklin | 48 | 1.0% |
| Wayne | 44 | 0.9% |
| Other values (704) | 4309 |
Length
| Value | Count | Frequency (%) |
| san | 193 | 3.3% |
| bernardino | 150 | 2.6% |
| los | 89 | 1.5% |
| angeles | 89 | 1.5% |
| kern | 78 | 1.3% |
| riverside | 70 | 1.2% |
| new | 70 | 1.2% |
| st | 59 | 1.0% |
| jackson | 55 | 0.9% |
| harris | 54 | 0.9% |
| Other values (740) | 4954 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4002 | 10.4% |
| e | 3788 | 9.9% |
| n | 3339 | 8.7% |
| o | 2827 | 7.4% |
| r | 2739 | 7.1% |
| i | 2166 | 5.6% |
| l | 1824 | 4.8% |
| s | 1661 | 4.3% |
| t | 1468 | 3.8% |
| d | 1168 | 3.0% |
| Other values (43) | 13414 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31589 | |
| Uppercase Letter | 5940 | 15.5% |
| Space Separator | 865 | 2.3% |
| Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4002 | |
| e | 3788 | |
| n | 3339 | |
| o | 2827 | |
| r | 2739 | |
| i | 2166 | 6.9% |
| l | 1824 | 5.8% |
| s | 1661 | 5.3% |
| t | 1468 | 4.6% |
| d | 1168 | 3.7% |
| Other values (16) | 6607 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 832 | |
| C | 594 | 10.0% |
| B | 462 | 7.8% |
| L | 373 | 6.3% |
| M | 362 | 6.1% |
| A | 346 | 5.8% |
| H | 345 | 5.8% |
| W | 314 | 5.3% |
| P | 278 | 4.7% |
| D | 222 | 3.7% |
| Other values (15) | 1812 |
Space Separator
| Value | Count | Frequency (%) |
| 865 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37529 | |
| Common | 867 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4002 | 10.7% |
| e | 3788 | 10.1% |
| n | 3339 | 8.9% |
| o | 2827 | 7.5% |
| r | 2739 | 7.3% |
| i | 2166 | 5.8% |
| l | 1824 | 4.9% |
| s | 1661 | 4.4% |
| t | 1468 | 3.9% |
| d | 1168 | 3.1% |
| Other values (41) | 12547 |
Common
| Value | Count | Frequency (%) |
| 865 | ||
| ' | 2 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38396 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4002 | 10.4% |
| e | 3788 | 9.9% |
| n | 3339 | 8.7% |
| o | 2827 | 7.4% |
| r | 2739 | 7.1% |
| i | 2166 | 5.6% |
| l | 1824 | 4.8% |
| s | 1661 | 4.3% |
| t | 1468 | 3.8% |
| d | 1168 | 3.0% |
| Other values (43) | 13414 |
| Distinct | 1968 |
|---|---|
| Distinct (%) | 39.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -95.61612094 |
| Minimum | -170.475661 |
|---|---|
| Maximum | 93.968056 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4994 |
| Negative (%) | 99.9% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | -170.475661 |
|---|---|
| 5-th percentile | -132.821435 |
| Q1 | -112.904028 |
| median | -91.297894 |
| Q3 | -80.780246 |
| 95-th percentile | -72.776111 |
| Maximum | 93.968056 |
| Range | 264.443717 |
| Interquartile range (IQR) | 32.123782 |
Descriptive statistics
| Standard deviation | 21.10085811 |
|---|---|
| Coefficient of variation (CV) | -0.2206830596 |
| Kurtosis | 7.200314911 |
| Mean | -95.61612094 |
| Median Absolute Deviation (MAD) | 12.226066 |
| Skewness | -0.2178077608 |
| Sum | -478080.6047 |
| Variance | 445.2462131 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -77.467185 | 27 | 0.5% |
| -99.0919 | 27 | 0.5% |
| -87.9861 | 24 | 0.5% |
| -90.14868 | 22 | 0.4% |
| -91.551221 | 20 | 0.4% |
| -95.235078 | 20 | 0.4% |
| -76.8408 | 18 | 0.4% |
| -95.5306 | 17 | 0.3% |
| -86.4006 | 17 | 0.3% |
| -92.589364 | 17 | 0.3% |
| Other values (1958) | 4791 |
| Value | Count | Frequency (%) |
| -170.475661 | 2 | < 0.1% |
| -166.737211 | 4 | |
| -165.429814 | 8 | |
| -164.6544 | 1 | < 0.1% |
| -164.538447 | 2 | < 0.1% |
| -163.729072 | 4 | |
| -163.553106 | 4 | |
| -163.005833 | 4 | |
| -162.965728 | 3 | 0.1% |
| -162.880706 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 93.968056 | 4 | 0.1% |
| 72.021944 | 2 | < 0.1% |
| -68.21 | 1 | < 0.1% |
| -68.63554 | 2 | < 0.1% |
| -68.704368 | 13 | |
| -69.583527 | 8 | |
| -69.647441 | 2 | < 0.1% |
| -69.812168 | 1 | < 0.1% |
| -69.8658 | 4 | 0.1% |
| -70.0517 | 2 | < 0.1% |
| Distinct | 1972 |
|---|---|
| Distinct (%) | 39.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.24465948 |
| Minimum | 19.6316 |
|---|---|
| Maximum | 70.642877 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 19.6316 |
|---|---|
| 5-th percentile | 29.72297075 |
| Q1 | 34.632509 |
| median | 38.7506 |
| Q3 | 42.704391 |
| 95-th percentile | 48.2142 |
| Maximum | 70.642877 |
| Range | 51.011277 |
| Interquartile range (IQR) | 8.071882 |
Descriptive statistics
| Standard deviation | 7.047003619 |
|---|---|
| Coefficient of variation (CV) | 0.179565926 |
| Kurtosis | 4.077793006 |
| Mean | 39.24465948 |
| Median Absolute Deviation (MAD) | 4.0464 |
| Skewness | 1.343023199 |
| Sum | 196223.2974 |
| Variance | 49.66026 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37.193584 | 27 | 0.5% |
| 28.9275 | 27 | 0.5% |
| 36.0278 | 24 | 0.5% |
| 35.074087 | 22 | 0.4% |
| 40.971826 | 20 | 0.4% |
| 38.974022 | 20 | 0.4% |
| 42.9281 | 18 | 0.4% |
| 29.9417 | 17 | 0.3% |
| 33.435 | 17 | 0.3% |
| 33.296146 | 17 | 0.3% |
| Other values (1962) | 4791 |
| Value | Count | Frequency (%) |
| 19.6316 | 2 | < 0.1% |
| 19.7041 | 2 | < 0.1% |
| 19.7052 | 5 | |
| 19.7203 | 2 | < 0.1% |
| 19.7264 | 2 | < 0.1% |
| 19.7317 | 7 | |
| 20.0252 | 1 | < 0.1% |
| 20.0939 | 6 | |
| 20.257252 | 1 | < 0.1% |
| 21.106 | 4 |
| Value | Count | Frequency (%) |
| 70.642877 | 5 | |
| 70.4826 | 7 | |
| 70.220565 | 6 | |
| 70.125617 | 4 | |
| 69.740833 | 4 | |
| 68.348424 | 4 | |
| 68.13795 | 5 | |
| 67.726644 | 2 | < 0.1% |
| 67.570931 | 3 | |
| 67.08798 | 3 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| MW |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MW |
|---|---|
| 2nd row | MW |
| 3rd row | MW |
| 4th row | MW |
| 5th row | MW |
Common Values
| Value | Count | Frequency (%) |
| MW | 5000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| mw | 5000 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| MW |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MW |
|---|---|
| 2nd row | MW |
| 3rd row | MW |
| 4th row | MW |
| 5th row | MW |
Common Values
| Value | Count | Frequency (%) |
| MW | 5000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| mw | 5000 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| MW |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MW |
|---|---|
| 2nd row | MW |
| 3rd row | MW |
| 4th row | MW |
| 5th row | MW |
Common Values
| Value | Count | Frequency (%) |
| MW | 5000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| mw | 5000 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| MW |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MW |
|---|---|
| 2nd row | MW |
| 3rd row | MW |
| 4th row | MW |
| 5th row | MW |
Common Values
| Value | Count | Frequency (%) |
| MW | 5000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| mw | 5000 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| MW |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MW |
|---|---|
| 2nd row | MW |
| 3rd row | MW |
| 4th row | MW |
| 5th row | MW |
Common Values
| Value | Count | Frequency (%) |
| MW | 5000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| mw | 5000 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 5000 | |
| W | 5000 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| Unnamed: 0 | period | stateid | stateName | sector | sectorName | entityid | entityName | plantid | plantName | generatorid | unit | technology | energy_source_code | energy-source-desc | prime_mover_code | balancing_authority_code | balancing-authority-name | status | statusDescription | nameplate-capacity-mw | net-summer-capacity-mw | net-winter-capacity-mw | operating-year-month | planned-retirement-year-month | planned-derate-year-month | planned-derate-summer-cap-mw | planned-uprate-year-month | planned-uprate-summer-cap-mw | county | longitude | latitude | nameplate-capacity-mw-units | net-summer-capacity-mw-units | net-winter-capacity-mw-units | planned-derate-summer-cap-mw-units | planned-uprate-summer-cap-mw-units | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 2020-09 | TX | Texas | ipp-non-chp | IPP Non-CHP | 61199 | Peaker Power, LLC | 60459 | Port Comfort Power LLC | PC1 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | ERCO | Electric Reliability Council of Texas, Inc. | OP | Operating | 60.5 | 43.0 | 46.0 | 2017-07 | NaN | NaN | NaN | NaN | NaN | Calhoun | -96.546210 | 28.648070 | MW | MW | MW | MW | MW |
| 1 | 1 | 2020-09 | TX | Texas | ipp-non-chp | IPP Non-CHP | 61199 | Peaker Power, LLC | 60459 | Port Comfort Power LLC | PC2 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | ERCO | Electric Reliability Council of Texas, Inc. | OP | Operating | 60.5 | 43.0 | 46.0 | 2017-07 | NaN | NaN | NaN | NaN | NaN | Calhoun | -96.546210 | 28.648070 | MW | MW | MW | MW | MW |
| 2 | 2 | 2020-09 | TX | Texas | ipp-non-chp | IPP Non-CHP | 14628 | Pecos Wind I LP | 55796 | Woodward Mountain I | 1 | NaN | Onshore Wind Turbine | WND | Wind | WT | ERCO | Electric Reliability Council of Texas, Inc. | OP | Operating | 82.0 | 82.0 | 82.0 | 2001-07 | NaN | NaN | NaN | NaN | NaN | Pecos | -102.414067 | 30.951400 | MW | MW | MW | MW | MW |
| 3 | 3 | 2020-09 | TX | Texas | ipp-non-chp | IPP Non-CHP | 14629 | Pecos Wind II LP | 55795 | Woodward Mountain II | 1 | NaN | Onshore Wind Turbine | WND | Wind | WT | ERCO | Electric Reliability Council of Texas, Inc. | OP | Operating | 78.0 | 78.0 | 78.0 | 2001-07 | NaN | NaN | NaN | NaN | NaN | Pecos | -102.414067 | 30.951400 | MW | MW | MW | MW | MW |
| 4 | 4 | 2020-09 | NJ | New Jersey | ipp-non-chp | IPP Non-CHP | 50160 | Pedricktown Cogeneration Company LP | 10099 | Pedricktown Cogeneration Company LP | GEN1 | CC1 | Natural Gas Fired Combined Cycle | NG | Natural Gas | CT | PJM | PJM Interconnection, LLC | OP | Operating | 95.2 | 112.8 | 112.6 | 1992-03 | NaN | NaN | NaN | NaN | NaN | Salem | -75.423800 | 39.766800 | MW | MW | MW | MW | MW |
| 5 | 5 | 2020-09 | NJ | New Jersey | ipp-non-chp | IPP Non-CHP | 50160 | Pedricktown Cogeneration Company LP | 10099 | Pedricktown Cogeneration Company LP | GEN2 | CC1 | Natural Gas Fired Combined Cycle | NG | Natural Gas | CA | PJM | PJM Interconnection, LLC | OP | Operating | 45.0 | NaN | NaN | 1992-03 | NaN | NaN | NaN | NaN | NaN | Salem | -75.423800 | 39.766800 | MW | MW | MW | MW | MW |
| 6 | 6 | 2020-09 | MN | Minnesota | ipp-non-chp | IPP Non-CHP | 60823 | Pegasus Community Solar | 61175 | Pegasus Community Solar | CPCS1 | NaN | Solar Photovoltaic | SUN | Solar | PV | MISO | Midcontinent Independent Transmission System Operator, Inc.. | OP | Operating | 1.0 | 0.9 | 0.9 | 2017-08 | NaN | NaN | NaN | NaN | NaN | Stearns | -95.124682 | 45.495453 | MW | MW | MW | MW | MW |
| 7 | 7 | 2020-09 | MN | Minnesota | ipp-non-chp | IPP Non-CHP | 60823 | Pegasus Community Solar | 61175 | Pegasus Community Solar | CPCS2 | NaN | Solar Photovoltaic | SUN | Solar | PV | MISO | Midcontinent Independent Transmission System Operator, Inc.. | OP | Operating | 1.0 | 0.9 | 0.9 | 2017-08 | NaN | NaN | NaN | NaN | NaN | Stearns | -95.124682 | 45.495453 | MW | MW | MW | MW | MW |
| 8 | 8 | 2020-09 | MI | Michigan | ipp-non-chp | IPP Non-CHP | 61521 | Pegasus Wind, LLC | 61916 | Pegasus Wind | PWEC | NaN | Onshore Wind Turbine | WND | Wind | WT | MISO | Midcontinent Independent Transmission System Operator, Inc.. | OP | Operating | 48.0 | 48.0 | 48.0 | 2019-12 | NaN | NaN | NaN | NaN | NaN | Tuscola | -83.507210 | 43.452003 | MW | MW | MW | MW | MW |
| 9 | 9 | 2020-09 | MI | Michigan | ipp-non-chp | IPP Non-CHP | 61521 | Pegasus Wind, LLC | 61916 | Pegasus Wind | PWEC2 | NaN | Onshore Wind Turbine | WND | Wind | WT | MISO | Midcontinent Independent Transmission System Operator, Inc.. | OP | Operating | 130.0 | 130.0 | 130.0 | 2020-09 | NaN | NaN | NaN | NaN | NaN | Tuscola | -83.507210 | 43.452003 | MW | MW | MW | MW | MW |
Last rows
| Unnamed: 0 | period | stateid | stateName | sector | sectorName | entityid | entityName | plantid | plantName | generatorid | unit | technology | energy_source_code | energy-source-desc | prime_mover_code | balancing_authority_code | balancing-authority-name | status | statusDescription | nameplate-capacity-mw | net-summer-capacity-mw | net-winter-capacity-mw | operating-year-month | planned-retirement-year-month | planned-derate-year-month | planned-derate-summer-cap-mw | planned-uprate-year-month | planned-uprate-summer-cap-mw | county | longitude | latitude | nameplate-capacity-mw-units | net-summer-capacity-mw-units | net-winter-capacity-mw-units | planned-derate-summer-cap-mw-units | planned-uprate-summer-cap-mw-units | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4990 | 4990 | 2017-01 | IL | Illinois | electric-utility | Electric Utility | 3153 | City of Casey - (IL) | 56053 | Casey City of | 2 | NaN | Petroleum Liquids | DFO | Disillate Fuel Oil | IC | MISO | Midcontinent Independent Transmission System Operator, Inc.. | SB | Standby/Backup: available for service but not normally used | 1.8 | 1.8 | 1.8 | 2002-05 | NaN | NaN | NaN | NaN | NaN | Clark | -87.992592 | 39.310555 | MW | MW | MW | MW | MW |
| 4991 | 4991 | 2017-01 | IL | Illinois | electric-utility | Electric Utility | 3153 | City of Casey - (IL) | 56053 | Casey City of | 3 | NaN | Petroleum Liquids | DFO | Disillate Fuel Oil | IC | MISO | Midcontinent Independent Transmission System Operator, Inc.. | SB | Standby/Backup: available for service but not normally used | 1.8 | 1.8 | 1.8 | 2002-05 | NaN | NaN | NaN | NaN | NaN | Clark | -87.992592 | 39.310555 | MW | MW | MW | MW | MW |
| 4992 | 4992 | 2017-01 | CO | Colorado | electric-utility | Electric Utility | 3227 | City of Center - (CO) | 491 | Center | 3 | NaN | Petroleum Liquids | DFO | Disillate Fuel Oil | IC | PSCO | Public Service Company of Colorado | SB | Standby/Backup: available for service but not normally used | 0.5 | 0.5 | 0.5 | 1963-07 | NaN | NaN | NaN | NaN | NaN | Saguache | -106.104670 | 37.753606 | MW | MW | MW | MW | MW |
| 4993 | 4993 | 2017-01 | CO | Colorado | electric-utility | Electric Utility | 3227 | City of Center - (CO) | 491 | Center | 5 | NaN | Petroleum Liquids | DFO | Disillate Fuel Oil | IC | PSCO | Public Service Company of Colorado | SB | Standby/Backup: available for service but not normally used | 1.0 | 1.0 | 1.0 | 1959-08 | NaN | NaN | NaN | NaN | NaN | Saguache | -106.104670 | 37.753606 | MW | MW | MW | MW | MW |
| 4994 | 4994 | 2017-09 | SC | South Carolina | ipp-non-chp | IPP Non-CHP | 54810 | Broad River Energy LLC | 55166 | Broad River Energy Center | CT01 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | DUK | Duke Energy Carolinas | OP | Operating | 197.0 | 173.4 | 200.8 | 2000-07 | NaN | NaN | NaN | NaN | NaN | Cherokee | -81.575000 | 35.078600 | MW | MW | MW | MW | MW |
| 4995 | 4995 | 2017-09 | SC | South Carolina | ipp-non-chp | IPP Non-CHP | 54810 | Broad River Energy LLC | 55166 | Broad River Energy Center | CT02 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | DUK | Duke Energy Carolinas | OP | Operating | 197.0 | 170.5 | 197.3 | 2000-07 | NaN | NaN | NaN | NaN | NaN | Cherokee | -81.575000 | 35.078600 | MW | MW | MW | MW | MW |
| 4996 | 4996 | 2017-09 | SC | South Carolina | ipp-non-chp | IPP Non-CHP | 54810 | Broad River Energy LLC | 55166 | Broad River Energy Center | CT03 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | DUK | Duke Energy Carolinas | OP | Operating | 197.0 | 169.4 | 196.0 | 2000-07 | NaN | NaN | NaN | NaN | NaN | Cherokee | -81.575000 | 35.078600 | MW | MW | MW | MW | MW |
| 4997 | 4997 | 2017-09 | SC | South Carolina | ipp-non-chp | IPP Non-CHP | 54810 | Broad River Energy LLC | 55166 | Broad River Energy Center | CT04 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | DUK | Duke Energy Carolinas | OP | Operating | 197.0 | 174.0 | 201.4 | 2001-06 | NaN | NaN | NaN | NaN | NaN | Cherokee | -81.575000 | 35.078600 | MW | MW | MW | MW | MW |
| 4998 | 4998 | 2017-06 | NY | New York | ipp-non-chp | IPP Non-CHP | 5511 | CCI Roseton LLC | 8006 | Roseton Generating Facility | 2 | NaN | Natural Gas Steam Turbine | NG | Natural Gas | ST | NYIS | New York Independent System Operator | OP | Operating | 621.0 | 604.0 | 605.7 | 1974-09 | NaN | NaN | NaN | NaN | NaN | Orange | -73.966269 | 41.573783 | MW | MW | MW | MW | MW |
| 4999 | 4999 | 2017-06 | NC | North Carolina | ipp-non-chp | IPP Non-CHP | 60865 | CD Global Solar Holdings, LLC | 61258 | Innovative Solar 35, LLC | ISS35 | NaN | Solar Photovoltaic | SUN | Solar | PV | CPLE | Duke Energy Progress East | OP | Operating | 1.9 | 1.9 | 1.9 | 2017-02 | NaN | NaN | NaN | NaN | NaN | Duplin | -77.825000 | 35.046000 | MW | MW | MW | MW | MW |