Dataset statistics
Number of variables | 37 |
---|---|
Number of observations | 5000 |
Missing cells | 29983 |
Missing cells (%) | 16.2% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.4 MiB |
Average record size in memory | 296.0 B |
Variable types
Numeric | 9 |
---|---|
Categorical | 26 |
Unsupported | 2 |
nameplate-capacity-mw-units has constant value "MW" | Constant |
net-summer-capacity-mw-units has constant value "MW" | Constant |
net-winter-capacity-mw-units has constant value "MW" | Constant |
planned-derate-summer-cap-mw-units has constant value "MW" | Constant |
planned-uprate-summer-cap-mw-units has constant value "MW" | Constant |
stateid has a high cardinality: 51 distinct values | High cardinality |
stateName has a high cardinality: 51 distinct values | High cardinality |
entityName has a high cardinality: 985 distinct values | High cardinality |
plantName has a high cardinality: 2011 distinct values | High cardinality |
generatorid has a high cardinality: 1541 distinct values | High cardinality |
unit has a high cardinality: 80 distinct values | High cardinality |
balancing-authority-name has a high cardinality: 52 distinct values | High cardinality |
operating-year-month has a high cardinality: 915 distinct values | High cardinality |
county has a high cardinality: 714 distinct values | High cardinality |
entityid is highly correlated with Unnamed: 0 and 17 other fields | High correlation |
plantid is highly correlated with stateid and 14 other fields | High correlation |
nameplate-capacity-mw is highly correlated with unit and 9 other fields | High correlation |
net-summer-capacity-mw is highly correlated with unit and 9 other fields | High correlation |
net-winter-capacity-mw is highly correlated with unit and 9 other fields | High correlation |
planned-uprate-summer-cap-mw is highly correlated with Unnamed: 0 and 17 other fields | High correlation |
planned-uprate-year-month is highly correlated with period and 17 other fields | High correlation |
energy_source_code is highly correlated with stateid and 21 other fields | High correlation |
planned-derate-summer-cap-mw-units is highly correlated with planned-uprate-year-month and 19 other fields | High correlation |
stateid is highly correlated with Unnamed: 0 and 18 other fields | High correlation |
stateName is highly correlated with Unnamed: 0 and 18 other fields | High correlation |
technology is highly correlated with period and 21 other fields | High correlation |
sector is highly correlated with stateid and 13 other fields | High correlation |
balancing-authority-name is highly correlated with Unnamed: 0 and 18 other fields | High correlation |
sectorName is highly correlated with stateid and 13 other fields | High correlation |
statusDescription is highly correlated with technology and 4 other fields | High correlation |
net-summer-capacity-mw-units is highly correlated with planned-uprate-year-month and 19 other fields | High correlation |
prime_mover_code is highly correlated with stateid and 17 other fields | High correlation |
planned-retirement-year-month is highly correlated with Unnamed: 0 and 20 other fields | High correlation |
period is highly correlated with Unnamed: 0 and 11 other fields | High correlation |
unit is highly correlated with Unnamed: 0 and 18 other fields | High correlation |
nameplate-capacity-mw-units is highly correlated with planned-uprate-year-month and 19 other fields | High correlation |
energy-source-desc is highly correlated with stateid and 21 other fields | High correlation |
planned-uprate-summer-cap-mw-units is highly correlated with planned-uprate-year-month and 19 other fields | High correlation |
net-winter-capacity-mw-units is highly correlated with planned-uprate-year-month and 19 other fields | High correlation |
status is highly correlated with technology and 4 other fields | High correlation |
balancing_authority_code is highly correlated with Unnamed: 0 and 18 other fields | High correlation |
Unnamed: 0 is highly correlated with period and 8 other fields | High correlation |
longitude is highly correlated with period and 11 other fields | High correlation |
latitude is highly correlated with stateid and 11 other fields | High correlation |
unit has 4542 (90.8%) missing values | Missing |
balancing_authority_code has 285 (5.7%) missing values | Missing |
balancing-authority-name has 277 (5.5%) missing values | Missing |
planned-retirement-year-month has 4882 (97.6%) missing values | Missing |
planned-derate-year-month has 5000 (100.0%) missing values | Missing |
planned-derate-summer-cap-mw has 5000 (100.0%) missing values | Missing |
planned-uprate-year-month has 4974 (99.5%) missing values | Missing |
planned-uprate-summer-cap-mw has 4974 (99.5%) missing values | Missing |
Unnamed: 0 is uniformly distributed | Uniform |
planned-uprate-year-month is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
planned-derate-year-month is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
planned-derate-summer-cap-mw is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2022-11-17 22:35:04.700309 |
---|---|
Analysis finished | 2022-11-17 22:35:24.750036 |
Duration | 20.05 seconds |
Software version | pandas-profiling v3.4.0 |
Download configuration | config.json |
Distinct | 5000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2499.5 |
Minimum | 0 |
---|---|
Maximum | 4999 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 39.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 249.95 |
Q1 | 1249.75 |
median | 2499.5 |
Q3 | 3749.25 |
95-th percentile | 4749.05 |
Maximum | 4999 |
Range | 4999 |
Interquartile range (IQR) | 2499.5 |
Descriptive statistics
Standard deviation | 1443.520003 |
---|---|
Coefficient of variation (CV) | 0.577523506 |
Kurtosis | -1.2 |
Mean | 2499.5 |
Median Absolute Deviation (MAD) | 1250 |
Skewness | 0 |
Sum | 12497500 |
Variance | 2083750 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
0 | 1 | < 0.1% |
3330 | 1 | < 0.1% |
3337 | 1 | < 0.1% |
3336 | 1 | < 0.1% |
3335 | 1 | < 0.1% |
3334 | 1 | < 0.1% |
3333 | 1 | < 0.1% |
3332 | 1 | < 0.1% |
3331 | 1 | < 0.1% |
3329 | 1 | < 0.1% |
Other values (4990) | 4990 |
Value | Count | Frequency (%) |
0 | 1 | |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 |
Value | Count | Frequency (%) |
4999 | 1 | |
4998 | 1 | |
4997 | 1 | |
4996 | 1 | |
4995 | 1 | |
4994 | 1 | |
4993 | 1 | |
4992 | 1 | |
4991 | 1 | |
4990 | 1 |
Distinct | 18 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
2017-02 | |
---|---|
2017-03 | |
2017-09 | |
2017-01 | |
2020-06 | |
Other values (13) |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Characters and Unicode
Total characters | 35000 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2020-09 |
---|---|
2nd row | 2020-09 |
3rd row | 2020-09 |
4th row | 2020-09 |
5th row | 2020-09 |
Common Values
Value | Count | Frequency (%) |
2017-02 | 1791 | |
2017-03 | 981 | |
2017-09 | 429 | 8.6% |
2017-01 | 388 | 7.8% |
2020-06 | 369 | 7.4% |
2020-03 | 233 | 4.7% |
2020-05 | 196 | 3.9% |
2020-09 | 179 | 3.6% |
2020-04 | 154 | 3.1% |
2020-11 | 112 | 2.2% |
Other values (8) | 168 | 3.4% |
Length
Value | Count | Frequency (%) |
2017-02 | 1791 | |
2017-03 | 981 | |
2017-09 | 429 | 8.6% |
2017-01 | 388 | 7.8% |
2020-06 | 369 | 7.4% |
2020-03 | 233 | 4.7% |
2020-05 | 196 | 3.9% |
2020-09 | 179 | 3.6% |
2020-04 | 154 | 3.1% |
2020-11 | 112 | 2.2% |
Other values (8) | 168 | 3.4% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 11165 | |
2 | 8097 | |
- | 5000 | |
1 | 4366 | 12.5% |
7 | 3664 | 10.5% |
3 | 1243 | 3.6% |
9 | 634 | 1.8% |
6 | 442 | 1.3% |
5 | 196 | 0.6% |
4 | 154 | 0.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 30000 | |
Dash Punctuation | 5000 | 14.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 11165 | |
2 | 8097 | |
1 | 4366 | 14.6% |
7 | 3664 | 12.2% |
3 | 1243 | 4.1% |
9 | 634 | 2.1% |
6 | 442 | 1.5% |
5 | 196 | 0.7% |
4 | 154 | 0.5% |
8 | 39 | 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 5000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 35000 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 11165 | |
2 | 8097 | |
- | 5000 | |
1 | 4366 | 12.5% |
7 | 3664 | 10.5% |
3 | 1243 | 3.6% |
9 | 634 | 1.8% |
6 | 442 | 1.3% |
5 | 196 | 0.6% |
4 | 154 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 11165 | |
2 | 8097 | |
- | 5000 | |
1 | 4366 | 12.5% |
7 | 3664 | 10.5% |
3 | 1243 | 3.6% |
9 | 634 | 1.8% |
6 | 442 | 1.3% |
5 | 196 | 0.6% |
4 | 154 | 0.4% |
Distinct | 51 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
CA | |
---|---|
NY | |
TX | 266 |
NC | 225 |
AK | 221 |
Other values (46) |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Characters and Unicode
Total characters | 10000 |
---|---|
Distinct characters | 24 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | TX |
---|---|
2nd row | TX |
3rd row | TX |
4th row | TX |
5th row | NJ |
Common Values
Value | Count | Frequency (%) |
CA | 741 | 14.8% |
NY | 336 | 6.7% |
TX | 266 | 5.3% |
NC | 225 | 4.5% |
AK | 221 | 4.4% |
MN | 204 | 4.1% |
MI | 196 | 3.9% |
SC | 180 | 3.6% |
KS | 150 | 3.0% |
VA | 139 | 2.8% |
Other values (41) | 2342 |
Length
Value | Count | Frequency (%) |
ca | 741 | 14.8% |
ny | 336 | 6.7% |
tx | 266 | 5.3% |
nc | 225 | 4.5% |
ak | 221 | 4.4% |
mn | 204 | 4.1% |
mi | 196 | 3.9% |
sc | 180 | 3.6% |
ks | 150 | 3.0% |
va | 139 | 2.8% |
Other values (41) | 2342 |
Most occurring characters
Value | Count | Frequency (%) |
A | 1894 | |
C | 1238 | |
N | 1168 | |
M | 792 | 7.9% |
I | 680 | 6.8% |
T | 529 | 5.3% |
K | 447 | 4.5% |
S | 379 | 3.8% |
O | 379 | 3.8% |
Y | 355 | 3.5% |
Other values (14) | 2139 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 1894 | |
C | 1238 | |
N | 1168 | |
M | 792 | 7.9% |
I | 680 | 6.8% |
T | 529 | 5.3% |
K | 447 | 4.5% |
S | 379 | 3.8% |
O | 379 | 3.8% |
Y | 355 | 3.5% |
Other values (14) | 2139 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 10000 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 1894 | |
C | 1238 | |
N | 1168 | |
M | 792 | 7.9% |
I | 680 | 6.8% |
T | 529 | 5.3% |
K | 447 | 4.5% |
S | 379 | 3.8% |
O | 379 | 3.8% |
Y | 355 | 3.5% |
Other values (14) | 2139 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
A | 1894 | |
C | 1238 | |
N | 1168 | |
M | 792 | 7.9% |
I | 680 | 6.8% |
T | 529 | 5.3% |
K | 447 | 4.5% |
S | 379 | 3.8% |
O | 379 | 3.8% |
Y | 355 | 3.5% |
Other values (14) | 2139 |
Distinct | 51 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
California | |
---|---|
New York | |
Texas | 266 |
North Carolina | 225 |
Alaska | 221 |
Other values (46) |
Length
Max length | 20 |
---|---|
Median length | 13 |
Mean length | 8.57 |
Min length | 4 |
Characters and Unicode
Total characters | 42850 |
---|---|
Distinct characters | 46 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Texas |
---|---|
2nd row | Texas |
3rd row | Texas |
4th row | Texas |
5th row | New Jersey |
Common Values
Value | Count | Frequency (%) |
California | 741 | 14.8% |
New York | 336 | 6.7% |
Texas | 266 | 5.3% |
North Carolina | 225 | 4.5% |
Alaska | 221 | 4.4% |
Minnesota | 204 | 4.1% |
Michigan | 196 | 3.9% |
South Carolina | 180 | 3.6% |
Kansas | 150 | 3.0% |
Virginia | 139 | 2.8% |
Other values (41) | 2342 |
Length
Value | Count | Frequency (%) |
california | 741 | 12.4% |
new | 479 | 8.0% |
carolina | 405 | 6.8% |
york | 336 | 5.6% |
texas | 266 | 4.4% |
north | 253 | 4.2% |
alaska | 221 | 3.7% |
south | 205 | 3.4% |
minnesota | 204 | 3.4% |
michigan | 196 | 3.3% |
Other values (45) | 2677 |
Most occurring characters
Value | Count | Frequency (%) |
a | 6103 | |
i | 4714 | 11.0% |
n | 3750 | 8.8% |
o | 3672 | 8.6% |
s | 2900 | 6.8% |
r | 2710 | 6.3% |
e | 2375 | 5.5% |
l | 1993 | 4.7% |
t | 1244 | 2.9% |
C | 1238 | 2.9% |
Other values (36) | 12151 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 35885 | |
Uppercase Letter | 5982 | 14.0% |
Space Separator | 983 | 2.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 6103 | |
i | 4714 | |
n | 3750 | |
o | 3672 | |
s | 2900 | |
r | 2710 | |
e | 2375 | 6.6% |
l | 1993 | 5.6% |
t | 1244 | 3.5% |
h | 1145 | 3.2% |
Other values (14) | 5279 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 1238 | |
N | 792 | |
M | 792 | |
A | 403 | 6.7% |
T | 395 | 6.6% |
Y | 336 | 5.6% |
I | 325 | 5.4% |
W | 252 | 4.2% |
V | 217 | 3.6% |
O | 215 | 3.6% |
Other values (11) | 1017 |
Space Separator
Value | Count | Frequency (%) |
983 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 41867 | |
Common | 983 | 2.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 6103 | |
i | 4714 | |
n | 3750 | 9.0% |
o | 3672 | 8.8% |
s | 2900 | 6.9% |
r | 2710 | 6.5% |
e | 2375 | 5.7% |
l | 1993 | 4.8% |
t | 1244 | 3.0% |
C | 1238 | 3.0% |
Other values (35) | 11168 |
Common
Value | Count | Frequency (%) |
983 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 42850 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 6103 | |
i | 4714 | 11.0% |
n | 3750 | 8.8% |
o | 3672 | 8.6% |
s | 2900 | 6.8% |
r | 2710 | 6.3% |
e | 2375 | 5.5% |
l | 1993 | 4.7% |
t | 1244 | 2.9% |
C | 1238 | 2.9% |
Other values (36) | 12151 |
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
electric-utility | |
---|---|
ipp-non-chp | |
industrial-chp | |
commercial-chp | 135 |
commercial-non-chp | 124 |
Other values (2) | 139 |
Length
Max length | 18 |
---|---|
Median length | 16 |
Mean length | 13.7766 |
Min length | 7 |
Characters and Unicode
Total characters | 68883 |
---|---|
Distinct characters | 17 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ipp-non-chp |
---|---|
2nd row | ipp-non-chp |
3rd row | ipp-non-chp |
4th row | ipp-non-chp |
5th row | ipp-non-chp |
Common Values
Value | Count | Frequency (%) |
electric-utility | 2419 | |
ipp-non-chp | 1925 | |
industrial-chp | 258 | 5.2% |
commercial-chp | 135 | 2.7% |
commercial-non-chp | 124 | 2.5% |
ipp-chp | 112 | 2.2% |
industrial-non-chp | 27 | 0.5% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
electric-utility | 2419 | |
ipp-non-chp | 1925 | |
industrial-chp | 258 | 5.2% |
commercial-chp | 135 | 2.7% |
commercial-non-chp | 124 | 2.5% |
ipp-chp | 112 | 2.2% |
industrial-non-chp | 27 | 0.5% |
Most occurring characters
Value | Count | Frequency (%) |
i | 10123 | |
c | 7937 | |
t | 7542 | |
- | 7076 | |
p | 6655 | |
l | 5382 | |
e | 5097 | |
n | 4437 | |
r | 2963 | 4.3% |
u | 2704 | 3.9% |
Other values (7) | 8967 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 61807 | |
Dash Punctuation | 7076 | 10.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
i | 10123 | |
c | 7937 | |
t | 7542 | |
p | 6655 | |
l | 5382 | |
e | 5097 | |
n | 4437 | |
r | 2963 | 4.8% |
u | 2704 | 4.4% |
h | 2581 | 4.2% |
Other values (6) | 6386 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 7076 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 61807 | |
Common | 7076 | 10.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
i | 10123 | |
c | 7937 | |
t | 7542 | |
p | 6655 | |
l | 5382 | |
e | 5097 | |
n | 4437 | |
r | 2963 | 4.8% |
u | 2704 | 4.4% |
h | 2581 | 4.2% |
Other values (6) | 6386 |
Common
Value | Count | Frequency (%) |
- | 7076 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 68883 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
i | 10123 | |
c | 7937 | |
t | 7542 | |
- | 7076 | |
p | 6655 | |
l | 5382 | |
e | 5097 | |
n | 4437 | |
r | 2963 | 4.3% |
u | 2704 | 3.9% |
Other values (7) | 8967 |
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
Electric Utility | |
---|---|
IPP Non-CHP | |
Industrial CHP | |
Commercial CHP | 135 |
Commercial Non-CHP | 124 |
Other values (2) | 139 |
Length
Max length | 18 |
---|---|
Median length | 16 |
Mean length | 13.7766 |
Min length | 7 |
Characters and Unicode
Total characters | 68883 |
---|---|
Distinct characters | 23 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | IPP Non-CHP |
---|---|
2nd row | IPP Non-CHP |
3rd row | IPP Non-CHP |
4th row | IPP Non-CHP |
5th row | IPP Non-CHP |
Common Values
Value | Count | Frequency (%) |
Electric Utility | 2419 | |
IPP Non-CHP | 1925 | |
Industrial CHP | 258 | 5.2% |
Commercial CHP | 135 | 2.7% |
Commercial Non-CHP | 124 | 2.5% |
IPP CHP | 112 | 2.2% |
Industrial Non-CHP | 27 | 0.5% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
electric | 2419 | |
utility | 2419 | |
non-chp | 2076 | |
ipp | 2037 | |
chp | 505 | 5.1% |
industrial | 285 | 2.9% |
commercial | 259 | 2.6% |
Most occurring characters
Value | Count | Frequency (%) |
i | 7801 | 11.3% |
t | 7542 | 10.9% |
P | 6655 | 9.7% |
l | 5382 | 7.8% |
c | 5097 | 7.4% |
5000 | 7.3% | |
r | 2963 | 4.3% |
C | 2840 | 4.1% |
e | 2678 | 3.9% |
H | 2581 | 3.7% |
Other values (13) | 20344 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 40495 | |
Uppercase Letter | 21312 | |
Space Separator | 5000 | 7.3% |
Dash Punctuation | 2076 | 3.0% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
i | 7801 | |
t | 7542 | |
l | 5382 | |
c | 5097 | |
r | 2963 | 7.3% |
e | 2678 | 6.6% |
y | 2419 | 6.0% |
n | 2361 | 5.8% |
o | 2335 | 5.8% |
a | 544 | 1.3% |
Other values (4) | 1373 | 3.4% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 6655 | |
C | 2840 | |
H | 2581 | 12.1% |
E | 2419 | 11.4% |
U | 2419 | 11.4% |
I | 2322 | 10.9% |
N | 2076 | 9.7% |
Space Separator
Value | Count | Frequency (%) |
5000 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2076 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 61807 | |
Common | 7076 | 10.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
i | 7801 | |
t | 7542 | |
P | 6655 | |
l | 5382 | 8.7% |
c | 5097 | 8.2% |
r | 2963 | 4.8% |
C | 2840 | 4.6% |
e | 2678 | 4.3% |
H | 2581 | 4.2% |
E | 2419 | 3.9% |
Other values (11) | 15849 |
Common
Value | Count | Frequency (%) |
5000 | ||
- | 2076 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 68883 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
i | 7801 | 11.3% |
t | 7542 | 10.9% |
P | 6655 | 9.7% |
l | 5382 | 7.8% |
c | 5097 | 7.4% |
5000 | 7.3% | |
r | 2963 | 4.3% |
C | 2840 | 4.1% |
e | 2678 | 3.9% |
H | 2581 | 3.7% |
Other values (13) | 20344 |
Distinct | 984 |
---|---|
Distinct (%) | 19.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29407.4486 |
Minimum | 34 |
---|---|
Maximum | 64137 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 39.2 KiB |
Quantile statistics
Minimum | 34 |
---|---|
5-th percentile | 1752 |
Q1 | 9990.25 |
median | 17886 |
Q3 | 56814 |
95-th percentile | 60971.95 |
Maximum | 64137 |
Range | 64103 |
Interquartile range (IQR) | 46823.75 |
Descriptive statistics
Standard deviation | 22797.80787 |
---|---|
Coefficient of variation (CV) | 0.7752392319 |
Kurtosis | -1.612175848 |
Mean | 29407.4486 |
Median Absolute Deviation (MAD) | 14621 |
Skewness | 0.3191801074 |
Sum | 147037243 |
Variance | 519740043.5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
18642 | 167 | 3.3% |
17609 | 138 | 2.8% |
17650 | 85 | 1.7% |
13781 | 83 | 1.7% |
7140 | 74 | 1.5% |
17539 | 70 | 1.4% |
4254 | 70 | 1.4% |
58661 | 63 | 1.3% |
40577 | 61 | 1.2% |
17543 | 48 | 1.0% |
Other values (974) | 4141 |
Value | Count | Frequency (%) |
34 | 1 | < 0.1% |
213 | 15 | 0.3% |
219 | 40 | |
221 | 44 | |
429 | 1 | < 0.1% |
503 | 1 | < 0.1% |
733 | 35 | |
765 | 9 | 0.2% |
768 | 2 | < 0.1% |
792 | 2 | < 0.1% |
Value | Count | Frequency (%) |
64137 | 1 | < 0.1% |
64045 | 1 | < 0.1% |
64025 | 4 | |
63841 | 1 | < 0.1% |
63822 | 1 | < 0.1% |
63705 | 3 | |
63534 | 1 | < 0.1% |
63471 | 4 | |
63201 | 1 | < 0.1% |
63181 | 4 |
Distinct | 985 |
---|---|
Distinct (%) | 19.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
Tennessee Valley Authority | 167 |
---|---|
Southern California Edison Co | 138 |
Southern Power Co | 85 |
Northern States Power Co - Minnesota | 83 |
Georgia Power Co | 74 |
Other values (980) |
Length
Max length | 49 |
---|---|
Median length | 39 |
Mean length | 25.1944 |
Min length | 5 |
Characters and Unicode
Total characters | 125972 |
---|---|
Distinct characters | 73 |
Distinct categories | 9 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 390 ? |
---|---|
Unique (%) | 7.8% |
Sample
1st row | Peaker Power, LLC |
---|---|
2nd row | Peaker Power, LLC |
3rd row | Pecos Wind I LP |
4th row | Pecos Wind II LP |
5th row | Pedricktown Cogeneration Company LP |
Common Values
Value | Count | Frequency (%) |
Tennessee Valley Authority | 167 | 3.3% |
Southern California Edison Co | 138 | 2.8% |
Southern Power Co | 85 | 1.7% |
Northern States Power Co - Minnesota | 83 | 1.7% |
Georgia Power Co | 74 | 1.5% |
Consumers Energy Co | 70 | 1.4% |
Sustainable Power Group, LLC | 63 | 1.3% |
American Mun Power-Ohio, Inc | 61 | 1.2% |
South Carolina Electric&Gas Company | 61 | 1.2% |
South Carolina Public Service Authority | 48 | 1.0% |
Other values (975) | 4150 |
Length
Value | Count | Frequency (%) |
llc | 1412 | 6.9% |
power | 1027 | 5.0% |
789 | 3.8% | |
co | 774 | 3.8% |
of | 697 | 3.4% |
inc | 599 | 2.9% |
energy | 591 | 2.9% |
city | 591 | 2.9% |
authority | 289 | 1.4% |
electric | 263 | 1.3% |
Other values (1222) | 13497 |
Most occurring characters
Value | Count | Frequency (%) |
15529 | 12.3% | |
e | 9975 | 7.9% |
o | 9139 | 7.3% |
r | 7948 | 6.3% |
n | 7396 | 5.9% |
a | 6743 | 5.4% |
i | 6363 | 5.1% |
t | 5888 | 4.7% |
C | 5039 | 4.0% |
l | 4279 | 3.4% |
Other values (63) | 47673 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 82020 | |
Uppercase Letter | 24807 | 19.7% |
Space Separator | 15529 | 12.3% |
Other Punctuation | 1296 | 1.0% |
Dash Punctuation | 815 | 0.6% |
Open Punctuation | 521 | 0.4% |
Close Punctuation | 521 | 0.4% |
Decimal Number | 431 | 0.3% |
Control | 32 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 9975 | |
o | 9139 | |
r | 7948 | |
n | 7396 | |
a | 6743 | 8.2% |
i | 6363 | 7.8% |
t | 5888 | 7.2% |
l | 4279 | 5.2% |
s | 3574 | 4.4% |
y | 2640 | 3.2% |
Other values (16) | 18075 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 5039 | |
L | 3389 | |
S | 2142 | |
P | 2069 | 8.3% |
E | 1720 | 6.9% |
A | 1259 | 5.1% |
I | 1158 | 4.7% |
N | 999 | 4.0% |
M | 889 | 3.6% |
G | 884 | 3.6% |
Other values (16) | 5259 |
Decimal Number
Value | Count | Frequency (%) |
0 | 98 | |
1 | 89 | |
2 | 88 | |
3 | 42 | |
5 | 40 | |
9 | 32 | 7.4% |
6 | 22 | 5.1% |
7 | 7 | 1.6% |
8 | 7 | 1.6% |
4 | 6 | 1.4% |
Other Punctuation
Value | Count | Frequency (%) |
, | 794 | |
& | 252 | 19.4% |
. | 211 | 16.3% |
/ | 36 | 2.8% |
# | 2 | 0.2% |
? | 1 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
15529 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 815 |
Open Punctuation
Value | Count | Frequency (%) |
( | 521 |
Close Punctuation
Value | Count | Frequency (%) |
) | 521 |
Control
Value | Count | Frequency (%) |
32 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 106827 | |
Common | 19145 | 15.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 9975 | 9.3% |
o | 9139 | 8.6% |
r | 7948 | 7.4% |
n | 7396 | 6.9% |
a | 6743 | 6.3% |
i | 6363 | 6.0% |
t | 5888 | 5.5% |
C | 5039 | 4.7% |
l | 4279 | 4.0% |
s | 3574 | 3.3% |
Other values (42) | 40483 |
Common
Value | Count | Frequency (%) |
15529 | ||
- | 815 | 4.3% |
, | 794 | 4.1% |
( | 521 | 2.7% |
) | 521 | 2.7% |
& | 252 | 1.3% |
. | 211 | 1.1% |
0 | 98 | 0.5% |
1 | 89 | 0.5% |
2 | 88 | 0.5% |
Other values (11) | 227 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 125972 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
15529 | 12.3% | |
e | 9975 | 7.9% |
o | 9139 | 7.3% |
r | 7948 | 6.3% |
n | 7396 | 5.9% |
a | 6743 | 5.4% |
i | 6363 | 5.1% |
t | 5888 | 4.7% |
C | 5039 | 4.0% |
l | 4279 | 3.4% |
Other values (63) | 47673 |
Distinct | 2008 |
---|---|
Distinct (%) | 40.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 31686.292 |
Minimum | 46 |
---|---|
Maximum | 64422 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 39.2 KiB |
Quantile statistics
Minimum | 46 |
---|---|
5-th percentile | 719 |
Q1 | 3397 |
median | 50385 |
Q3 | 57482 |
95-th percentile | 60924.15 |
Maximum | 64422 |
Range | 64376 |
Interquartile range (IQR) | 54085 |
Descriptive statistics
Standard deviation | 26625.32909 |
---|---|
Coefficient of variation (CV) | 0.8402791051 |
Kurtosis | -1.943962495 |
Mean | 31686.292 |
Median Absolute Deviation (MAD) | 11943.5 |
Skewness | -0.07555309445 |
Sum | 158431460 |
Variance | 708908148.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3630 | 27 | 0.5% |
54766 | 27 | 0.5% |
3406 | 24 | 0.5% |
3393 | 22 | 0.4% |
10279 | 20 | 0.4% |
1166 | 20 | 0.4% |
54782 | 18 | 0.4% |
3295 | 17 | 0.3% |
3403 | 17 | 0.3% |
55380 | 17 | 0.3% |
Other values (1998) | 4791 |
Value | Count | Frequency (%) |
46 | 3 | 0.1% |
47 | 9 | |
48 | 4 | 0.1% |
51 | 1 | < 0.1% |
64 | 11 | |
65 | 1 | < 0.1% |
66 | 8 | |
69 | 4 | 0.1% |
70 | 2 | < 0.1% |
78 | 3 | 0.1% |
Value | Count | Frequency (%) |
64422 | 4 | |
64232 | 1 | < 0.1% |
64231 | 1 | < 0.1% |
63986 | 1 | < 0.1% |
63857 | 1 | < 0.1% |
63837 | 1 | < 0.1% |
63836 | 1 | < 0.1% |
63835 | 1 | < 0.1% |
63753 | 1 | < 0.1% |
63681 | 2 |
Distinct | 2011 |
---|---|
Distinct (%) | 40.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
Pearsall | 27 |
---|---|
Boydton Plank Road Cogen Plant | 27 |
Johnsonville | 24 |
Allen | 22 |
Mt Pleasant | 20 |
Other values (2006) |
Length
Max length | 45 |
---|---|
Median length | 33 |
Mean length | 17.7466 |
Min length | 3 |
Characters and Unicode
Total characters | 88733 |
---|---|
Distinct characters | 73 |
Distinct categories | 9 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1029 ? |
---|---|
Unique (%) | 20.6% |
Sample
1st row | Port Comfort Power LLC |
---|---|
2nd row | Port Comfort Power LLC |
3rd row | Woodward Mountain I |
4th row | Woodward Mountain II |
5th row | Pedricktown Cogeneration Company LP |
Common Values
Value | Count | Frequency (%) |
Pearsall | 27 | 0.5% |
Boydton Plank Road Cogen Plant | 27 | 0.5% |
Johnsonville | 24 | 0.5% |
Allen | 22 | 0.4% |
Mt Pleasant | 20 | 0.4% |
Kansas River Project | 20 | 0.4% |
Seneca Energy | 18 | 0.4% |
Urquhart | 17 | 0.3% |
T H Wharton | 17 | 0.3% |
Union Power Station | 17 | 0.3% |
Other values (2001) | 4791 |
Length
Value | Count | Frequency (%) |
solar | 555 | 4.0% |
energy | 365 | 2.6% |
llc | 362 | 2.6% |
plant | 284 | 2.0% |
project | 266 | 1.9% |
power | 234 | 1.7% |
station | 198 | 1.4% |
center | 182 | 1.3% |
facility | 178 | 1.3% |
wind | 154 | 1.1% |
Other values (2170) | 11209 |
Most occurring characters
Value | Count | Frequency (%) |
8988 | 10.1% | |
e | 7301 | 8.2% |
a | 6573 | 7.4% |
o | 5734 | 6.5% |
r | 5718 | 6.4% |
n | 5583 | 6.3% |
t | 5213 | 5.9% |
l | 4643 | 5.2% |
i | 4612 | 5.2% |
C | 2167 | 2.4% |
Other values (63) | 32201 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 61953 | |
Uppercase Letter | 15858 | 17.9% |
Space Separator | 8988 | 10.1% |
Decimal Number | 944 | 1.1% |
Other Punctuation | 459 | 0.5% |
Open Punctuation | 177 | 0.2% |
Close Punctuation | 177 | 0.2% |
Dash Punctuation | 174 | 0.2% |
Control | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 7301 | |
a | 6573 | |
o | 5734 | |
r | 5718 | |
n | 5583 | |
t | 5213 | |
l | 4643 | 7.5% |
i | 4612 | 7.4% |
s | 2165 | 3.5% |
y | 1691 | 2.7% |
Other values (16) | 12720 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 2167 | |
S | 1959 | |
P | 1690 | 10.7% |
L | 1314 | 8.3% |
E | 740 | 4.7% |
M | 732 | 4.6% |
G | 725 | 4.6% |
B | 690 | 4.4% |
H | 674 | 4.3% |
F | 669 | 4.2% |
Other values (16) | 4498 |
Decimal Number
Value | Count | Frequency (%) |
1 | 216 | |
2 | 197 | |
0 | 111 | |
3 | 93 | |
4 | 80 | 8.5% |
5 | 64 | 6.8% |
6 | 62 | 6.6% |
8 | 46 | 4.9% |
9 | 42 | 4.4% |
7 | 33 | 3.5% |
Other Punctuation
Value | Count | Frequency (%) |
, | 192 | |
# | 141 | |
. | 91 | |
& | 18 | 3.9% |
/ | 10 | 2.2% |
' | 7 | 1.5% |
Space Separator
Value | Count | Frequency (%) |
8988 |
Open Punctuation
Value | Count | Frequency (%) |
( | 177 |
Close Punctuation
Value | Count | Frequency (%) |
) | 177 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 174 |
Control
Value | Count | Frequency (%) |
3 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 77811 | |
Common | 10922 | 12.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 7301 | 9.4% |
a | 6573 | 8.4% |
o | 5734 | 7.4% |
r | 5718 | 7.3% |
n | 5583 | 7.2% |
t | 5213 | 6.7% |
l | 4643 | 6.0% |
i | 4612 | 5.9% |
C | 2167 | 2.8% |
s | 2165 | 2.8% |
Other values (42) | 28102 |
Common
Value | Count | Frequency (%) |
8988 | ||
1 | 216 | 2.0% |
2 | 197 | 1.8% |
, | 192 | 1.8% |
( | 177 | 1.6% |
) | 177 | 1.6% |
- | 174 | 1.6% |
# | 141 | 1.3% |
0 | 111 | 1.0% |
3 | 93 | 0.9% |
Other values (11) | 456 | 4.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 88733 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
8988 | 10.1% | |
e | 7301 | 8.2% |
a | 6573 | 7.4% |
o | 5734 | 6.5% |
r | 5718 | 6.4% |
n | 5583 | 6.3% |
t | 5213 | 5.9% |
l | 4643 | 5.2% |
i | 4612 | 5.2% |
C | 2167 | 2.4% |
Other values (63) | 32201 |
Distinct | 1541 |
---|---|
Distinct (%) | 30.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
1 | |
---|---|
2 | |
3 | 274 |
4 | 196 |
GEN1 | 190 |
Other values (1536) |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 2.5848 |
Min length | 1 |
Characters and Unicode
Total characters | 12924 |
---|---|
Distinct characters | 41 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1251 ? |
---|---|
Unique (%) | 25.0% |
Sample
1st row | PC1 |
---|---|
2nd row | PC2 |
3rd row | 1 |
4th row | 1 |
5th row | GEN1 |
Common Values
Value | Count | Frequency (%) |
1 | 694 | 13.9% |
2 | 405 | 8.1% |
3 | 274 | 5.5% |
4 | 196 | 3.9% |
GEN1 | 190 | 3.8% |
5 | 134 | 2.7% |
6 | 112 | 2.2% |
GEN2 | 95 | 1.9% |
PV1 | 87 | 1.7% |
7 | 83 | 1.7% |
Other values (1531) | 2730 |
Length
Value | Count | Frequency (%) |
1 | 712 | 14.1% |
2 | 412 | 8.2% |
3 | 276 | 5.5% |
4 | 198 | 3.9% |
gen1 | 190 | 3.8% |
5 | 137 | 2.7% |
6 | 114 | 2.3% |
gen2 | 95 | 1.9% |
pv1 | 87 | 1.7% |
7 | 84 | 1.7% |
Other values (1516) | 2734 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 2058 | |
G | 1173 | 9.1% |
2 | 1102 | 8.5% |
N | 742 | 5.7% |
E | 718 | 5.6% |
T | 709 | 5.5% |
3 | 681 | 5.3% |
S | 601 | 4.7% |
C | 541 | 4.2% |
4 | 510 | 3.9% |
Other values (31) | 4089 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 6854 | |
Decimal Number | 5920 | |
Dash Punctuation | 103 | 0.8% |
Space Separator | 39 | 0.3% |
Other Punctuation | 7 | 0.1% |
Lowercase Letter | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
G | 1173 | |
N | 742 | |
E | 718 | |
T | 709 | |
S | 601 | |
C | 541 | 7.9% |
A | 282 | 4.1% |
P | 258 | 3.8% |
I | 217 | 3.2% |
B | 193 | 2.8% |
Other values (16) | 1420 |
Decimal Number
Value | Count | Frequency (%) |
1 | 2058 | |
2 | 1102 | |
3 | 681 | 11.5% |
4 | 510 | 8.6% |
0 | 382 | 6.5% |
5 | 361 | 6.1% |
6 | 274 | 4.6% |
7 | 240 | 4.1% |
8 | 184 | 3.1% |
9 | 128 | 2.2% |
Other Punctuation
Value | Count | Frequency (%) |
# | 6 | |
. | 1 | 14.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 103 |
Space Separator
Value | Count | Frequency (%) |
39 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 6855 | |
Common | 6069 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
G | 1173 | |
N | 742 | |
E | 718 | |
T | 709 | |
S | 601 | |
C | 541 | 7.9% |
A | 282 | 4.1% |
P | 258 | 3.8% |
I | 217 | 3.2% |
B | 193 | 2.8% |
Other values (17) | 1421 |
Common
Value | Count | Frequency (%) |
1 | 2058 | |
2 | 1102 | |
3 | 681 | 11.2% |
4 | 510 | 8.4% |
0 | 382 | 6.3% |
5 | 361 | 5.9% |
6 | 274 | 4.5% |
7 | 240 | 4.0% |
8 | 184 | 3.0% |
9 | 128 | 2.1% |
Other values (4) | 149 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 12924 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 2058 | |
G | 1173 | 9.1% |
2 | 1102 | 8.5% |
N | 742 | 5.7% |
E | 718 | 5.6% |
T | 709 | 5.5% |
3 | 681 | 5.3% |
S | 601 | 4.7% |
C | 541 | 4.2% |
4 | 510 | 3.9% |
Other values (31) | 4089 |
Distinct | 80 |
---|---|
Distinct (%) | 17.5% |
Missing | 4542 |
Missing (%) | 90.8% |
Memory size | 39.2 KiB |
CC1 | |
---|---|
1 | 15 |
CHP1 | 10 |
CC2 | 10 |
BLK2 | 10 |
Other values (75) |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.43231441 |
Min length | 1 |
Characters and Unicode
Total characters | 1572 |
---|---|
Distinct characters | 31 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 8 ? |
---|---|
Unique (%) | 1.7% |
Sample
1st row | CC1 |
---|---|
2nd row | CC1 |
3rd row | CC1 |
4th row | CC1 |
5th row | G104 |
Common Values
Value | Count | Frequency (%) |
CC1 | 175 | 3.5% |
1 | 15 | 0.3% |
CHP1 | 10 | 0.2% |
CC2 | 10 | 0.2% |
BLK2 | 10 | 0.2% |
CC01 | 10 | 0.2% |
PB01 | 9 | 0.2% |
BLK1 | 9 | 0.2% |
STG1 | 6 | 0.1% |
PLTB | 6 | 0.1% |
Other values (70) | 198 | 4.0% |
(Missing) | 4542 |
Length
Value | Count | Frequency (%) |
cc1 | 175 | |
1 | 15 | 3.3% |
chp1 | 10 | 2.2% |
cc01 | 10 | 2.2% |
cc2 | 10 | 2.2% |
blk2 | 10 | 2.2% |
pb01 | 9 | 2.0% |
blk1 | 9 | 2.0% |
stg1 | 6 | 1.3% |
pltb | 6 | 1.3% |
Other values (70) | 198 |
Most occurring characters
Value | Count | Frequency (%) |
C | 506 | |
1 | 344 | |
0 | 109 | 6.9% |
B | 78 | 5.0% |
G | 65 | 4.1% |
2 | 57 | 3.6% |
P | 47 | 3.0% |
L | 46 | 2.9% |
T | 40 | 2.5% |
S | 38 | 2.4% |
Other values (21) | 242 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 964 | |
Decimal Number | 608 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
C | 506 | |
B | 78 | 8.1% |
G | 65 | 6.7% |
P | 47 | 4.9% |
L | 46 | 4.8% |
T | 40 | 4.1% |
S | 38 | 3.9% |
H | 29 | 3.0% |
K | 26 | 2.7% |
U | 17 | 1.8% |
Other values (12) | 72 | 7.5% |
Decimal Number
Value | Count | Frequency (%) |
1 | 344 | |
0 | 109 | 17.9% |
2 | 57 | 9.4% |
3 | 31 | 5.1% |
4 | 21 | 3.5% |
6 | 14 | 2.3% |
5 | 13 | 2.1% |
8 | 10 | 1.6% |
7 | 9 | 1.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 964 | |
Common | 608 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
C | 506 | |
B | 78 | 8.1% |
G | 65 | 6.7% |
P | 47 | 4.9% |
L | 46 | 4.8% |
T | 40 | 4.1% |
S | 38 | 3.9% |
H | 29 | 3.0% |
K | 26 | 2.7% |
U | 17 | 1.8% |
Other values (12) | 72 | 7.5% |
Common
Value | Count | Frequency (%) |
1 | 344 | |
0 | 109 | 17.9% |
2 | 57 | 9.4% |
3 | 31 | 5.1% |
4 | 21 | 3.5% |
6 | 14 | 2.3% |
5 | 13 | 2.1% |
8 | 10 | 1.6% |
7 | 9 | 1.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1572 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 506 | |
1 | 344 | |
0 | 109 | 6.9% |
B | 78 | 5.0% |
G | 65 | 4.1% |
2 | 57 | 3.6% |
P | 47 | 3.0% |
L | 46 | 2.9% |
T | 40 | 2.5% |
S | 38 | 2.4% |
Other values (21) | 242 |
Distinct | 24 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
Petroleum Liquids | |
---|---|
Conventional Hydroelectric | |
Solar Photovoltaic | |
Natural Gas Fired Combustion Turbine | |
Natural Gas Fired Combined Cycle | |
Other values (19) |
Length
Max length | 38 |
---|---|
Median length | 33 |
Mean length | 23.5074 |
Min length | 7 |
Characters and Unicode
Total characters | 117537 |
---|---|
Distinct characters | 41 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Natural Gas Fired Combustion Turbine |
---|---|
2nd row | Natural Gas Fired Combustion Turbine |
3rd row | Onshore Wind Turbine |
4th row | Onshore Wind Turbine |
5th row | Natural Gas Fired Combined Cycle |
Common Values
Value | Count | Frequency (%) |
Petroleum Liquids | 928 | |
Conventional Hydroelectric | 879 | |
Solar Photovoltaic | 863 | |
Natural Gas Fired Combustion Turbine | 500 | |
Natural Gas Fired Combined Cycle | 450 | |
Natural Gas Internal Combustion Engine | 243 | 4.9% |
Onshore Wind Turbine | 242 | 4.8% |
Landfill Gas | 225 | 4.5% |
Conventional Steam Coal | 188 | 3.8% |
Natural Gas Steam Turbine | 135 | 2.7% |
Other values (14) | 347 | 6.9% |
Length
Value | Count | Frequency (%) |
gas | 1568 | 10.9% |
natural | 1343 | 9.3% |
conventional | 1067 | 7.4% |
fired | 950 | 6.6% |
hydroelectric | 934 | 6.5% |
petroleum | 932 | 6.4% |
liquids | 928 | 6.4% |
turbine | 877 | 6.1% |
solar | 872 | 6.0% |
photovoltaic | 863 | 6.0% |
Other values (29) | 4117 |
Most occurring characters
Value | Count | Frequency (%) |
o | 10446 | 8.9% |
9451 | 8.0% | |
e | 9132 | 7.8% |
i | 8650 | 7.4% |
a | 8509 | 7.2% |
t | 7730 | 6.6% |
r | 7592 | 6.5% |
l | 7473 | 6.4% |
n | 6970 | 5.9% |
u | 4911 | 4.2% |
Other values (31) | 36673 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 93474 | |
Uppercase Letter | 14527 | 12.4% |
Space Separator | 9451 | 8.0% |
Other Punctuation | 85 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 10446 | |
e | 9132 | |
i | 8650 | |
a | 8509 | |
t | 7730 | |
r | 7592 | |
l | 7473 | |
n | 6970 | 7.5% |
u | 4911 | 5.3% |
s | 3966 | 4.2% |
Other values (13) | 18095 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 2902 | |
P | 1850 | |
G | 1622 | |
N | 1360 | |
S | 1268 | |
L | 1153 | 7.9% |
F | 952 | 6.6% |
H | 934 | 6.4% |
T | 886 | 6.1% |
W | 569 | 3.9% |
Other values (6) | 1031 | 7.1% |
Space Separator
Value | Count | Frequency (%) |
9451 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 85 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 108001 | |
Common | 9536 | 8.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 10446 | 9.7% |
e | 9132 | 8.5% |
i | 8650 | 8.0% |
a | 8509 | 7.9% |
t | 7730 | 7.2% |
r | 7592 | 7.0% |
l | 7473 | 6.9% |
n | 6970 | 6.5% |
u | 4911 | 4.5% |
s | 3966 | 3.7% |
Other values (29) | 32622 |
Common
Value | Count | Frequency (%) |
9451 | ||
/ | 85 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 117537 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
o | 10446 | 8.9% |
9451 | 8.0% | |
e | 9132 | 7.8% |
i | 8650 | 7.4% |
a | 8509 | 7.2% |
t | 7730 | 6.6% |
r | 7592 | 6.5% |
l | 7473 | 6.4% |
n | 6970 | 5.9% |
u | 4911 | 4.2% |
Other values (31) | 36673 |
Distinct | 28 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
NG | |
---|---|
WAT | |
DFO | |
SUN | |
WND | |
Other values (23) |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.7184 |
Min length | 2 |
Characters and Unicode
Total characters | 13592 |
---|---|
Distinct characters | 22 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 2 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | NG |
---|---|
2nd row | NG |
3rd row | WND |
4th row | WND |
5th row | NG |
Common Values
Value | Count | Frequency (%) |
NG | 1343 | |
WAT | 934 | |
DFO | 890 | |
SUN | 872 | |
WND | 242 | 4.8% |
LFG | 225 | 4.5% |
BIT | 81 | 1.6% |
SUB | 77 | 1.5% |
OBG | 61 | 1.2% |
GEO | 47 | 0.9% |
Other values (18) | 228 | 4.6% |
Length
Value | Count | Frequency (%) |
ng | 1343 | |
wat | 934 | |
dfo | 890 | |
sun | 872 | |
wnd | 242 | 4.8% |
lfg | 225 | 4.5% |
bit | 81 | 1.6% |
sub | 77 | 1.5% |
obg | 61 | 1.2% |
geo | 47 | 0.9% |
Other values (18) | 228 | 4.6% |
Most occurring characters
Value | Count | Frequency (%) |
N | 2474 | |
G | 1685 | |
W | 1274 | |
D | 1177 | |
F | 1132 | |
O | 1030 | |
T | 1016 | |
S | 1003 | |
U | 968 | 7.1% |
A | 936 | 6.9% |
Other values (12) | 897 | 6.6% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 13592 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
N | 2474 | |
G | 1685 | |
W | 1274 | |
D | 1177 | |
F | 1132 | |
O | 1030 | |
T | 1016 | |
S | 1003 | |
U | 968 | 7.1% |
A | 936 | 6.9% |
Other values (12) | 897 | 6.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 13592 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
N | 2474 | |
G | 1685 | |
W | 1274 | |
D | 1177 | |
F | 1132 | |
O | 1030 | |
T | 1016 | |
S | 1003 | |
U | 968 | 7.1% |
A | 936 | 6.9% |
Other values (12) | 897 | 6.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13592 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
N | 2474 | |
G | 1685 | |
W | 1274 | |
D | 1177 | |
F | 1132 | |
O | 1030 | |
T | 1016 | |
S | 1003 | |
U | 968 | 7.1% |
A | 936 | 6.9% |
Other values (12) | 897 | 6.6% |
Distinct | 28 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
Natural Gas | |
---|---|
Water | |
Disillate Fuel Oil | |
Solar | |
Wind | |
Other values (23) |
Length
Max length | 35 |
---|---|
Median length | 27 |
Mean length | 10.2334 |
Min length | 4 |
Characters and Unicode
Total characters | 51167 |
---|---|
Distinct characters | 42 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 2 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Natural Gas |
---|---|
2nd row | Natural Gas |
3rd row | Wind |
4th row | Wind |
5th row | Natural Gas |
Common Values
Value | Count | Frequency (%) |
Natural Gas | 1343 | |
Water | 934 | |
Disillate Fuel Oil | 890 | |
Solar | 872 | |
Wind | 242 | 4.8% |
Landfill Gas | 225 | 4.5% |
Bituminous Coal | 81 | 1.6% |
Subbituminous Coal | 77 | 1.5% |
Other Biomass Gases | 61 | 1.2% |
Geothermal | 47 | 0.9% |
Other values (18) | 228 | 4.6% |
Length
Value | Count | Frequency (%) |
gas | 1575 | |
natural | 1343 | |
water | 934 | |
oil | 914 | |
fuel | 907 | |
disillate | 890 | |
solar | 872 | |
wind | 242 | 2.7% |
landfill | 225 | 2.5% |
coal | 186 | 2.1% |
Other values (32) | 868 |
Most occurring characters
Value | Count | Frequency (%) |
a | 7736 | |
l | 6677 | |
4017 | 7.9% | |
i | 3733 | 7.3% |
t | 3603 | 7.0% |
r | 3419 | 6.7% |
e | 3283 | 6.4% |
s | 3061 | 6.0% |
u | 2755 | 5.4% |
G | 1683 | 3.3% |
Other values (32) | 11200 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 38248 | |
Uppercase Letter | 8884 | 17.4% |
Space Separator | 4017 | 7.9% |
Open Punctuation | 9 | < 0.1% |
Close Punctuation | 9 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 7736 | |
l | 6677 | |
i | 3733 | |
t | 3603 | |
r | 3419 | |
e | 3283 | |
s | 3061 | 8.0% |
u | 2755 | 7.2% |
o | 1567 | 4.1% |
n | 692 | 1.8% |
Other values (11) | 1722 | 4.5% |
Uppercase Letter
Value | Count | Frequency (%) |
G | 1683 | |
N | 1360 | |
W | 1301 | |
S | 1005 | |
O | 983 | |
F | 907 | |
D | 890 | |
L | 267 | 3.0% |
C | 190 | 2.1% |
B | 184 | 2.1% |
Other values (8) | 114 | 1.3% |
Space Separator
Value | Count | Frequency (%) |
4017 |
Open Punctuation
Value | Count | Frequency (%) |
( | 9 |
Close Punctuation
Value | Count | Frequency (%) |
) | 9 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 47132 | |
Common | 4035 | 7.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 7736 | |
l | 6677 | |
i | 3733 | 7.9% |
t | 3603 | 7.6% |
r | 3419 | 7.3% |
e | 3283 | 7.0% |
s | 3061 | 6.5% |
u | 2755 | 5.8% |
G | 1683 | 3.6% |
o | 1567 | 3.3% |
Other values (29) | 9615 |
Common
Value | Count | Frequency (%) |
4017 | ||
( | 9 | 0.2% |
) | 9 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 51167 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 7736 | |
l | 6677 | |
4017 | 7.9% | |
i | 3733 | 7.3% |
t | 3603 | 7.0% |
r | 3419 | 6.7% |
e | 3283 | 6.4% |
s | 3061 | 6.0% |
u | 2755 | 5.4% |
G | 1683 | 3.3% |
Other values (32) | 11200 |
Distinct | 16 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
IC | |
---|---|
HY | |
PV | |
GT | |
ST | |
Other values (11) |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Characters and Unicode
Total characters | 10000 |
---|---|
Distinct characters | 14 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | GT |
---|---|
2nd row | GT |
3rd row | WT |
4th row | WT |
5th row | CT |
Common Values
Value | Count | Frequency (%) |
IC | 1270 | |
HY | 879 | |
PV | 863 | |
GT | 632 | |
ST | 492 | 9.8% |
CT | 300 | 6.0% |
WT | 242 | 4.8% |
CA | 152 | 3.0% |
PS | 55 | 1.1% |
FC | 35 | 0.7% |
Other values (6) | 80 | 1.6% |
Length
Value | Count | Frequency (%) |
ic | 1270 | |
hy | 879 | |
pv | 863 | |
gt | 632 | |
st | 492 | 9.8% |
ct | 300 | 6.0% |
wt | 242 | 4.8% |
ca | 152 | 3.0% |
ps | 55 | 1.1% |
fc | 35 | 0.7% |
Other values (6) | 80 | 1.6% |
Most occurring characters
Value | Count | Frequency (%) |
C | 1778 | |
T | 1707 | |
I | 1270 | |
P | 920 | |
H | 879 | |
Y | 879 | |
V | 863 | |
G | 632 | 6.3% |
S | 566 | 5.7% |
W | 244 | 2.4% |
Other values (4) | 262 | 2.6% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
C | 1778 | |
T | 1707 | |
I | 1270 | |
P | 920 | |
H | 879 | |
Y | 879 | |
V | 863 | |
G | 632 | 6.3% |
S | 566 | 5.7% |
W | 244 | 2.4% |
Other values (4) | 262 | 2.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 10000 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
C | 1778 | |
T | 1707 | |
I | 1270 | |
P | 920 | |
H | 879 | |
Y | 879 | |
V | 863 | |
G | 632 | 6.3% |
S | 566 | 5.7% |
W | 244 | 2.4% |
Other values (4) | 262 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 1778 | |
T | 1707 | |
I | 1270 | |
P | 920 | |
H | 879 | |
Y | 879 | |
V | 863 | |
G | 632 | 6.3% |
S | 566 | 5.7% |
W | 244 | 2.4% |
Other values (4) | 262 | 2.6% |
Distinct | 50 |
---|---|
Distinct (%) | 1.1% |
Missing | 285 |
Missing (%) | 5.7% |
Memory size | 39.2 KiB |
MISO | |
---|---|
CISO | |
PJM | |
SWPP | |
NYIS | |
Other values (45) |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.746129374 |
Min length | 2 |
Characters and Unicode
Total characters | 17663 |
---|---|
Distinct characters | 24 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 5 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | ERCO |
---|---|
2nd row | ERCO |
3rd row | ERCO |
4th row | ERCO |
5th row | PJM |
Common Values
Value | Count | Frequency (%) |
MISO | 889 | |
CISO | 701 | |
PJM | 619 | |
SWPP | 344 | 6.9% |
NYIS | 337 | 6.7% |
ISNE | 264 | 5.3% |
ERCO | 234 | 4.7% |
TVA | 178 | 3.6% |
SOCO | 161 | 3.2% |
DUK | 132 | 2.6% |
Other values (40) | 856 | |
(Missing) | 285 | 5.7% |
Length
Value | Count | Frequency (%) |
miso | 889 | |
ciso | 701 | |
pjm | 619 | |
swpp | 344 | 7.3% |
nyis | 337 | 7.1% |
isne | 264 | 5.6% |
erco | 234 | 5.0% |
tva | 178 | 3.8% |
soco | 161 | 3.4% |
duk | 132 | 2.8% |
Other values (40) | 856 |
Most occurring characters
Value | Count | Frequency (%) |
S | 2916 | |
I | 2306 | |
O | 2226 | |
P | 1825 | |
C | 1628 | |
M | 1599 | |
E | 868 | 4.9% |
N | 662 | 3.7% |
J | 625 | 3.5% |
A | 566 | 3.2% |
Other values (14) | 2442 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 17663 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
S | 2916 | |
I | 2306 | |
O | 2226 | |
P | 1825 | |
C | 1628 | |
M | 1599 | |
E | 868 | 4.9% |
N | 662 | 3.7% |
J | 625 | 3.5% |
A | 566 | 3.2% |
Other values (14) | 2442 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 17663 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
S | 2916 | |
I | 2306 | |
O | 2226 | |
P | 1825 | |
C | 1628 | |
M | 1599 | |
E | 868 | 4.9% |
N | 662 | 3.7% |
J | 625 | 3.5% |
A | 566 | 3.2% |
Other values (14) | 2442 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 17663 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
S | 2916 | |
I | 2306 | |
O | 2226 | |
P | 1825 | |
C | 1628 | |
M | 1599 | |
E | 868 | 4.9% |
N | 662 | 3.7% |
J | 625 | 3.5% |
A | 566 | 3.2% |
Other values (14) | 2442 |
Distinct | 52 |
---|---|
Distinct (%) | 1.1% |
Missing | 277 |
Missing (%) | 5.5% |
Memory size | 39.2 KiB |
Midcontinent Independent Transmission System Operator, Inc.. | |
---|---|
California Independent System Operator | |
PJM Interconnection, LLC | |
Southwest Power Pool | |
New York Independent System Operator | |
Other values (47) |
Length
Max length | 100 |
---|---|
Median length | 57 |
Mean length | 35.90239255 |
Min length | 3 |
Characters and Unicode
Total characters | 169567 |
---|---|
Distinct characters | 58 |
Distinct categories | 8 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 5 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | Electric Reliability Council of Texas, Inc. |
---|---|
2nd row | Electric Reliability Council of Texas, Inc. |
3rd row | Electric Reliability Council of Texas, Inc. |
4th row | Electric Reliability Council of Texas, Inc. |
5th row | PJM Interconnection, LLC |
Common Values
Value | Count | Frequency (%) |
Midcontinent Independent Transmission System Operator, Inc.. | 889 | |
California Independent System Operator | 701 | |
PJM Interconnection, LLC | 619 | |
Southwest Power Pool | 344 | 6.9% |
New York Independent System Operator | 337 | 6.7% |
ISO New England Inc. | 264 | 5.3% |
Electric Reliability Council of Texas, Inc. | 234 | 4.7% |
Tennessee Valley Authority | 178 | 3.6% |
Southern Company Services, Inc. - Trans | 161 | 3.2% |
Duke Energy Carolinas | 132 | 2.6% |
Other values (42) | 864 | |
(Missing) | 277 | 5.5% |
Length
Value | Count | Frequency (%) |
independent | 1927 | 9.2% |
operator | 1927 | 9.2% |
system | 1927 | 9.2% |
inc | 1597 | 7.7% |
midcontinent | 889 | 4.3% |
transmission | 889 | 4.3% |
california | 704 | 3.4% |
llc | 629 | 3.0% |
new | 625 | 3.0% |
pjm | 619 | 3.0% |
Other values (104) | 9102 |
Most occurring characters
Value | Count | Frequency (%) |
n | 18370 | 10.8% |
e | 17277 | 10.2% |
16112 | 9.5% | |
t | 11771 | 6.9% |
o | 10754 | 6.3% |
r | 9942 | 5.9% |
i | 9109 | 5.4% |
a | 7488 | 4.4% |
s | 6986 | 4.1% |
c | 5404 | 3.2% |
Other values (48) | 56354 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 125065 | |
Uppercase Letter | 23441 | 13.8% |
Space Separator | 16112 | 9.5% |
Other Punctuation | 4558 | 2.7% |
Dash Punctuation | 361 | 0.2% |
Open Punctuation | 10 | < 0.1% |
Close Punctuation | 10 | < 0.1% |
Decimal Number | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
n | 18370 | |
e | 17277 | |
t | 11771 | |
o | 10754 | |
r | 9942 | |
i | 9109 | |
a | 7488 | 6.0% |
s | 6986 | 5.6% |
c | 5404 | 4.3% |
d | 5375 | 4.3% |
Other values (16) | 22589 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 4474 | |
S | 3238 | |
C | 2571 | |
O | 2191 | |
P | 1929 | |
M | 1590 | 6.8% |
T | 1563 | 6.7% |
L | 1296 | 5.5% |
E | 1122 | 4.8% |
N | 696 | 3.0% |
Other values (13) | 2771 |
Other Punctuation
Value | Count | Frequency (%) |
. | 2493 | |
, | 1960 | |
& | 105 | 2.3% |
Decimal Number
Value | Count | Frequency (%) |
1 | 9 | |
2 | 1 | 10.0% |
Space Separator
Value | Count | Frequency (%) |
16112 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 361 |
Open Punctuation
Value | Count | Frequency (%) |
( | 10 |
Close Punctuation
Value | Count | Frequency (%) |
) | 10 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 148506 | |
Common | 21061 | 12.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
n | 18370 | 12.4% |
e | 17277 | 11.6% |
t | 11771 | 7.9% |
o | 10754 | 7.2% |
r | 9942 | 6.7% |
i | 9109 | 6.1% |
a | 7488 | 5.0% |
s | 6986 | 4.7% |
c | 5404 | 3.6% |
d | 5375 | 3.6% |
Other values (39) | 46030 |
Common
Value | Count | Frequency (%) |
16112 | ||
. | 2493 | 11.8% |
, | 1960 | 9.3% |
- | 361 | 1.7% |
& | 105 | 0.5% |
( | 10 | < 0.1% |
) | 10 | < 0.1% |
1 | 9 | < 0.1% |
2 | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 169567 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
n | 18370 | 10.8% |
e | 17277 | 10.2% |
16112 | 9.5% | |
t | 11771 | 6.9% |
o | 10754 | 6.3% |
r | 9942 | 5.9% |
i | 9109 | 5.4% |
a | 7488 | 4.4% |
s | 6986 | 4.1% |
c | 5404 | 3.2% |
Other values (48) | 56354 |
Distinct | 4 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
OP | |
---|---|
SB | 386 |
OS | 80 |
OA | 35 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Characters and Unicode
Total characters | 10000 |
---|---|
Distinct characters | 5 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | OP |
---|---|
2nd row | OP |
3rd row | OP |
4th row | OP |
5th row | OP |
Common Values
Value | Count | Frequency (%) |
OP | 4499 | |
SB | 386 | 7.7% |
OS | 80 | 1.6% |
OA | 35 | 0.7% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
op | 4499 | |
sb | 386 | 7.7% |
os | 80 | 1.6% |
oa | 35 | 0.7% |
Most occurring characters
Value | Count | Frequency (%) |
O | 4614 | |
P | 4499 | |
S | 466 | 4.7% |
B | 386 | 3.9% |
A | 35 | 0.4% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
O | 4614 | |
P | 4499 | |
S | 466 | 4.7% |
B | 386 | 3.9% |
A | 35 | 0.4% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 10000 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
O | 4614 | |
P | 4499 | |
S | 466 | 4.7% |
B | 386 | 3.9% |
A | 35 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
O | 4614 | |
P | 4499 | |
S | 466 | 4.7% |
B | 386 | 3.9% |
A | 35 | 0.4% |
Distinct | 4 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
Operating | |
---|---|
Standby/Backup: available for service but not normally used | 386 |
Out of service and NOT expected to return to service in next calendar year | 80 |
Out of service but expected to return to service in next calendar year | 35 |
Length
Max length | 74 |
---|---|
Median length | 9 |
Mean length | 14.327 |
Min length | 9 |
Characters and Unicode
Total characters | 71635 |
---|---|
Distinct characters | 29 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Operating |
---|---|
2nd row | Operating |
3rd row | Operating |
4th row | Operating |
5th row | Operating |
Common Values
Value | Count | Frequency (%) |
Operating | 4499 | |
Standby/Backup: available for service but not normally used | 386 | 7.7% |
Out of service and NOT expected to return to service in next calendar year | 80 | 1.6% |
Out of service but expected to return to service in next calendar year | 35 | 0.7% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
operating | 4499 | |
service | 616 | 6.7% |
not | 466 | 5.1% |
but | 421 | 4.6% |
available | 386 | 4.2% |
for | 386 | 4.2% |
normally | 386 | 4.2% |
used | 386 | 4.2% |
standby/backup | 386 | 4.2% |
to | 230 | 2.5% |
Other values (9) | 1000 | 10.9% |
Most occurring characters
Value | Count | Frequency (%) |
e | 7308 | |
a | 7240 | |
t | 6382 | |
r | 6347 | |
n | 6197 | |
i | 5616 | 7.8% |
p | 5000 | 7.0% |
O | 4694 | 6.6% |
g | 4499 | 6.3% |
4162 | 5.8% | |
Other values (19) | 14190 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 61075 | |
Uppercase Letter | 5626 | 7.9% |
Space Separator | 4162 | 5.8% |
Other Punctuation | 772 | 1.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 7308 | |
a | 7240 | |
t | 6382 | |
r | 6347 | |
n | 6197 | |
i | 5616 | |
p | 5000 | |
g | 4499 | |
l | 1659 | 2.7% |
o | 1503 | 2.5% |
Other values (11) | 9324 |
Uppercase Letter
Value | Count | Frequency (%) |
O | 4694 | |
B | 386 | 6.9% |
S | 386 | 6.9% |
N | 80 | 1.4% |
T | 80 | 1.4% |
Other Punctuation
Value | Count | Frequency (%) |
: | 386 | |
/ | 386 |
Space Separator
Value | Count | Frequency (%) |
4162 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 66701 | |
Common | 4934 | 6.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 7308 | |
a | 7240 | |
t | 6382 | |
r | 6347 | |
n | 6197 | |
i | 5616 | |
p | 5000 | |
O | 4694 | 7.0% |
g | 4499 | 6.7% |
l | 1659 | 2.5% |
Other values (16) | 11759 |
Common
Value | Count | Frequency (%) |
4162 | ||
: | 386 | 7.8% |
/ | 386 | 7.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 71635 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 7308 | |
a | 7240 | |
t | 6382 | |
r | 6347 | |
n | 6197 | |
i | 5616 | 7.8% |
p | 5000 | 7.0% |
O | 4694 | 6.6% |
g | 4499 | 6.3% |
4162 | 5.8% | |
Other values (19) | 14190 |
Distinct | 748 |
---|---|
Distinct (%) | 15.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 56.86852 |
Minimum | 0.1 |
---|---|
Maximum | 1300 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 39.2 KiB |
Quantile statistics
Minimum | 0.1 |
---|---|
5-th percentile | 0.4 |
Q1 | 1.6 |
median | 5 |
Q3 | 55 |
95-th percentile | 245 |
Maximum | 1300 |
Range | 1299.9 |
Interquartile range (IQR) | 53.4 |
Descriptive statistics
Standard deviation | 133.6035466 |
---|---|
Coefficient of variation (CV) | 2.349341016 |
Kurtosis | 29.72833774 |
Mean | 56.86852 |
Median Absolute Deviation (MAD) | 4.5 |
Skewness | 4.826863195 |
Sum | 284342.6 |
Variance | 17849.90766 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 255 | 5.1% |
1 | 200 | 4.0% |
0.5 | 188 | 3.8% |
1.6 | 119 | 2.4% |
0.3 | 112 | 2.2% |
1.8 | 108 | 2.2% |
3 | 107 | 2.1% |
5 | 99 | 2.0% |
1.5 | 99 | 2.0% |
0.8 | 95 | 1.9% |
Other values (738) | 3618 |
Value | Count | Frequency (%) |
0.1 | 28 | 0.6% |
0.2 | 55 | 1.1% |
0.3 | 112 | |
0.4 | 64 | 1.3% |
0.5 | 188 | |
0.6 | 73 | 1.5% |
0.7 | 44 | 0.9% |
0.8 | 95 | |
0.9 | 52 | 1.0% |
1 | 200 |
Value | Count | Frequency (%) |
1300 | 6 | |
1245.6 | 2 | < 0.1% |
1242 | 1 | < 0.1% |
1205.1 | 2 | < 0.1% |
1190 | 1 | < 0.1% |
1152 | 2 | < 0.1% |
1029.6 | 1 | < 0.1% |
1008 | 1 | < 0.1% |
956.8 | 1 | < 0.1% |
952 | 2 | < 0.1% |
Distinct | 827 |
---|---|
Distinct (%) | 16.6% |
Missing | 18 |
Missing (%) | 0.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 52.1248896 |
Minimum | 0 |
---|---|
Maximum | 1300 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 39.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.4 |
Q1 | 1.5 |
median | 4.5 |
Q3 | 46.275 |
95-th percentile | 225 |
Maximum | 1300 |
Range | 1300 |
Interquartile range (IQR) | 44.775 |
Descriptive statistics
Standard deviation | 125.8161606 |
---|---|
Coefficient of variation (CV) | 2.413744404 |
Kurtosis | 31.10802866 |
Mean | 52.1248896 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 4.944792423 |
Sum | 259686.2 |
Variance | 15829.70627 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 252 | 5.0% |
0.5 | 194 | 3.9% |
1 | 175 | 3.5% |
1.5 | 132 | 2.6% |
0.3 | 121 | 2.4% |
1.8 | 117 | 2.3% |
3 | 96 | 1.9% |
1.6 | 96 | 1.9% |
0.8 | 90 | 1.8% |
0.6 | 84 | 1.7% |
Other values (817) | 3625 |
Value | Count | Frequency (%) |
0 | 1 | < 0.1% |
0.1 | 38 | 0.8% |
0.2 | 54 | 1.1% |
0.3 | 121 | |
0.4 | 73 | 1.5% |
0.5 | 194 | |
0.6 | 84 | |
0.7 | 61 | 1.2% |
0.8 | 90 | |
0.9 | 81 |
Value | Count | Frequency (%) |
1300 | 1 | |
1299 | 1 | |
1249.1 | 1 | |
1239 | 2 | |
1231 | 2 | |
1160.1 | 1 | |
1150.1 | 1 | |
1110 | 2 | |
1104.9 | 1 | |
1103.7 | 1 |
Distinct | 809 |
---|---|
Distinct (%) | 16.3% |
Missing | 27 |
Missing (%) | 0.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 53.79525437 |
Minimum | 0 |
---|---|
Maximum | 1299 |
Zeros | 14 |
Zeros (%) | 0.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 39.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.4 |
Q1 | 1.5 |
median | 4.6 |
Q3 | 49.5 |
95-th percentile | 230 |
Maximum | 1299 |
Range | 1299 |
Interquartile range (IQR) | 48 |
Descriptive statistics
Standard deviation | 128.2640542 |
---|---|
Coefficient of variation (CV) | 2.384300543 |
Kurtosis | 30.27219714 |
Mean | 53.79525437 |
Median Absolute Deviation (MAD) | 4.1 |
Skewness | 4.85992631 |
Sum | 267523.8 |
Variance | 16451.6676 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 249 | 5.0% |
0.5 | 184 | 3.7% |
1 | 170 | 3.4% |
1.5 | 126 | 2.5% |
0.3 | 123 | 2.5% |
1.8 | 116 | 2.3% |
1.6 | 105 | 2.1% |
0.8 | 103 | 2.1% |
5 | 100 | 2.0% |
3 | 87 | 1.7% |
Other values (799) | 3610 |
Value | Count | Frequency (%) |
0 | 14 | 0.3% |
0.1 | 42 | 0.8% |
0.2 | 56 | 1.1% |
0.3 | 123 | |
0.4 | 69 | 1.4% |
0.5 | 184 | |
0.6 | 80 | |
0.7 | 67 | 1.3% |
0.8 | 103 | |
0.9 | 74 |
Value | Count | Frequency (%) |
1299 | 2 | |
1265 | 2 | |
1257 | 2 | |
1249.1 | 1 | |
1198.7 | 1 | |
1179.8 | 1 | |
1135.2 | 1 | |
1134.2 | 1 | |
1131.7 | 1 | |
1110 | 2 |
Distinct | 915 |
---|---|
Distinct (%) | 18.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
2016-12 | 83 |
---|---|
2015-02 | 74 |
2001-06 | 53 |
2012-12 | 51 |
2014-12 | 41 |
Other values (910) |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Characters and Unicode
Total characters | 35000 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 203 ? |
---|---|
Unique (%) | 4.1% |
Sample
1st row | 2017-07 |
---|---|
2nd row | 2017-07 |
3rd row | 2001-07 |
4th row | 2001-07 |
5th row | 1992-03 |
Common Values
Value | Count | Frequency (%) |
2016-12 | 83 | 1.7% |
2015-02 | 74 | 1.5% |
2001-06 | 53 | 1.1% |
2012-12 | 51 | 1.0% |
2014-12 | 41 | 0.8% |
2013-12 | 41 | 0.8% |
2015-12 | 36 | 0.7% |
2003-06 | 35 | 0.7% |
2000-06 | 35 | 0.7% |
2002-07 | 34 | 0.7% |
Other values (905) | 4517 |
Length
Value | Count | Frequency (%) |
2016-12 | 83 | 1.7% |
2015-02 | 74 | 1.5% |
2001-06 | 53 | 1.1% |
2012-12 | 51 | 1.0% |
2014-12 | 41 | 0.8% |
2013-12 | 41 | 0.8% |
2015-12 | 36 | 0.7% |
2003-06 | 35 | 0.7% |
2000-06 | 35 | 0.7% |
2002-07 | 34 | 0.7% |
Other values (905) | 4517 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8053 | |
1 | 6842 | |
- | 5000 | |
2 | 4333 | |
9 | 3663 | |
6 | 1488 | 4.3% |
8 | 1312 | 3.7% |
7 | 1298 | 3.7% |
5 | 1201 | 3.4% |
4 | 908 | 2.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 30000 | |
Dash Punctuation | 5000 | 14.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 8053 | |
1 | 6842 | |
2 | 4333 | |
9 | 3663 | |
6 | 1488 | 5.0% |
8 | 1312 | 4.4% |
7 | 1298 | 4.3% |
5 | 1201 | 4.0% |
4 | 908 | 3.0% |
3 | 902 | 3.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 5000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 35000 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 8053 | |
1 | 6842 | |
- | 5000 | |
2 | 4333 | |
9 | 3663 | |
6 | 1488 | 4.3% |
8 | 1312 | 3.7% |
7 | 1298 | 3.7% |
5 | 1201 | 3.4% |
4 | 908 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8053 | |
1 | 6842 | |
- | 5000 | |
2 | 4333 | |
9 | 3663 | |
6 | 1488 | 4.3% |
8 | 1312 | 3.7% |
7 | 1298 | 3.7% |
5 | 1201 | 3.4% |
4 | 908 | 2.6% |
Distinct | 36 |
---|---|
Distinct (%) | 30.5% |
Missing | 4882 |
Missing (%) | 97.6% |
Memory size | 39.2 KiB |
2017-12 | |
---|---|
2020-11 | |
2017-08 | |
2023-12 | |
2024-12 | 6 |
Other values (31) |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Characters and Unicode
Total characters | 826 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 12 ? |
---|---|
Unique (%) | 10.2% |
Sample
1st row | 2026-12 |
---|---|
2nd row | 2021-05 |
3rd row | 2021-07 |
4th row | 2021-06 |
5th row | 2021-08 |
Common Values
Value | Count | Frequency (%) |
2017-12 | 18 | 0.4% |
2020-11 | 10 | 0.2% |
2017-08 | 8 | 0.2% |
2023-12 | 7 | 0.1% |
2024-12 | 6 | 0.1% |
2020-12 | 5 | 0.1% |
2031-12 | 5 | 0.1% |
2021-06 | 5 | 0.1% |
2025-12 | 4 | 0.1% |
2022-11 | 4 | 0.1% |
Other values (26) | 46 | 0.9% |
(Missing) | 4882 |
Length
Value | Count | Frequency (%) |
2017-12 | 18 | 15.3% |
2020-11 | 10 | 8.5% |
2017-08 | 8 | 6.8% |
2023-12 | 7 | 5.9% |
2024-12 | 6 | 5.1% |
2020-12 | 5 | 4.2% |
2031-12 | 5 | 4.2% |
2021-06 | 5 | 4.2% |
2026-12 | 4 | 3.4% |
2025-12 | 4 | 3.4% |
Other values (26) | 46 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 240 | |
0 | 180 | |
1 | 162 | |
- | 118 | |
7 | 38 | 4.6% |
8 | 21 | 2.5% |
6 | 19 | 2.3% |
3 | 18 | 2.2% |
9 | 13 | 1.6% |
4 | 11 | 1.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 708 | |
Dash Punctuation | 118 | 14.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 240 | |
0 | 180 | |
1 | 162 | |
7 | 38 | 5.4% |
8 | 21 | 3.0% |
6 | 19 | 2.7% |
3 | 18 | 2.5% |
9 | 13 | 1.8% |
4 | 11 | 1.6% |
5 | 6 | 0.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 118 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 826 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 240 | |
0 | 180 | |
1 | 162 | |
- | 118 | |
7 | 38 | 4.6% |
8 | 21 | 2.5% |
6 | 19 | 2.3% |
3 | 18 | 2.2% |
9 | 13 | 1.6% |
4 | 11 | 1.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 826 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 240 | |
0 | 180 | |
1 | 162 | |
- | 118 | |
7 | 38 | 4.6% |
8 | 21 | 2.5% |
6 | 19 | 2.3% |
3 | 18 | 2.2% |
9 | 13 | 1.6% |
4 | 11 | 1.3% |
Distinct | 23 |
---|---|
Distinct (%) | 88.5% |
Missing | 4974 |
Missing (%) | 99.5% |
Memory size | 39.2 KiB |
2021-05 | |
---|---|
2018-05 | |
2022-12 | |
2018-01 | 1 |
2019-12 | 1 |
Other values (18) |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Characters and Unicode
Total characters | 182 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 20 ? |
---|---|
Unique (%) | 76.9% |
Sample
1st row | 2022-12 |
---|---|
2nd row | 2022-12 |
3rd row | 2023-12 |
4th row | 2024-12 |
5th row | 2025-12 |
Common Values
Value | Count | Frequency (%) |
2021-05 | 2 | < 0.1% |
2018-05 | 2 | < 0.1% |
2022-12 | 2 | < 0.1% |
2018-01 | 1 | < 0.1% |
2019-12 | 1 | < 0.1% |
2018-12 | 1 | < 0.1% |
2018-09 | 1 | < 0.1% |
2019-04 | 1 | < 0.1% |
2018-11 | 1 | < 0.1% |
2018-02 | 1 | < 0.1% |
Other values (13) | 13 | 0.3% |
(Missing) | 4974 |
Length
Value | Count | Frequency (%) |
2021-05 | 2 | 7.7% |
2022-12 | 2 | 7.7% |
2018-05 | 2 | 7.7% |
2022-06 | 1 | 3.8% |
2023-12 | 1 | 3.8% |
2024-12 | 1 | 3.8% |
2025-12 | 1 | 3.8% |
2026-12 | 1 | 3.8% |
2027-12 | 1 | 3.8% |
2023-06 | 1 | 3.8% |
Other values (13) | 13 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 54 | |
0 | 44 | |
1 | 27 | |
- | 26 | |
8 | 8 | 4.4% |
5 | 6 | 3.3% |
9 | 5 | 2.7% |
6 | 5 | 2.7% |
4 | 3 | 1.6% |
3 | 3 | 1.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 156 | |
Dash Punctuation | 26 | 14.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 54 | |
0 | 44 | |
1 | 27 | |
8 | 8 | 5.1% |
5 | 6 | 3.8% |
9 | 5 | 3.2% |
6 | 5 | 3.2% |
4 | 3 | 1.9% |
3 | 3 | 1.9% |
7 | 1 | 0.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 26 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 182 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 54 | |
0 | 44 | |
1 | 27 | |
- | 26 | |
8 | 8 | 4.4% |
5 | 6 | 3.3% |
9 | 5 | 2.7% |
6 | 5 | 2.7% |
4 | 3 | 1.6% |
3 | 3 | 1.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 182 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 54 | |
0 | 44 | |
1 | 27 | |
- | 26 | |
8 | 8 | 4.4% |
5 | 6 | 3.3% |
9 | 5 | 2.7% |
6 | 5 | 2.7% |
4 | 3 | 1.6% |
3 | 3 | 1.6% |
Distinct | 12 |
---|---|
Distinct (%) | 46.2% |
Missing | 4974 |
Missing (%) | 99.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38.31153846 |
Minimum | 0.5 |
---|---|
Maximum | 155 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 39.2 KiB |
Quantile statistics
Minimum | 0.5 |
---|---|
5-th percentile | 0.825 |
Q1 | 6 |
median | 6.4 |
Q3 | 63 |
95-th percentile | 155 |
Maximum | 155 |
Range | 154.5 |
Interquartile range (IQR) | 57 |
Descriptive statistics
Standard deviation | 49.62692275 |
---|---|
Coefficient of variation (CV) | 1.295351864 |
Kurtosis | 1.44674801 |
Mean | 38.31153846 |
Median Absolute Deviation (MAD) | 5.9 |
Skewness | 1.524962208 |
Sum | 996.1 |
Variance | 2462.831462 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 6 | 0.1% |
65 | 4 | 0.1% |
57 | 3 | 0.1% |
155 | 3 | 0.1% |
6.4 | 2 | < 0.1% |
0.5 | 2 | < 0.1% |
3.5 | 1 | < 0.1% |
4 | 1 | < 0.1% |
16 | 1 | < 0.1% |
20 | 1 | < 0.1% |
Other values (2) | 2 | < 0.1% |
(Missing) | 4974 |
Value | Count | Frequency (%) |
0.5 | 2 | < 0.1% |
1.8 | 1 | < 0.1% |
3.5 | 1 | < 0.1% |
4 | 1 | < 0.1% |
5 | 1 | < 0.1% |
6 | 6 | |
6.4 | 2 | < 0.1% |
16 | 1 | < 0.1% |
20 | 1 | < 0.1% |
57 | 3 |
Value | Count | Frequency (%) |
155 | 3 | |
65 | 4 | |
57 | 3 | |
20 | 1 | < 0.1% |
16 | 1 | < 0.1% |
6.4 | 2 | < 0.1% |
6 | 6 | |
5 | 1 | < 0.1% |
4 | 1 | < 0.1% |
3.5 | 1 | < 0.1% |
Distinct | 714 |
---|---|
Distinct (%) | 14.3% |
Missing | 4 |
Missing (%) | 0.1% |
Memory size | 39.2 KiB |
San Bernardino | 150 |
---|---|
Los Angeles | 89 |
Kern | 78 |
Riverside | 70 |
Jackson | 55 |
Other values (709) |
Length
Max length | 25 |
---|---|
Median length | 19 |
Mean length | 7.685348279 |
Min length | 3 |
Characters and Unicode
Total characters | 38396 |
---|---|
Distinct characters | 53 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 159 ? |
---|---|
Unique (%) | 3.2% |
Sample
1st row | Calhoun |
---|---|
2nd row | Calhoun |
3rd row | Pecos |
4th row | Pecos |
5th row | Salem |
Common Values
Value | Count | Frequency (%) |
San Bernardino | 150 | 3.0% |
Los Angeles | 89 | 1.8% |
Kern | 78 | 1.6% |
Riverside | 70 | 1.4% |
Jackson | 55 | 1.1% |
Harris | 54 | 1.1% |
Douglas | 50 | 1.0% |
Orange | 49 | 1.0% |
Franklin | 48 | 1.0% |
Wayne | 44 | 0.9% |
Other values (704) | 4309 |
Length
Value | Count | Frequency (%) |
san | 193 | 3.3% |
bernardino | 150 | 2.6% |
los | 89 | 1.5% |
angeles | 89 | 1.5% |
kern | 78 | 1.3% |
riverside | 70 | 1.2% |
new | 70 | 1.2% |
st | 59 | 1.0% |
jackson | 55 | 0.9% |
harris | 54 | 0.9% |
Other values (740) | 4954 |
Most occurring characters
Value | Count | Frequency (%) |
a | 4002 | 10.4% |
e | 3788 | 9.9% |
n | 3339 | 8.7% |
o | 2827 | 7.4% |
r | 2739 | 7.1% |
i | 2166 | 5.6% |
l | 1824 | 4.8% |
s | 1661 | 4.3% |
t | 1468 | 3.8% |
d | 1168 | 3.0% |
Other values (43) | 13414 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 31589 | |
Uppercase Letter | 5940 | 15.5% |
Space Separator | 865 | 2.3% |
Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 4002 | |
e | 3788 | |
n | 3339 | |
o | 2827 | |
r | 2739 | |
i | 2166 | 6.9% |
l | 1824 | 5.8% |
s | 1661 | 5.3% |
t | 1468 | 4.6% |
d | 1168 | 3.7% |
Other values (16) | 6607 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 832 | |
C | 594 | 10.0% |
B | 462 | 7.8% |
L | 373 | 6.3% |
M | 362 | 6.1% |
A | 346 | 5.8% |
H | 345 | 5.8% |
W | 314 | 5.3% |
P | 278 | 4.7% |
D | 222 | 3.7% |
Other values (15) | 1812 |
Space Separator
Value | Count | Frequency (%) |
865 |
Other Punctuation
Value | Count | Frequency (%) |
' | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 37529 | |
Common | 867 | 2.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 4002 | 10.7% |
e | 3788 | 10.1% |
n | 3339 | 8.9% |
o | 2827 | 7.5% |
r | 2739 | 7.3% |
i | 2166 | 5.8% |
l | 1824 | 4.9% |
s | 1661 | 4.4% |
t | 1468 | 3.9% |
d | 1168 | 3.1% |
Other values (41) | 12547 |
Common
Value | Count | Frequency (%) |
865 | ||
' | 2 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 38396 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 4002 | 10.4% |
e | 3788 | 9.9% |
n | 3339 | 8.7% |
o | 2827 | 7.4% |
r | 2739 | 7.1% |
i | 2166 | 5.6% |
l | 1824 | 4.8% |
s | 1661 | 4.3% |
t | 1468 | 3.8% |
d | 1168 | 3.0% |
Other values (43) | 13414 |
Distinct | 1968 |
---|---|
Distinct (%) | 39.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -95.61612094 |
Minimum | -170.475661 |
---|---|
Maximum | 93.968056 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 4994 |
Negative (%) | 99.9% |
Memory size | 39.2 KiB |
Quantile statistics
Minimum | -170.475661 |
---|---|
5-th percentile | -132.821435 |
Q1 | -112.904028 |
median | -91.297894 |
Q3 | -80.780246 |
95-th percentile | -72.776111 |
Maximum | 93.968056 |
Range | 264.443717 |
Interquartile range (IQR) | 32.123782 |
Descriptive statistics
Standard deviation | 21.10085811 |
---|---|
Coefficient of variation (CV) | -0.2206830596 |
Kurtosis | 7.200314911 |
Mean | -95.61612094 |
Median Absolute Deviation (MAD) | 12.226066 |
Skewness | -0.2178077608 |
Sum | -478080.6047 |
Variance | 445.2462131 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-77.467185 | 27 | 0.5% |
-99.0919 | 27 | 0.5% |
-87.9861 | 24 | 0.5% |
-90.14868 | 22 | 0.4% |
-91.551221 | 20 | 0.4% |
-95.235078 | 20 | 0.4% |
-76.8408 | 18 | 0.4% |
-95.5306 | 17 | 0.3% |
-86.4006 | 17 | 0.3% |
-92.589364 | 17 | 0.3% |
Other values (1958) | 4791 |
Value | Count | Frequency (%) |
-170.475661 | 2 | < 0.1% |
-166.737211 | 4 | |
-165.429814 | 8 | |
-164.6544 | 1 | < 0.1% |
-164.538447 | 2 | < 0.1% |
-163.729072 | 4 | |
-163.553106 | 4 | |
-163.005833 | 4 | |
-162.965728 | 3 | 0.1% |
-162.880706 | 3 | 0.1% |
Value | Count | Frequency (%) |
93.968056 | 4 | 0.1% |
72.021944 | 2 | < 0.1% |
-68.21 | 1 | < 0.1% |
-68.63554 | 2 | < 0.1% |
-68.704368 | 13 | |
-69.583527 | 8 | |
-69.647441 | 2 | < 0.1% |
-69.812168 | 1 | < 0.1% |
-69.8658 | 4 | 0.1% |
-70.0517 | 2 | < 0.1% |
Distinct | 1972 |
---|---|
Distinct (%) | 39.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.24465948 |
Minimum | 19.6316 |
---|---|
Maximum | 70.642877 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 39.2 KiB |
Quantile statistics
Minimum | 19.6316 |
---|---|
5-th percentile | 29.72297075 |
Q1 | 34.632509 |
median | 38.7506 |
Q3 | 42.704391 |
95-th percentile | 48.2142 |
Maximum | 70.642877 |
Range | 51.011277 |
Interquartile range (IQR) | 8.071882 |
Descriptive statistics
Standard deviation | 7.047003619 |
---|---|
Coefficient of variation (CV) | 0.179565926 |
Kurtosis | 4.077793006 |
Mean | 39.24465948 |
Median Absolute Deviation (MAD) | 4.0464 |
Skewness | 1.343023199 |
Sum | 196223.2974 |
Variance | 49.66026 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.193584 | 27 | 0.5% |
28.9275 | 27 | 0.5% |
36.0278 | 24 | 0.5% |
35.074087 | 22 | 0.4% |
40.971826 | 20 | 0.4% |
38.974022 | 20 | 0.4% |
42.9281 | 18 | 0.4% |
29.9417 | 17 | 0.3% |
33.435 | 17 | 0.3% |
33.296146 | 17 | 0.3% |
Other values (1962) | 4791 |
Value | Count | Frequency (%) |
19.6316 | 2 | < 0.1% |
19.7041 | 2 | < 0.1% |
19.7052 | 5 | |
19.7203 | 2 | < 0.1% |
19.7264 | 2 | < 0.1% |
19.7317 | 7 | |
20.0252 | 1 | < 0.1% |
20.0939 | 6 | |
20.257252 | 1 | < 0.1% |
21.106 | 4 |
Value | Count | Frequency (%) |
70.642877 | 5 | |
70.4826 | 7 | |
70.220565 | 6 | |
70.125617 | 4 | |
69.740833 | 4 | |
68.348424 | 4 | |
68.13795 | 5 | |
67.726644 | 2 | < 0.1% |
67.570931 | 3 | |
67.08798 | 3 |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
MW |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Characters and Unicode
Total characters | 10000 |
---|---|
Distinct characters | 2 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | MW |
---|---|
2nd row | MW |
3rd row | MW |
4th row | MW |
5th row | MW |
Common Values
Value | Count | Frequency (%) |
MW | 5000 |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
mw | 5000 |
Most occurring characters
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 10000 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
MW |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Characters and Unicode
Total characters | 10000 |
---|---|
Distinct characters | 2 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | MW |
---|---|
2nd row | MW |
3rd row | MW |
4th row | MW |
5th row | MW |
Common Values
Value | Count | Frequency (%) |
MW | 5000 |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
mw | 5000 |
Most occurring characters
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 10000 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
MW |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Characters and Unicode
Total characters | 10000 |
---|---|
Distinct characters | 2 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | MW |
---|---|
2nd row | MW |
3rd row | MW |
4th row | MW |
5th row | MW |
Common Values
Value | Count | Frequency (%) |
MW | 5000 |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
mw | 5000 |
Most occurring characters
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 10000 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
MW |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Characters and Unicode
Total characters | 10000 |
---|---|
Distinct characters | 2 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | MW |
---|---|
2nd row | MW |
3rd row | MW |
4th row | MW |
5th row | MW |
Common Values
Value | Count | Frequency (%) |
MW | 5000 |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
mw | 5000 |
Most occurring characters
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 10000 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 39.2 KiB |
MW |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Characters and Unicode
Total characters | 10000 |
---|---|
Distinct characters | 2 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | MW |
---|---|
2nd row | MW |
3rd row | MW |
4th row | MW |
5th row | MW |
Common Values
Value | Count | Frequency (%) |
MW | 5000 |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
mw | 5000 |
Most occurring characters
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 10000 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 5000 | |
W | 5000 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
Unnamed: 0 | period | stateid | stateName | sector | sectorName | entityid | entityName | plantid | plantName | generatorid | unit | technology | energy_source_code | energy-source-desc | prime_mover_code | balancing_authority_code | balancing-authority-name | status | statusDescription | nameplate-capacity-mw | net-summer-capacity-mw | net-winter-capacity-mw | operating-year-month | planned-retirement-year-month | planned-derate-year-month | planned-derate-summer-cap-mw | planned-uprate-year-month | planned-uprate-summer-cap-mw | county | longitude | latitude | nameplate-capacity-mw-units | net-summer-capacity-mw-units | net-winter-capacity-mw-units | planned-derate-summer-cap-mw-units | planned-uprate-summer-cap-mw-units | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 0 | 2020-09 | TX | Texas | ipp-non-chp | IPP Non-CHP | 61199 | Peaker Power, LLC | 60459 | Port Comfort Power LLC | PC1 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | ERCO | Electric Reliability Council of Texas, Inc. | OP | Operating | 60.5 | 43.0 | 46.0 | 2017-07 | NaN | NaN | NaN | NaN | NaN | Calhoun | -96.546210 | 28.648070 | MW | MW | MW | MW | MW |
1 | 1 | 2020-09 | TX | Texas | ipp-non-chp | IPP Non-CHP | 61199 | Peaker Power, LLC | 60459 | Port Comfort Power LLC | PC2 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | ERCO | Electric Reliability Council of Texas, Inc. | OP | Operating | 60.5 | 43.0 | 46.0 | 2017-07 | NaN | NaN | NaN | NaN | NaN | Calhoun | -96.546210 | 28.648070 | MW | MW | MW | MW | MW |
2 | 2 | 2020-09 | TX | Texas | ipp-non-chp | IPP Non-CHP | 14628 | Pecos Wind I LP | 55796 | Woodward Mountain I | 1 | NaN | Onshore Wind Turbine | WND | Wind | WT | ERCO | Electric Reliability Council of Texas, Inc. | OP | Operating | 82.0 | 82.0 | 82.0 | 2001-07 | NaN | NaN | NaN | NaN | NaN | Pecos | -102.414067 | 30.951400 | MW | MW | MW | MW | MW |
3 | 3 | 2020-09 | TX | Texas | ipp-non-chp | IPP Non-CHP | 14629 | Pecos Wind II LP | 55795 | Woodward Mountain II | 1 | NaN | Onshore Wind Turbine | WND | Wind | WT | ERCO | Electric Reliability Council of Texas, Inc. | OP | Operating | 78.0 | 78.0 | 78.0 | 2001-07 | NaN | NaN | NaN | NaN | NaN | Pecos | -102.414067 | 30.951400 | MW | MW | MW | MW | MW |
4 | 4 | 2020-09 | NJ | New Jersey | ipp-non-chp | IPP Non-CHP | 50160 | Pedricktown Cogeneration Company LP | 10099 | Pedricktown Cogeneration Company LP | GEN1 | CC1 | Natural Gas Fired Combined Cycle | NG | Natural Gas | CT | PJM | PJM Interconnection, LLC | OP | Operating | 95.2 | 112.8 | 112.6 | 1992-03 | NaN | NaN | NaN | NaN | NaN | Salem | -75.423800 | 39.766800 | MW | MW | MW | MW | MW |
5 | 5 | 2020-09 | NJ | New Jersey | ipp-non-chp | IPP Non-CHP | 50160 | Pedricktown Cogeneration Company LP | 10099 | Pedricktown Cogeneration Company LP | GEN2 | CC1 | Natural Gas Fired Combined Cycle | NG | Natural Gas | CA | PJM | PJM Interconnection, LLC | OP | Operating | 45.0 | NaN | NaN | 1992-03 | NaN | NaN | NaN | NaN | NaN | Salem | -75.423800 | 39.766800 | MW | MW | MW | MW | MW |
6 | 6 | 2020-09 | MN | Minnesota | ipp-non-chp | IPP Non-CHP | 60823 | Pegasus Community Solar | 61175 | Pegasus Community Solar | CPCS1 | NaN | Solar Photovoltaic | SUN | Solar | PV | MISO | Midcontinent Independent Transmission System Operator, Inc.. | OP | Operating | 1.0 | 0.9 | 0.9 | 2017-08 | NaN | NaN | NaN | NaN | NaN | Stearns | -95.124682 | 45.495453 | MW | MW | MW | MW | MW |
7 | 7 | 2020-09 | MN | Minnesota | ipp-non-chp | IPP Non-CHP | 60823 | Pegasus Community Solar | 61175 | Pegasus Community Solar | CPCS2 | NaN | Solar Photovoltaic | SUN | Solar | PV | MISO | Midcontinent Independent Transmission System Operator, Inc.. | OP | Operating | 1.0 | 0.9 | 0.9 | 2017-08 | NaN | NaN | NaN | NaN | NaN | Stearns | -95.124682 | 45.495453 | MW | MW | MW | MW | MW |
8 | 8 | 2020-09 | MI | Michigan | ipp-non-chp | IPP Non-CHP | 61521 | Pegasus Wind, LLC | 61916 | Pegasus Wind | PWEC | NaN | Onshore Wind Turbine | WND | Wind | WT | MISO | Midcontinent Independent Transmission System Operator, Inc.. | OP | Operating | 48.0 | 48.0 | 48.0 | 2019-12 | NaN | NaN | NaN | NaN | NaN | Tuscola | -83.507210 | 43.452003 | MW | MW | MW | MW | MW |
9 | 9 | 2020-09 | MI | Michigan | ipp-non-chp | IPP Non-CHP | 61521 | Pegasus Wind, LLC | 61916 | Pegasus Wind | PWEC2 | NaN | Onshore Wind Turbine | WND | Wind | WT | MISO | Midcontinent Independent Transmission System Operator, Inc.. | OP | Operating | 130.0 | 130.0 | 130.0 | 2020-09 | NaN | NaN | NaN | NaN | NaN | Tuscola | -83.507210 | 43.452003 | MW | MW | MW | MW | MW |
Last rows
Unnamed: 0 | period | stateid | stateName | sector | sectorName | entityid | entityName | plantid | plantName | generatorid | unit | technology | energy_source_code | energy-source-desc | prime_mover_code | balancing_authority_code | balancing-authority-name | status | statusDescription | nameplate-capacity-mw | net-summer-capacity-mw | net-winter-capacity-mw | operating-year-month | planned-retirement-year-month | planned-derate-year-month | planned-derate-summer-cap-mw | planned-uprate-year-month | planned-uprate-summer-cap-mw | county | longitude | latitude | nameplate-capacity-mw-units | net-summer-capacity-mw-units | net-winter-capacity-mw-units | planned-derate-summer-cap-mw-units | planned-uprate-summer-cap-mw-units | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
4990 | 4990 | 2017-01 | IL | Illinois | electric-utility | Electric Utility | 3153 | City of Casey - (IL) | 56053 | Casey City of | 2 | NaN | Petroleum Liquids | DFO | Disillate Fuel Oil | IC | MISO | Midcontinent Independent Transmission System Operator, Inc.. | SB | Standby/Backup: available for service but not normally used | 1.8 | 1.8 | 1.8 | 2002-05 | NaN | NaN | NaN | NaN | NaN | Clark | -87.992592 | 39.310555 | MW | MW | MW | MW | MW |
4991 | 4991 | 2017-01 | IL | Illinois | electric-utility | Electric Utility | 3153 | City of Casey - (IL) | 56053 | Casey City of | 3 | NaN | Petroleum Liquids | DFO | Disillate Fuel Oil | IC | MISO | Midcontinent Independent Transmission System Operator, Inc.. | SB | Standby/Backup: available for service but not normally used | 1.8 | 1.8 | 1.8 | 2002-05 | NaN | NaN | NaN | NaN | NaN | Clark | -87.992592 | 39.310555 | MW | MW | MW | MW | MW |
4992 | 4992 | 2017-01 | CO | Colorado | electric-utility | Electric Utility | 3227 | City of Center - (CO) | 491 | Center | 3 | NaN | Petroleum Liquids | DFO | Disillate Fuel Oil | IC | PSCO | Public Service Company of Colorado | SB | Standby/Backup: available for service but not normally used | 0.5 | 0.5 | 0.5 | 1963-07 | NaN | NaN | NaN | NaN | NaN | Saguache | -106.104670 | 37.753606 | MW | MW | MW | MW | MW |
4993 | 4993 | 2017-01 | CO | Colorado | electric-utility | Electric Utility | 3227 | City of Center - (CO) | 491 | Center | 5 | NaN | Petroleum Liquids | DFO | Disillate Fuel Oil | IC | PSCO | Public Service Company of Colorado | SB | Standby/Backup: available for service but not normally used | 1.0 | 1.0 | 1.0 | 1959-08 | NaN | NaN | NaN | NaN | NaN | Saguache | -106.104670 | 37.753606 | MW | MW | MW | MW | MW |
4994 | 4994 | 2017-09 | SC | South Carolina | ipp-non-chp | IPP Non-CHP | 54810 | Broad River Energy LLC | 55166 | Broad River Energy Center | CT01 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | DUK | Duke Energy Carolinas | OP | Operating | 197.0 | 173.4 | 200.8 | 2000-07 | NaN | NaN | NaN | NaN | NaN | Cherokee | -81.575000 | 35.078600 | MW | MW | MW | MW | MW |
4995 | 4995 | 2017-09 | SC | South Carolina | ipp-non-chp | IPP Non-CHP | 54810 | Broad River Energy LLC | 55166 | Broad River Energy Center | CT02 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | DUK | Duke Energy Carolinas | OP | Operating | 197.0 | 170.5 | 197.3 | 2000-07 | NaN | NaN | NaN | NaN | NaN | Cherokee | -81.575000 | 35.078600 | MW | MW | MW | MW | MW |
4996 | 4996 | 2017-09 | SC | South Carolina | ipp-non-chp | IPP Non-CHP | 54810 | Broad River Energy LLC | 55166 | Broad River Energy Center | CT03 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | DUK | Duke Energy Carolinas | OP | Operating | 197.0 | 169.4 | 196.0 | 2000-07 | NaN | NaN | NaN | NaN | NaN | Cherokee | -81.575000 | 35.078600 | MW | MW | MW | MW | MW |
4997 | 4997 | 2017-09 | SC | South Carolina | ipp-non-chp | IPP Non-CHP | 54810 | Broad River Energy LLC | 55166 | Broad River Energy Center | CT04 | NaN | Natural Gas Fired Combustion Turbine | NG | Natural Gas | GT | DUK | Duke Energy Carolinas | OP | Operating | 197.0 | 174.0 | 201.4 | 2001-06 | NaN | NaN | NaN | NaN | NaN | Cherokee | -81.575000 | 35.078600 | MW | MW | MW | MW | MW |
4998 | 4998 | 2017-06 | NY | New York | ipp-non-chp | IPP Non-CHP | 5511 | CCI Roseton LLC | 8006 | Roseton Generating Facility | 2 | NaN | Natural Gas Steam Turbine | NG | Natural Gas | ST | NYIS | New York Independent System Operator | OP | Operating | 621.0 | 604.0 | 605.7 | 1974-09 | NaN | NaN | NaN | NaN | NaN | Orange | -73.966269 | 41.573783 | MW | MW | MW | MW | MW |
4999 | 4999 | 2017-06 | NC | North Carolina | ipp-non-chp | IPP Non-CHP | 60865 | CD Global Solar Holdings, LLC | 61258 | Innovative Solar 35, LLC | ISS35 | NaN | Solar Photovoltaic | SUN | Solar | PV | CPLE | Duke Energy Progress East | OP | Operating | 1.9 | 1.9 | 1.9 | 2017-02 | NaN | NaN | NaN | NaN | NaN | Duplin | -77.825000 | 35.046000 | MW | MW | MW | MW | MW |