Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 5000 |
| Missing cells | 12105 |
| Missing cells (%) | 10.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 898.6 KiB |
| Average record size in memory | 184.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 14 |
generation-units has constant value "megawatthours" | Constant |
gross-generation-units has constant value "megawatthours" | Constant |
total-consumption-btu-units has constant value "MMBtu" | Constant |
consumption-for-eg-btu-units has constant value "MMBtu" | Constant |
plantName has a high cardinality: 1196 distinct values | High cardinality |
Unnamed: 0 is highly correlated with period and 2 other fields | High correlation |
plantCode is highly correlated with period and 2 other fields | High correlation |
generation is highly correlated with fuel2002 and 5 other fields | High correlation |
gross-generation is highly correlated with generation and 4 other fields | High correlation |
total-consumption is highly correlated with fuel2002 and 4 other fields | High correlation |
total-consumption-btu is highly correlated with generation and 4 other fields | High correlation |
consumption-for-eg is highly correlated with fuel2002 and 5 other fields | High correlation |
consumption-for-eg-btu is highly correlated with generation and 3 other fields | High correlation |
average-heat-content is highly correlated with fuel2002 and 6 other fields | High correlation |
period is highly correlated with Unnamed: 0 and 3 other fields | High correlation |
fuel2002 is highly correlated with fuelTypeDescription and 9 other fields | High correlation |
fuelTypeDescription is highly correlated with fuel2002 and 6 other fields | High correlation |
state is highly correlated with Unnamed: 0 and 9 other fields | High correlation |
stateDescription is highly correlated with Unnamed: 0 and 9 other fields | High correlation |
primeMover is highly correlated with consumption-for-eg-btu-units and 3 other fields | High correlation |
total-consumption-units is highly correlated with fuel2002 and 6 other fields | High correlation |
consumption-for-eg-units is highly correlated with fuel2002 and 6 other fields | High correlation |
average-heat-content-units is highly correlated with fuel2002 and 6 other fields | High correlation |
generation-units is highly correlated with consumption-for-eg-btu-units and 11 other fields | High correlation |
gross-generation-units is highly correlated with consumption-for-eg-btu-units and 11 other fields | High correlation |
total-consumption-btu-units is highly correlated with consumption-for-eg-btu-units and 11 other fields | High correlation |
consumption-for-eg-btu-units is highly correlated with total-consumption-units and 11 other fields | High correlation |
total-consumption has 1269 (25.4%) missing values | Missing |
total-consumption-units has 2326 (46.5%) missing values | Missing |
consumption-for-eg has 1269 (25.4%) missing values | Missing |
consumption-for-eg-units has 2326 (46.5%) missing values | Missing |
average-heat-content has 2589 (51.8%) missing values | Missing |
average-heat-content-units has 2326 (46.5%) missing values | Missing |
Unnamed: 0 is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
generation has 594 (11.9%) zeros | Zeros |
gross-generation has 606 (12.1%) zeros | Zeros |
total-consumption has 1320 (26.4%) zeros | Zeros |
total-consumption-btu has 597 (11.9%) zeros | Zeros |
consumption-for-eg has 1343 (26.9%) zeros | Zeros |
consumption-for-eg-btu has 596 (11.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-17 22:34:34.934056 |
|---|---|
| Analysis finished | 2022-11-17 22:34:52.566320 |
| Duration | 17.63 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 5000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2499.5 |
| Minimum | 0 |
|---|---|
| Maximum | 4999 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 249.95 |
| Q1 | 1249.75 |
| median | 2499.5 |
| Q3 | 3749.25 |
| 95-th percentile | 4749.05 |
| Maximum | 4999 |
| Range | 4999 |
| Interquartile range (IQR) | 2499.5 |
Descriptive statistics
| Standard deviation | 1443.520003 |
|---|---|
| Coefficient of variation (CV) | 0.577523506 |
| Kurtosis | -1.2 |
| Mean | 2499.5 |
| Median Absolute Deviation (MAD) | 1250 |
| Skewness | 0 |
| Sum | 12497500 |
| Variance | 2083750 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 3330 | 1 | < 0.1% |
| 3337 | 1 | < 0.1% |
| 3336 | 1 | < 0.1% |
| 3335 | 1 | < 0.1% |
| 3334 | 1 | < 0.1% |
| 3333 | 1 | < 0.1% |
| 3332 | 1 | < 0.1% |
| 3331 | 1 | < 0.1% |
| 3329 | 1 | < 0.1% |
| Other values (4990) | 4990 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 4999 | 1 | |
| 4998 | 1 | |
| 4997 | 1 | |
| 4996 | 1 | |
| 4995 | 1 | |
| 4994 | 1 | |
| 4993 | 1 | |
| 4992 | 1 | |
| 4991 | 1 | |
| 4990 | 1 |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| 2001-04 | |
|---|---|
| 2001-12 | |
| 2001-05 | |
| 2002-04 | |
| 2002-05 | |
| Other values (5) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 35000 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2001-04 |
|---|---|
| 2nd row | 2001-04 |
| 3rd row | 2001-04 |
| 4th row | 2001-04 |
| 5th row | 2001-04 |
Common Values
| Value | Count | Frequency (%) |
| 2001-04 | 2185 | |
| 2001-12 | 1211 | |
| 2001-05 | 401 | 8.0% |
| 2002-04 | 353 | 7.1% |
| 2002-05 | 325 | 6.5% |
| 2002-11 | 310 | 6.2% |
| 2001-06 | 83 | 1.7% |
| 2003-02 | 83 | 1.7% |
| 2001-09 | 44 | 0.9% |
| 2001-10 | 5 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2001-04 | 2185 | |
| 2001-12 | 1211 | |
| 2001-05 | 401 | 8.0% |
| 2002-04 | 353 | 7.1% |
| 2002-05 | 325 | 6.5% |
| 2002-11 | 310 | 6.2% |
| 2001-06 | 83 | 1.7% |
| 2003-02 | 83 | 1.7% |
| 2001-09 | 44 | 0.9% |
| 2001-10 | 5 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 13479 | |
| 2 | 7282 | |
| 1 | 5765 | |
| - | 5000 | 14.3% |
| 4 | 2538 | 7.3% |
| 5 | 726 | 2.1% |
| 6 | 83 | 0.2% |
| 3 | 83 | 0.2% |
| 9 | 44 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 30000 | |
| Dash Punctuation | 5000 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 13479 | |
| 2 | 7282 | |
| 1 | 5765 | |
| 4 | 2538 | 8.5% |
| 5 | 726 | 2.4% |
| 6 | 83 | 0.3% |
| 3 | 83 | 0.3% |
| 9 | 44 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 35000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 13479 | |
| 2 | 7282 | |
| 1 | 5765 | |
| - | 5000 | 14.3% |
| 4 | 2538 | 7.3% |
| 5 | 726 | 2.1% |
| 6 | 83 | 0.2% |
| 3 | 83 | 0.2% |
| 9 | 44 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 13479 | |
| 2 | 7282 | |
| 1 | 5765 | |
| - | 5000 | 14.3% |
| 4 | 2538 | 7.3% |
| 5 | 726 | 2.1% |
| 6 | 83 | 0.2% |
| 3 | 83 | 0.2% |
| 9 | 44 | 0.1% |
| Distinct | 1173 |
|---|---|
| Distinct (%) | 23.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31046.0786 |
| Minimum | 2 |
|---|---|
| Maximum | 55564 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 1048 |
| Q1 | 7183 |
| median | 50095 |
| Q3 | 54085 |
| 95-th percentile | 55037 |
| Maximum | 55564 |
| Range | 55562 |
| Interquartile range (IQR) | 46902 |
Descriptive statistics
| Standard deviation | 23186.19182 |
|---|---|
| Coefficient of variation (CV) | 0.7468315764 |
| Kurtosis | -1.912303771 |
| Mean | 31046.0786 |
| Median Absolute Deviation (MAD) | 5258 |
| Skewness | -0.1562502582 |
| Sum | 155230393 |
| Variance | 537599491.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 54428 | 21 | 0.4% |
| 54464 | 19 | 0.4% |
| 50366 | 18 | 0.4% |
| 54090 | 15 | 0.3% |
| 54087 | 15 | 0.3% |
| 50398 | 15 | 0.3% |
| 52152 | 15 | 0.3% |
| 50395 | 15 | 0.3% |
| 54091 | 15 | 0.3% |
| 50479 | 14 | 0.3% |
| Other values (1163) | 4838 |
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 3 | 2 | < 0.1% |
| 170 | 3 | |
| 171 | 5 | |
| 173 | 7 | |
| 174 | 3 | |
| 180 | 3 | |
| 182 | 3 | |
| 187 | 3 | |
| 188 | 3 |
| Value | Count | Frequency (%) |
| 55564 | 3 | |
| 55563 | 3 | |
| 55562 | 3 | |
| 55561 | 3 | |
| 55560 | 3 | |
| 55557 | 3 | |
| 55554 | 3 | |
| 55546 | 3 | |
| 55545 | 3 | |
| 55542 | 5 |
| Distinct | 1196 |
|---|---|
| Distinct (%) | 23.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Oil Storage | 30 |
|---|---|
| University of Notre Dame | 18 |
| Louisiana Mill | 15 |
| Mansfield Mill | 15 |
| Nekoosa Mill | 15 |
| Other values (1191) |
Length
| Max length | 39 |
|---|---|
| Median length | 28 |
| Mean length | 16.3592 |
| Min length | 2 |
Characters and Unicode
| Total characters | 81796 |
|---|---|
| Distinct characters | 69 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 16 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Chevak |
|---|---|
| 2nd row | Chevak |
| 3rd row | EEK |
| 4th row | EEK |
| 5th row | EEK |
Common Values
| Value | Count | Frequency (%) |
| Oil Storage | 30 | 0.6% |
| University of Notre Dame | 18 | 0.4% |
| Louisiana Mill | 15 | 0.3% |
| Mansfield Mill | 15 | 0.3% |
| Nekoosa Mill | 15 | 0.3% |
| Printing & Communication Paper | 15 | 0.3% |
| Georgetown Mill | 15 | 0.3% |
| International Paper Co Savanna | 15 | 0.3% |
| Eagle Point Cogen | 14 | 0.3% |
| General Electric Erie PA Power | 14 | 0.3% |
| Other values (1186) | 4834 |
Length
| Value | Count | Frequency (%) |
| cogen | 252 | 2.0% |
| hydro | 250 | 1.9% |
| co | 226 | 1.8% |
| energy | 213 | 1.7% |
| power | 200 | 1.6% |
| mill | 193 | 1.5% |
| inc | 185 | 1.4% |
| plant | 136 | 1.1% |
| recovery | 130 | 1.0% |
| corp | 118 | 0.9% |
| Other values (1510) | 10974 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7911 | 9.7% | |
| e | 7512 | 9.2% |
| o | 5601 | 6.8% |
| a | 5533 | 6.8% |
| r | 5387 | 6.6% |
| n | 5039 | 6.2% |
| i | 4249 | 5.2% |
| t | 3970 | 4.9% |
| l | 3826 | 4.7% |
| s | 2977 | 3.6% |
| Other values (59) | 29791 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 58499 | |
| Uppercase Letter | 14885 | 18.2% |
| Space Separator | 7911 | 9.7% |
| Decimal Number | 368 | 0.4% |
| Other Punctuation | 65 | 0.1% |
| Dash Punctuation | 40 | < 0.1% |
| Open Punctuation | 14 | < 0.1% |
| Close Punctuation | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7512 | |
| o | 5601 | |
| a | 5533 | |
| r | 5387 | |
| n | 5039 | |
| i | 4249 | 7.3% |
| t | 3970 | 6.8% |
| l | 3826 | 6.5% |
| s | 2977 | 5.1% |
| c | 1866 | 3.2% |
| Other values (16) | 12539 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1908 | 12.8% |
| P | 1495 | 10.0% |
| S | 1183 | 7.9% |
| M | 878 | 5.9% |
| R | 849 | 5.7% |
| L | 844 | 5.7% |
| H | 775 | 5.2% |
| A | 665 | 4.5% |
| G | 649 | 4.4% |
| I | 642 | 4.3% |
| Other values (16) | 4997 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 68 | |
| 2 | 63 | |
| 3 | 51 | |
| 5 | 44 | |
| 9 | 38 | |
| 4 | 35 | |
| 8 | 26 | 7.1% |
| 6 | 18 | 4.9% |
| 0 | 16 | 4.3% |
| 7 | 9 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 48 | |
| # | 11 | 16.9% |
| ' | 6 | 9.2% |
Space Separator
| Value | Count | Frequency (%) |
| 7911 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 40 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 14 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 73384 | |
| Common | 8412 | 10.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 7512 | 10.2% |
| o | 5601 | 7.6% |
| a | 5533 | 7.5% |
| r | 5387 | 7.3% |
| n | 5039 | 6.9% |
| i | 4249 | 5.8% |
| t | 3970 | 5.4% |
| l | 3826 | 5.2% |
| s | 2977 | 4.1% |
| C | 1908 | 2.6% |
| Other values (42) | 27382 |
Common
| Value | Count | Frequency (%) |
| 7911 | ||
| 1 | 68 | 0.8% |
| 2 | 63 | 0.7% |
| 3 | 51 | 0.6% |
| & | 48 | 0.6% |
| 5 | 44 | 0.5% |
| - | 40 | 0.5% |
| 9 | 38 | 0.5% |
| 4 | 35 | 0.4% |
| 8 | 26 | 0.3% |
| Other values (7) | 88 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 81796 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7911 | 9.7% | |
| e | 7512 | 9.2% |
| o | 5601 | 6.8% |
| a | 5533 | 6.8% |
| r | 5387 | 6.6% |
| n | 5039 | 6.2% |
| i | 4249 | 5.2% |
| t | 3970 | 4.9% |
| l | 3826 | 4.7% |
| s | 2977 | 3.6% |
| Other values (59) | 29791 |
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| ALL | |
|---|---|
| NG | |
| WAT | |
| DFO | |
| BIT | |
| Other values (28) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.7798 |
| Min length | 2 |
Characters and Unicode
| Total characters | 13899 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ALL |
|---|---|
| 2nd row | DFO |
| 3rd row | DFO |
| 4th row | ALL |
| 5th row | DFO |
Common Values
| Value | Count | Frequency (%) |
| ALL | 1269 | |
| NG | 957 | |
| WAT | 716 | |
| DFO | 684 | |
| BIT | 213 | 4.3% |
| WDS | 180 | 3.6% |
| RFO | 177 | 3.5% |
| LFG | 130 | 2.6% |
| WND | 92 | 1.8% |
| BLQ | 73 | 1.5% |
| Other values (23) | 509 |
Length
| Value | Count | Frequency (%) |
| all | 1269 | |
| ng | 957 | |
| wat | 716 | |
| dfo | 684 | |
| bit | 213 | 4.3% |
| wds | 180 | 3.6% |
| rfo | 177 | 3.5% |
| lfg | 130 | 2.6% |
| wnd | 92 | 1.8% |
| blq | 73 | 1.5% |
| Other values (23) | 509 |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 2781 | |
| A | 2001 | |
| G | 1229 | |
| N | 1126 | |
| W | 1054 | 7.6% |
| O | 1044 | 7.5% |
| F | 1027 | 7.4% |
| D | 998 | 7.2% |
| T | 976 | 7.0% |
| B | 473 | 3.4% |
| Other values (11) | 1190 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 13899 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 2781 | |
| A | 2001 | |
| G | 1229 | |
| N | 1126 | |
| W | 1054 | 7.6% |
| O | 1044 | 7.5% |
| F | 1027 | 7.4% |
| D | 998 | 7.2% |
| T | 976 | 7.0% |
| B | 473 | 3.4% |
| Other values (11) | 1190 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13899 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 2781 | |
| A | 2001 | |
| G | 1229 | |
| N | 1126 | |
| W | 1054 | 7.6% |
| O | 1044 | 7.5% |
| F | 1027 | 7.4% |
| D | 998 | 7.2% |
| T | 976 | 7.0% |
| B | 473 | 3.4% |
| Other values (11) | 1190 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13899 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| L | 2781 | |
| A | 2001 | |
| G | 1229 | |
| N | 1126 | |
| W | 1054 | 7.6% |
| O | 1044 | 7.5% |
| F | 1027 | 7.4% |
| D | 998 | 7.2% |
| T | 976 | 7.0% |
| B | 473 | 3.4% |
| Other values (11) | 1190 |
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Total | |
|---|---|
| Natural Gas | |
| Hydroelectric Conventional | |
| Distillate Fuel Oil | |
| Wood Waste Solids | |
| Other values (14) |
Length
| Max length | 28 |
|---|---|
| Median length | 24 |
| Mean length | 13.2 |
| Min length | 4 |
Characters and Unicode
| Total characters | 66000 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Total |
|---|---|
| 2nd row | Distillate Fuel Oil |
| 3rd row | Distillate Fuel Oil |
| 4th row | Total |
| 5th row | Distillate Fuel Oil |
Common Values
| Value | Count | Frequency (%) |
| Total | 1269 | |
| Natural Gas | 957 | |
| Hydroelectric Conventional | 708 | |
| Distillate Fuel Oil | 684 | |
| Wood Waste Solids | 269 | 5.4% |
| Coal | 266 | 5.3% |
| Municiapl Landfill Gas | 196 | 3.9% |
| Residual Fuel Oil | 177 | 3.5% |
| Other | 112 | 2.2% |
| Wind | 92 | 1.8% |
| Other values (9) | 270 | 5.4% |
Length
| Value | Count | Frequency (%) |
| total | 1269 | |
| gas | 1153 | |
| natural | 957 | |
| oil | 887 | |
| fuel | 861 | |
| hydroelectric | 716 | 7.4% |
| conventional | 708 | 7.4% |
| distillate | 684 | 7.1% |
| waste | 311 | 3.2% |
| other | 282 | 2.9% |
| Other values (18) | 1803 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 8262 | |
| a | 7110 | |
| t | 5683 | 8.6% |
| e | 4943 | 7.5% |
| i | 4831 | 7.3% |
| o | 4676 | 7.1% |
| 4631 | 7.0% | |
| r | 2833 | 4.3% |
| s | 2830 | 4.3% |
| n | 2712 | 4.1% |
| Other values (27) | 17489 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 51920 | |
| Uppercase Letter | 9449 | 14.3% |
| Space Separator | 4631 | 7.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 8262 | |
| a | 7110 | |
| t | 5683 | |
| e | 4943 | |
| i | 4831 | |
| o | 4676 | |
| r | 2833 | 5.5% |
| s | 2830 | 5.5% |
| n | 2712 | 5.2% |
| u | 2243 | 4.3% |
| Other values (12) | 5797 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1269 | |
| G | 1249 | |
| O | 1117 | |
| C | 1024 | |
| N | 967 | |
| F | 861 | |
| H | 716 | |
| D | 684 | |
| W | 672 | |
| S | 279 | 3.0% |
| Other values (4) | 611 |
Space Separator
| Value | Count | Frequency (%) |
| 4631 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 61369 | |
| Common | 4631 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 8262 | |
| a | 7110 | |
| t | 5683 | 9.3% |
| e | 4943 | 8.1% |
| i | 4831 | 7.9% |
| o | 4676 | 7.6% |
| r | 2833 | 4.6% |
| s | 2830 | 4.6% |
| n | 2712 | 4.4% |
| u | 2243 | 3.7% |
| Other values (26) | 15246 |
Common
| Value | Count | Frequency (%) |
| 4631 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 66000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 8262 | |
| a | 7110 | |
| t | 5683 | 8.6% |
| e | 4943 | 7.5% |
| i | 4831 | 7.3% |
| o | 4676 | 7.1% |
| 4631 | 7.0% | |
| r | 2833 | 4.3% |
| s | 2830 | 4.3% |
| n | 2712 | 4.1% |
| Other values (27) | 17489 |
| Distinct | 50 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| CA | |
|---|---|
| TX | 300 |
| NY | 294 |
| TN | 274 |
| AK | 248 |
| Other values (45) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AK |
|---|---|
| 2nd row | AK |
| 3rd row | AK |
| 4th row | AK |
| 5th row | AK |
Common Values
| Value | Count | Frequency (%) |
| CA | 674 | 13.5% |
| TX | 300 | 6.0% |
| NY | 294 | 5.9% |
| TN | 274 | 5.5% |
| AK | 248 | 5.0% |
| FL | 247 | 4.9% |
| PA | 177 | 3.5% |
| GA | 167 | 3.3% |
| MA | 165 | 3.3% |
| LA | 139 | 2.8% |
| Other values (40) | 2315 |
Length
| Value | Count | Frequency (%) |
| ca | 674 | 13.5% |
| tx | 300 | 6.0% |
| ny | 294 | 5.9% |
| tn | 274 | 5.5% |
| ak | 248 | 5.0% |
| fl | 247 | 4.9% |
| pa | 177 | 3.5% |
| ga | 167 | 3.3% |
| ma | 165 | 3.3% |
| la | 139 | 2.8% |
| Other values (40) | 2315 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2068 | |
| N | 1221 | |
| C | 964 | |
| I | 685 | 6.9% |
| T | 671 | 6.7% |
| M | 633 | 6.3% |
| L | 593 | 5.9% |
| Y | 320 | 3.2% |
| K | 302 | 3.0% |
| X | 300 | 3.0% |
| Other values (14) | 2243 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2068 | |
| N | 1221 | |
| C | 964 | |
| I | 685 | 6.9% |
| T | 671 | 6.7% |
| M | 633 | 6.3% |
| L | 593 | 5.9% |
| Y | 320 | 3.2% |
| K | 302 | 3.0% |
| X | 300 | 3.0% |
| Other values (14) | 2243 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 2068 | |
| N | 1221 | |
| C | 964 | |
| I | 685 | 6.9% |
| T | 671 | 6.7% |
| M | 633 | 6.3% |
| L | 593 | 5.9% |
| Y | 320 | 3.2% |
| K | 302 | 3.0% |
| X | 300 | 3.0% |
| Other values (14) | 2243 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 2068 | |
| N | 1221 | |
| C | 964 | |
| I | 685 | 6.9% |
| T | 671 | 6.7% |
| M | 633 | 6.3% |
| L | 593 | 5.9% |
| Y | 320 | 3.2% |
| K | 302 | 3.0% |
| X | 300 | 3.0% |
| Other values (14) | 2243 |
| Distinct | 50 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| California | |
|---|---|
| Texas | 300 |
| New York | 294 |
| Tennessee | 274 |
| Alaska | 248 |
| Other values (45) |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 8.5858 |
| Min length | 4 |
Characters and Unicode
| Total characters | 42929 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Alaska |
|---|---|
| 2nd row | Alaska |
| 3rd row | Alaska |
| 4th row | Alaska |
| 5th row | Alaska |
Common Values
| Value | Count | Frequency (%) |
| California | 674 | 13.5% |
| Texas | 300 | 6.0% |
| New York | 294 | 5.9% |
| Tennessee | 274 | 5.5% |
| Alaska | 248 | 5.0% |
| Florida | 247 | 4.9% |
| Pennsylvania | 177 | 3.5% |
| Georgia | 167 | 3.3% |
| Massachusetts | 165 | 3.3% |
| Louisiana | 139 | 2.8% |
| Other values (40) | 2315 |
Length
| Value | Count | Frequency (%) |
| california | 674 | 11.4% |
| new | 569 | 9.7% |
| texas | 300 | 5.1% |
| york | 294 | 5.0% |
| tennessee | 274 | 4.7% |
| alaska | 248 | 4.2% |
| florida | 247 | 4.2% |
| carolina | 208 | 3.5% |
| pennsylvania | 177 | 3.0% |
| north | 177 | 3.0% |
| Other values (42) | 2719 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5871 | |
| i | 4547 | 10.6% |
| n | 3791 | 8.8% |
| s | 3346 | 7.8% |
| e | 3304 | 7.7% |
| o | 3151 | 7.3% |
| r | 2359 | 5.5% |
| l | 1995 | 4.6% |
| t | 1071 | 2.5% |
| C | 964 | 2.2% |
| Other values (36) | 12530 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36155 | |
| Uppercase Letter | 5887 | 13.7% |
| Space Separator | 887 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5871 | |
| i | 4547 | |
| n | 3791 | |
| s | 3346 | |
| e | 3304 | |
| o | 3151 | |
| r | 2359 | |
| l | 1995 | 5.5% |
| t | 1071 | 3.0% |
| h | 954 | 2.6% |
| Other values (14) | 5766 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 964 | |
| N | 797 | |
| M | 633 | |
| T | 574 | |
| A | 416 | 7.1% |
| I | 405 | 6.9% |
| W | 300 | 5.1% |
| Y | 294 | 5.0% |
| F | 247 | 4.2% |
| P | 177 | 3.0% |
| Other values (11) | 1080 |
Space Separator
| Value | Count | Frequency (%) |
| 887 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42042 | |
| Common | 887 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5871 | |
| i | 4547 | 10.8% |
| n | 3791 | 9.0% |
| s | 3346 | 8.0% |
| e | 3304 | 7.9% |
| o | 3151 | 7.5% |
| r | 2359 | 5.6% |
| l | 1995 | 4.7% |
| t | 1071 | 2.5% |
| C | 964 | 2.3% |
| Other values (35) | 11643 |
Common
| Value | Count | Frequency (%) |
| 887 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42929 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5871 | |
| i | 4547 | 10.6% |
| n | 3791 | 8.8% |
| s | 3346 | 7.8% |
| e | 3304 | 7.7% |
| o | 3151 | 7.3% |
| r | 2359 | 5.5% |
| l | 1995 | 4.6% |
| t | 1071 | 2.5% |
| C | 964 | 2.2% |
| Other values (36) | 12530 |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| ALL | |
|---|---|
| ST | 19 |
| GT | 9 |
| HY | 2 |
| Other values (3) | 4 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.2632 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11316 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ALL |
|---|---|
| 2nd row | ALL |
| 3rd row | |
| 4th row | ALL |
| 5th row | ALL |
Common Values
| Value | Count | Frequency (%) |
| ALL | 3141 | |
| 1825 | ||
| ST | 19 | 0.4% |
| GT | 9 | 0.2% |
| HY | 2 | < 0.1% |
| IC | 2 | < 0.1% |
| CA | 1 | < 0.1% |
| CT | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| all | 3141 | |
| st | 19 | 0.6% |
| gt | 9 | 0.3% |
| hy | 2 | 0.1% |
| ic | 2 | 0.1% |
| ca | 1 | < 0.1% |
| ct | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 6282 | |
| A | 3142 | |
| 1825 | 16.1% | |
| T | 29 | 0.3% |
| S | 19 | 0.2% |
| G | 9 | 0.1% |
| C | 4 | < 0.1% |
| H | 2 | < 0.1% |
| Y | 2 | < 0.1% |
| I | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9491 | |
| Space Separator | 1825 | 16.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 6282 | |
| A | 3142 | |
| T | 29 | 0.3% |
| S | 19 | 0.2% |
| G | 9 | 0.1% |
| C | 4 | < 0.1% |
| H | 2 | < 0.1% |
| Y | 2 | < 0.1% |
| I | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1825 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9491 | |
| Common | 1825 | 16.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 6282 | |
| A | 3142 | |
| T | 29 | 0.3% |
| S | 19 | 0.2% |
| G | 9 | 0.1% |
| C | 4 | < 0.1% |
| H | 2 | < 0.1% |
| Y | 2 | < 0.1% |
| I | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 1825 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11316 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| L | 6282 | |
| A | 3142 | |
| 1825 | 16.1% | |
| T | 29 | 0.3% |
| S | 19 | 0.2% |
| G | 9 | 0.1% |
| C | 4 | < 0.1% |
| H | 2 | < 0.1% |
| Y | 2 | < 0.1% |
| I | 2 | < 0.1% |
| Distinct | 1903 |
|---|---|
| Distinct (%) | 38.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25561.98483 |
| Minimum | -41941 |
|---|---|
| Maximum | 1695623 |
| Zeros | 594 |
| Zeros (%) | 11.9% |
| Negative | 47 |
| Negative (%) | 0.9% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | -41941 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 115.9175 |
| median | 1548 |
| Q3 | 9280 |
| 95-th percentile | 100456 |
| Maximum | 1695623 |
| Range | 1737564 |
| Interquartile range (IQR) | 9164.0825 |
Descriptive statistics
| Standard deviation | 107424.6026 |
|---|---|
| Coefficient of variation (CV) | 4.202514136 |
| Kurtosis | 94.12484749 |
| Mean | 25561.98483 |
| Median Absolute Deviation (MAD) | 1548 |
| Skewness | 8.526353334 |
| Sum | 127809924.1 |
| Variance | 1.154004524 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 594 | 11.9% |
| 34 | 10 | 0.2% |
| 52 | 9 | 0.2% |
| 6 | 9 | 0.2% |
| 1 | 9 | 0.2% |
| 0.35 | 8 | 0.2% |
| 8 | 8 | 0.2% |
| 3 | 8 | 0.2% |
| 13 | 7 | 0.1% |
| 50 | 7 | 0.1% |
| Other values (1893) | 4331 |
| Value | Count | Frequency (%) |
| -41941 | 3 | |
| -16523 | 3 | |
| -15983 | 2 | < 0.1% |
| -7238 | 1 | < 0.1% |
| -190 | 6 | |
| -167 | 3 | |
| -120 | 3 | |
| -92 | 3 | |
| -72 | 3 | |
| -63 | 3 |
| Value | Count | Frequency (%) |
| 1695623 | 1 | < 0.1% |
| 1694082 | 2 | |
| 1653345 | 1 | < 0.1% |
| 1649251 | 2 | |
| 1219209 | 1 | < 0.1% |
| 1217532 | 2 | |
| 1072678 | 1 | < 0.1% |
| 913414 | 1 | < 0.1% |
| 878063 | 3 | |
| 850228 | 1 | < 0.1% |
| Distinct | 1923 |
|---|---|
| Distinct (%) | 38.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26500.2457 |
| Minimum | -43238.14 |
|---|---|
| Maximum | 1748064.95 |
| Zeros | 606 |
| Zeros (%) | 12.1% |
| Negative | 30 |
| Negative (%) | 0.6% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | -43238.14 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 120.3925 |
| median | 1584.28 |
| Q3 | 9981.265 |
| 95-th percentile | 103009.28 |
| Maximum | 1748064.95 |
| Range | 1791303.09 |
| Interquartile range (IQR) | 9860.8725 |
Descriptive statistics
| Standard deviation | 111413.2384 |
|---|---|
| Coefficient of variation (CV) | 4.204234168 |
| Kurtosis | 93.81610436 |
| Mean | 26500.2457 |
| Median Absolute Deviation (MAD) | 1584.28 |
| Skewness | 8.527532021 |
| Sum | 132501228.5 |
| Variance | 1.241290969 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 606 | 12.1% |
| 6.12 | 9 | 0.2% |
| 34.69 | 7 | 0.1% |
| 3.06 | 7 | 0.1% |
| 61.71 | 6 | 0.1% |
| 0.5 | 6 | 0.1% |
| 14753.54 | 6 | 0.1% |
| 1288.26 | 6 | 0.1% |
| 104.31 | 6 | 0.1% |
| 0.36 | 6 | 0.1% |
| Other values (1913) | 4335 |
| Value | Count | Frequency (%) |
| -43238.14 | 3 | 0.1% |
| -17034.02 | 3 | 0.1% |
| -195.88 | 6 | 0.1% |
| -123.71 | 3 | 0.1% |
| -64.29 | 3 | 0.1% |
| -57.14 | 3 | 0.1% |
| -21.43 | 3 | 0.1% |
| -17.35 | 3 | 0.1% |
| -6.12 | 3 | 0.1% |
| 0 | 606 |
| Value | Count | Frequency (%) |
| 1748064.95 | 1 | < 0.1% |
| 1746476.29 | 2 | |
| 1704479.38 | 1 | < 0.1% |
| 1700258.76 | 2 | |
| 1308878.99 | 1 | < 0.1% |
| 1307078.65 | 2 | |
| 1105853.61 | 1 | < 0.1% |
| 941663.92 | 1 | < 0.1% |
| 905219.59 | 3 | |
| 876523.71 | 1 | < 0.1% |
| Distinct | 1197 |
|---|---|
| Distinct (%) | 32.1% |
| Missing | 1269 |
| Missing (%) | 25.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 105171.0708 |
| Minimum | 0 |
|---|---|
| Maximum | 8920563 |
| Zeros | 1320 |
| Zeros (%) | 26.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 959 |
| Q3 | 28750.26 |
| 95-th percentile | 450905 |
| Maximum | 8920563 |
| Range | 8920563 |
| Interquartile range (IQR) | 28750.26 |
Descriptive statistics
| Standard deviation | 491967.9015 |
|---|---|
| Coefficient of variation (CV) | 4.67778732 |
| Kurtosis | 154.1507362 |
| Mean | 105171.0708 |
| Median Absolute Deviation (MAD) | 959 |
| Skewness | 10.92355893 |
| Sum | 392393265.2 |
| Variance | 2.420324161 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1320 | |
| 2 | 9 | 0.2% |
| 38 | 8 | 0.2% |
| 1 | 8 | 0.2% |
| 10 | 6 | 0.1% |
| 12 | 6 | 0.1% |
| 4 | 6 | 0.1% |
| 203 | 6 | 0.1% |
| 5 | 6 | 0.1% |
| 6 | 6 | 0.1% |
| Other values (1187) | 2350 | |
| (Missing) | 1269 |
| Value | Count | Frequency (%) |
| 0 | 1320 | |
| 1 | 8 | 0.2% |
| 2 | 9 | 0.2% |
| 2.71 | 2 | < 0.1% |
| 3 | 4 | 0.1% |
| 4 | 6 | 0.1% |
| 5 | 6 | 0.1% |
| 6 | 6 | 0.1% |
| 7 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 8920563 | 2 | |
| 8640100 | 2 | |
| 6872410 | 1 | |
| 6674931 | 2 | |
| 6613203 | 2 | |
| 4058493 | 2 | |
| 3582772 | 1 | |
| 3103938 | 2 | |
| 3092731 | 2 | |
| 3071941 | 2 |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2326 |
| Missing (%) | 46.5% |
| Memory size | 39.2 KiB |
| MMBtu per Mcf | |
|---|---|
| MMBtu per barrels | |
| MMBtu per short tons |
Length
| Max length | 20 |
|---|---|
| Median length | 17 |
| Mean length | 15.87808527 |
| Min length | 13 |
Characters and Unicode
| Total characters | 42458 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MMBtu per barrels |
|---|---|
| 2nd row | MMBtu per barrels |
| 3rd row | MMBtu per barrels |
| 4th row | MMBtu per barrels |
| 5th row | MMBtu per barrels |
Common Values
| Value | Count | Frequency (%) |
| MMBtu per Mcf | 1185 | |
| MMBtu per barrels | 909 | 18.2% |
| MMBtu per short tons | 580 | 11.6% |
| (Missing) | 2326 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| mmbtu | 2674 | |
| per | 2674 | |
| mcf | 1185 | |
| barrels | 909 | 10.6% |
| short | 580 | 6.7% |
| tons | 580 | 6.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 6533 | |
| 5928 | ||
| r | 5072 | |
| t | 3834 | |
| e | 3583 | |
| u | 2674 | |
| p | 2674 | |
| B | 2674 | |
| s | 2069 | 4.9% |
| c | 1185 | 2.8% |
| Other values (7) | 6232 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27323 | |
| Uppercase Letter | 9207 | 21.7% |
| Space Separator | 5928 | 14.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 5072 | |
| t | 3834 | |
| e | 3583 | |
| u | 2674 | |
| p | 2674 | |
| s | 2069 | |
| c | 1185 | 4.3% |
| f | 1185 | 4.3% |
| o | 1160 | 4.2% |
| b | 909 | 3.3% |
| Other values (4) | 2978 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 6533 | |
| B | 2674 |
Space Separator
| Value | Count | Frequency (%) |
| 5928 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36530 | |
| Common | 5928 | 14.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 6533 | |
| r | 5072 | |
| t | 3834 | |
| e | 3583 | |
| u | 2674 | |
| p | 2674 | |
| B | 2674 | |
| s | 2069 | 5.7% |
| c | 1185 | 3.2% |
| f | 1185 | 3.2% |
| Other values (6) | 5047 |
Common
| Value | Count | Frequency (%) |
| 5928 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42458 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 6533 | |
| 5928 | ||
| r | 5072 | |
| t | 3834 | |
| e | 3583 | |
| u | 2674 | |
| p | 2674 | |
| B | 2674 | |
| s | 2069 | 4.9% |
| c | 1185 | 2.8% |
| Other values (7) | 6232 |
| Distinct | 1933 |
|---|---|
| Distinct (%) | 38.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 317364.4852 |
| Minimum | -10.17 |
|---|---|
| Maximum | 16319726 |
| Zeros | 597 |
| Zeros (%) | 11.9% |
| Negative | 3 |
| Negative (%) | 0.1% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | -10.17 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1608.5 |
| median | 25266.1 |
| Q3 | 179287.88 |
| 95-th percentile | 1235527 |
| Maximum | 16319726 |
| Range | 16319736.17 |
| Interquartile range (IQR) | 177679.38 |
Descriptive statistics
| Standard deviation | 1116123.194 |
|---|---|
| Coefficient of variation (CV) | 3.516849698 |
| Kurtosis | 75.42235507 |
| Mean | 317364.4852 |
| Median Absolute Deviation (MAD) | 25266.1 |
| Skewness | 7.60439878 |
| Sum | 1586822426 |
| Variance | 1.245730984 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 597 | 11.9% |
| 10 | 9 | 0.2% |
| 6 | 8 | 0.2% |
| 1.46 | 6 | 0.1% |
| 12 | 6 | 0.1% |
| 664 | 6 | 0.1% |
| 760 | 6 | 0.1% |
| 5.08 | 6 | 0.1% |
| 6086.14 | 6 | 0.1% |
| 15 | 5 | 0.1% |
| Other values (1923) | 4345 |
| Value | Count | Frequency (%) |
| -10.17 | 3 | 0.1% |
| 0 | 597 | |
| 1.46 | 6 | 0.1% |
| 2 | 3 | 0.1% |
| 2.31 | 3 | 0.1% |
| 3.63 | 3 | 0.1% |
| 4 | 3 | 0.1% |
| 4.01 | 3 | 0.1% |
| 5.08 | 6 | 0.1% |
| 6 | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 16319726 | 1 | < 0.1% |
| 16285564 | 2 | |
| 16198257 | 1 | < 0.1% |
| 16185812 | 2 | |
| 12092462 | 1 | < 0.1% |
| 12068953 | 2 | |
| 10368496 | 1 | < 0.1% |
| 9168734 | 3 | |
| 8885167 | 1 | < 0.1% |
| 8845523 | 1 | < 0.1% |
| Distinct | 1206 |
|---|---|
| Distinct (%) | 32.3% |
| Missing | 1269 |
| Missing (%) | 25.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76556.90803 |
| Minimum | 0 |
|---|---|
| Maximum | 6872410 |
| Zeros | 1343 |
| Zeros (%) | 26.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 440 |
| Q3 | 15564.36 |
| 95-th percentile | 332846 |
| Maximum | 6872410 |
| Range | 6872410 |
| Interquartile range (IQR) | 15564.36 |
Descriptive statistics
| Standard deviation | 366391.4347 |
|---|---|
| Coefficient of variation (CV) | 4.785870329 |
| Kurtosis | 159.5527835 |
| Mean | 76556.90803 |
| Median Absolute Deviation (MAD) | 440 |
| Skewness | 10.95510124 |
| Sum | 285633823.9 |
| Variance | 1.342426834 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1343 | |
| 2 | 9 | 0.2% |
| 1 | 8 | 0.2% |
| 38 | 8 | 0.2% |
| 10 | 7 | 0.1% |
| 12 | 6 | 0.1% |
| 4 | 6 | 0.1% |
| 162 | 5 | 0.1% |
| 107 | 4 | 0.1% |
| 24 | 4 | 0.1% |
| Other values (1196) | 2331 | |
| (Missing) | 1269 |
| Value | Count | Frequency (%) |
| 0 | 1343 | |
| 0.24 | 2 | < 0.1% |
| 0.38 | 2 | < 0.1% |
| 1 | 8 | 0.2% |
| 1.04 | 2 | < 0.1% |
| 1.13 | 2 | < 0.1% |
| 1.93 | 2 | < 0.1% |
| 2 | 9 | 0.2% |
| 2.71 | 2 | < 0.1% |
| 3 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 6872410 | 1 | |
| 6674931 | 2 | |
| 6484218.56 | 2 | |
| 4045125 | 2 | |
| 3582772 | 1 | |
| 3399147.44 | 2 | |
| 2770564.33 | 2 | |
| 2526577 | 1 | |
| 2428558.65 | 2 | |
| 2422422 | 2 |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2326 |
| Missing (%) | 46.5% |
| Memory size | 39.2 KiB |
| Mcf | |
|---|---|
| barrels | |
| short tons |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 5.878085266 |
| Min length | 3 |
Characters and Unicode
| Total characters | 15718 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | barrels |
|---|---|
| 2nd row | barrels |
| 3rd row | barrels |
| 4th row | barrels |
| 5th row | barrels |
Common Values
| Value | Count | Frequency (%) |
| Mcf | 1185 | |
| barrels | 909 | 18.2% |
| short tons | 580 | 11.6% |
| (Missing) | 2326 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| mcf | 1185 | |
| barrels | 909 | |
| short | 580 | |
| tons | 580 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2398 | |
| s | 2069 | |
| M | 1185 | |
| c | 1185 | |
| f | 1185 | |
| o | 1160 | |
| t | 1160 | |
| b | 909 | 5.8% |
| a | 909 | 5.8% |
| e | 909 | 5.8% |
| Other values (4) | 2649 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13953 | |
| Uppercase Letter | 1185 | 7.5% |
| Space Separator | 580 | 3.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2398 | |
| s | 2069 | |
| c | 1185 | |
| f | 1185 | |
| o | 1160 | |
| t | 1160 | |
| b | 909 | 6.5% |
| a | 909 | 6.5% |
| e | 909 | 6.5% |
| l | 909 | 6.5% |
| Other values (2) | 1160 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1185 |
Space Separator
| Value | Count | Frequency (%) |
| 580 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15138 | |
| Common | 580 | 3.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2398 | |
| s | 2069 | |
| M | 1185 | |
| c | 1185 | |
| f | 1185 | |
| o | 1160 | |
| t | 1160 | |
| b | 909 | 6.0% |
| a | 909 | 6.0% |
| e | 909 | 6.0% |
| Other values (3) | 2069 |
Common
| Value | Count | Frequency (%) |
| 580 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15718 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2398 | |
| s | 2069 | |
| M | 1185 | |
| c | 1185 | |
| f | 1185 | |
| o | 1160 | |
| t | 1160 | |
| b | 909 | 5.8% |
| a | 909 | 5.8% |
| e | 909 | 5.8% |
| Other values (4) | 2649 |
| Distinct | 1923 |
|---|---|
| Distinct (%) | 38.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 265433.5286 |
| Minimum | -66828 |
|---|---|
| Maximum | 16319726 |
| Zeros | 596 |
| Zeros (%) | 11.9% |
| Negative | 25 |
| Negative (%) | 0.5% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | -66828 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1194 |
| median | 15755 |
| Q3 | 112889.46 |
| 95-th percentile | 896264.3885 |
| Maximum | 16319726 |
| Range | 16386554 |
| Interquartile range (IQR) | 111695.46 |
Descriptive statistics
| Standard deviation | 1090511.728 |
|---|---|
| Coefficient of variation (CV) | 4.108417404 |
| Kurtosis | 84.3254756 |
| Mean | 265433.5286 |
| Median Absolute Deviation (MAD) | 15755 |
| Skewness | 8.185109812 |
| Sum | 1327167643 |
| Variance | 1.18921583 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 596 | 11.9% |
| 10 | 10 | 0.2% |
| 6 | 8 | 0.2% |
| 1384 | 7 | 0.1% |
| 664 | 6 | 0.1% |
| 5.08 | 6 | 0.1% |
| 12 | 6 | 0.1% |
| 6086.14 | 6 | 0.1% |
| 63 | 6 | 0.1% |
| 760 | 6 | 0.1% |
| Other values (1913) | 4343 |
| Value | Count | Frequency (%) |
| -66828 | 1 | < 0.1% |
| -63071 | 2 | |
| -37007 | 2 | |
| -34717 | 3 | |
| -25906 | 3 | |
| -3775 | 3 | |
| -3446 | 2 | |
| -2612 | 2 | |
| -815 | 2 | |
| -330 | 2 |
| Value | Count | Frequency (%) |
| 16319726 | 1 | < 0.1% |
| 16285564 | 2 | |
| 16198257 | 1 | < 0.1% |
| 16185812 | 2 | |
| 12092462 | 1 | < 0.1% |
| 12068953 | 2 | |
| 10368496 | 1 | < 0.1% |
| 9168734 | 3 | |
| 8885167 | 1 | < 0.1% |
| 8845523 | 1 | < 0.1% |
| Distinct | 506 |
|---|---|
| Distinct (%) | 21.0% |
| Missing | 2589 |
| Missing (%) | 51.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.144659477 |
| Minimum | 0.06 |
|---|---|
| Maximum | 33.92 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0.06 |
|---|---|
| 5-th percentile | 0.56 |
| Q1 | 1.02 |
| median | 5.802 |
| Q3 | 8.4 |
| 95-th percentile | 25.5 |
| Maximum | 33.92 |
| Range | 33.86 |
| Interquartile range (IQR) | 7.38 |
Descriptive statistics
| Standard deviation | 8.062987251 |
|---|---|
| Coefficient of variation (CV) | 1.128533456 |
| Kurtosis | 0.9500839915 |
| Mean | 7.144659477 |
| Median Absolute Deviation (MAD) | 4.775 |
| Skewness | 1.415656485 |
| Sum | 17225.774 |
| Variance | 65.01176342 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 162 | 3.2% |
| 1.03 | 154 | 3.1% |
| 1.02 | 119 | 2.4% |
| 1.01 | 80 | 1.6% |
| 1.04 | 66 | 1.3% |
| 1.027 | 55 | 1.1% |
| 6.3 | 42 | 0.8% |
| 1.05 | 37 | 0.7% |
| 5.838 | 35 | 0.7% |
| 5.8 | 29 | 0.6% |
| Other values (496) | 1632 | |
| (Missing) | 2589 |
| Value | Count | Frequency (%) |
| 0.06 | 2 | |
| 0.08 | 2 | |
| 0.09 | 4 | |
| 0.092 | 2 | |
| 0.1 | 2 | |
| 0.27 | 4 | |
| 0.3 | 2 | |
| 0.36 | 4 | |
| 0.369 | 2 | |
| 0.37 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 33.92 | 2 | < 0.1% |
| 33 | 2 | < 0.1% |
| 32.667 | 2 | < 0.1% |
| 32 | 2 | < 0.1% |
| 31.599 | 1 | < 0.1% |
| 31 | 6 | |
| 30.999 | 2 | < 0.1% |
| 30.9 | 2 | < 0.1% |
| 30.593 | 2 | < 0.1% |
| 30.4 | 2 | < 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2326 |
| Missing (%) | 46.5% |
| Memory size | 39.2 KiB |
| MMBtu per Mcf | |
|---|---|
| MMBtu per barrels | |
| MMBtu per short tons |
Length
| Max length | 20 |
|---|---|
| Median length | 17 |
| Mean length | 15.87808527 |
| Min length | 13 |
Characters and Unicode
| Total characters | 42458 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MMBtu per barrels |
|---|---|
| 2nd row | MMBtu per barrels |
| 3rd row | MMBtu per barrels |
| 4th row | MMBtu per barrels |
| 5th row | MMBtu per barrels |
Common Values
| Value | Count | Frequency (%) |
| MMBtu per Mcf | 1185 | |
| MMBtu per barrels | 909 | 18.2% |
| MMBtu per short tons | 580 | 11.6% |
| (Missing) | 2326 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| mmbtu | 2674 | |
| per | 2674 | |
| mcf | 1185 | |
| barrels | 909 | 10.6% |
| short | 580 | 6.7% |
| tons | 580 | 6.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 6533 | |
| 5928 | ||
| r | 5072 | |
| t | 3834 | |
| e | 3583 | |
| u | 2674 | |
| p | 2674 | |
| B | 2674 | |
| s | 2069 | 4.9% |
| c | 1185 | 2.8% |
| Other values (7) | 6232 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27323 | |
| Uppercase Letter | 9207 | 21.7% |
| Space Separator | 5928 | 14.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 5072 | |
| t | 3834 | |
| e | 3583 | |
| u | 2674 | |
| p | 2674 | |
| s | 2069 | |
| c | 1185 | 4.3% |
| f | 1185 | 4.3% |
| o | 1160 | 4.2% |
| b | 909 | 3.3% |
| Other values (4) | 2978 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 6533 | |
| B | 2674 |
Space Separator
| Value | Count | Frequency (%) |
| 5928 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36530 | |
| Common | 5928 | 14.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 6533 | |
| r | 5072 | |
| t | 3834 | |
| e | 3583 | |
| u | 2674 | |
| p | 2674 | |
| B | 2674 | |
| s | 2069 | 5.7% |
| c | 1185 | 3.2% |
| f | 1185 | 3.2% |
| Other values (6) | 5047 |
Common
| Value | Count | Frequency (%) |
| 5928 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42458 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 6533 | |
| 5928 | ||
| r | 5072 | |
| t | 3834 | |
| e | 3583 | |
| u | 2674 | |
| p | 2674 | |
| B | 2674 | |
| s | 2069 | 4.9% |
| c | 1185 | 2.8% |
| Other values (7) | 6232 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| megawatthours |
|---|
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 65000 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | megawatthours |
|---|---|
| 2nd row | megawatthours |
| 3rd row | megawatthours |
| 4th row | megawatthours |
| 5th row | megawatthours |
Common Values
| Value | Count | Frequency (%) |
| megawatthours | 5000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| megawatthours | 5000 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10000 | |
| t | 10000 | |
| m | 5000 | |
| e | 5000 | |
| g | 5000 | |
| w | 5000 | |
| h | 5000 | |
| o | 5000 | |
| u | 5000 | |
| r | 5000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65000 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10000 | |
| t | 10000 | |
| m | 5000 | |
| e | 5000 | |
| g | 5000 | |
| w | 5000 | |
| h | 5000 | |
| o | 5000 | |
| u | 5000 | |
| r | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 65000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10000 | |
| t | 10000 | |
| m | 5000 | |
| e | 5000 | |
| g | 5000 | |
| w | 5000 | |
| h | 5000 | |
| o | 5000 | |
| u | 5000 | |
| r | 5000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10000 | |
| t | 10000 | |
| m | 5000 | |
| e | 5000 | |
| g | 5000 | |
| w | 5000 | |
| h | 5000 | |
| o | 5000 | |
| u | 5000 | |
| r | 5000 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| megawatthours |
|---|
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 65000 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | megawatthours |
|---|---|
| 2nd row | megawatthours |
| 3rd row | megawatthours |
| 4th row | megawatthours |
| 5th row | megawatthours |
Common Values
| Value | Count | Frequency (%) |
| megawatthours | 5000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| megawatthours | 5000 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10000 | |
| t | 10000 | |
| m | 5000 | |
| e | 5000 | |
| g | 5000 | |
| w | 5000 | |
| h | 5000 | |
| o | 5000 | |
| u | 5000 | |
| r | 5000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65000 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10000 | |
| t | 10000 | |
| m | 5000 | |
| e | 5000 | |
| g | 5000 | |
| w | 5000 | |
| h | 5000 | |
| o | 5000 | |
| u | 5000 | |
| r | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 65000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10000 | |
| t | 10000 | |
| m | 5000 | |
| e | 5000 | |
| g | 5000 | |
| w | 5000 | |
| h | 5000 | |
| o | 5000 | |
| u | 5000 | |
| r | 5000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10000 | |
| t | 10000 | |
| m | 5000 | |
| e | 5000 | |
| g | 5000 | |
| w | 5000 | |
| h | 5000 | |
| o | 5000 | |
| u | 5000 | |
| r | 5000 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| MMBtu |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 25000 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MMBtu |
|---|---|
| 2nd row | MMBtu |
| 3rd row | MMBtu |
| 4th row | MMBtu |
| 5th row | MMBtu |
Common Values
| Value | Count | Frequency (%) |
| MMBtu | 5000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| mmbtu | 5000 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 10000 | |
| B | 5000 | |
| t | 5000 | |
| u | 5000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15000 | |
| Lowercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 10000 | |
| B | 5000 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 5000 | |
| u | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 10000 | |
| B | 5000 | |
| t | 5000 | |
| u | 5000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 10000 | |
| B | 5000 | |
| t | 5000 | |
| u | 5000 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| MMBtu |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 25000 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MMBtu |
|---|---|
| 2nd row | MMBtu |
| 3rd row | MMBtu |
| 4th row | MMBtu |
| 5th row | MMBtu |
Common Values
| Value | Count | Frequency (%) |
| MMBtu | 5000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| mmbtu | 5000 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 10000 | |
| B | 5000 | |
| t | 5000 | |
| u | 5000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15000 | |
| Lowercase Letter | 10000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 10000 | |
| B | 5000 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 5000 | |
| u | 5000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 10000 | |
| B | 5000 | |
| t | 5000 | |
| u | 5000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 10000 | |
| B | 5000 | |
| t | 5000 | |
| u | 5000 |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| Unnamed: 0 | period | plantCode | plantName | fuel2002 | fuelTypeDescription | state | stateDescription | primeMover | generation | gross-generation | total-consumption | total-consumption-units | total-consumption-btu | consumption-for-eg | consumption-for-eg-units | consumption-for-eg-btu | average-heat-content | average-heat-content-units | generation-units | gross-generation-units | total-consumption-btu-units | consumption-for-eg-btu-units | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 2001-04 | 6311 | Chevak | ALL | Total | AK | Alaska | ALL | 135.88 | 138.65 | NaN | NaN | 1457.0 | NaN | NaN | 1457.0 | NaN | NaN | megawatthours | megawatthours | MMBtu | MMBtu |
| 1 | 1 | 2001-04 | 6311 | Chevak | DFO | Distillate Fuel Oil | AK | Alaska | ALL | 135.88 | 138.65 | 250.0 | MMBtu per barrels | 1457.0 | 250.0 | barrels | 1457.0 | 5.828 | MMBtu per barrels | megawatthours | megawatthours | MMBtu | MMBtu |
| 2 | 2 | 2001-04 | 6312 | EEK | DFO | Distillate Fuel Oil | AK | Alaska | 50.89 | 51.93 | 100.0 | MMBtu per barrels | 583.0 | 100.0 | barrels | 583.0 | 5.830 | MMBtu per barrels | megawatthours | megawatthours | MMBtu | MMBtu | |
| 3 | 3 | 2001-04 | 6312 | EEK | ALL | Total | AK | Alaska | ALL | 50.89 | 51.93 | NaN | NaN | 583.0 | NaN | NaN | 583.0 | NaN | NaN | megawatthours | megawatthours | MMBtu | MMBtu |
| 4 | 4 | 2001-04 | 6312 | EEK | DFO | Distillate Fuel Oil | AK | Alaska | ALL | 50.89 | 51.93 | 100.0 | MMBtu per barrels | 583.0 | 100.0 | barrels | 583.0 | 5.830 | MMBtu per barrels | megawatthours | megawatthours | MMBtu | MMBtu |
| 5 | 5 | 2001-04 | 6313 | ELIM | DFO | Distillate Fuel Oil | AK | Alaska | 76.00 | 77.55 | 133.0 | MMBtu per barrels | 775.0 | 133.0 | barrels | 775.0 | 5.827 | MMBtu per barrels | megawatthours | megawatthours | MMBtu | MMBtu | |
| 6 | 6 | 2001-04 | 6313 | ELIM | ALL | Total | AK | Alaska | ALL | 76.00 | 77.55 | NaN | NaN | 775.0 | NaN | NaN | 775.0 | NaN | NaN | megawatthours | megawatthours | MMBtu | MMBtu |
| 7 | 7 | 2001-04 | 6313 | ELIM | DFO | Distillate Fuel Oil | AK | Alaska | ALL | 76.00 | 77.55 | 133.0 | MMBtu per barrels | 775.0 | 133.0 | barrels | 775.0 | 5.827 | MMBtu per barrels | megawatthours | megawatthours | MMBtu | MMBtu |
| 8 | 8 | 2001-04 | 6314 | Emmonak | DFO | Distillate Fuel Oil | AK | Alaska | 177.04 | 180.65 | 340.0 | MMBtu per barrels | 1982.0 | 340.0 | barrels | 1982.0 | 5.829 | MMBtu per barrels | megawatthours | megawatthours | MMBtu | MMBtu | |
| 9 | 9 | 2001-04 | 6314 | Emmonak | ALL | Total | AK | Alaska | ALL | 177.04 | 180.65 | NaN | NaN | 1982.0 | NaN | NaN | 1982.0 | NaN | NaN | megawatthours | megawatthours | MMBtu | MMBtu |
Last rows
| Unnamed: 0 | period | plantCode | plantName | fuel2002 | fuelTypeDescription | state | stateDescription | primeMover | generation | gross-generation | total-consumption | total-consumption-units | total-consumption-btu | consumption-for-eg | consumption-for-eg-units | consumption-for-eg-btu | average-heat-content | average-heat-content-units | generation-units | gross-generation-units | total-consumption-btu-units | consumption-for-eg-btu-units | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4990 | 4990 | 2001-04 | 54526 | Lyonsdale Power Co LLC | WDS | Wood Waste Solids | NY | New York | ALL | 8645.0 | 9188.01 | 13549.0 | NaN | 123702.0 | 13549.0 | NaN | 123702.0 | 9.13 | NaN | megawatthours | megawatthours | MMBtu | MMBtu |
| 4991 | 4991 | 2001-04 | 54529 | Ridge | LFG | Municiapl Landfill Gas | FL | Florida | 296.0 | 317.77 | 18040.0 | MMBtu per Mcf | 10102.0 | 18040.0 | Mcf | 10102.0 | 0.56 | MMBtu per Mcf | megawatthours | megawatthours | MMBtu | MMBtu | |
| 4992 | 4992 | 2001-04 | 54529 | Ridge | TDF | Other | FL | Florida | 6376.0 | 6844.94 | 7152.0 | MMBtu per short tons | 194534.0 | 7152.0 | short tons | 194534.0 | 27.20 | MMBtu per short tons | megawatthours | megawatthours | MMBtu | MMBtu | |
| 4993 | 4993 | 2001-04 | 54529 | Ridge | WDS | Wood Waste Solids | FL | Florida | 9289.0 | 9972.18 | 16671.0 | NaN | 133368.0 | 16671.0 | NaN | 133368.0 | 8.00 | NaN | megawatthours | megawatthours | MMBtu | MMBtu | |
| 4994 | 4994 | 2001-04 | 54529 | Ridge | ALL | Total | FL | Florida | ALL | 15961.0 | 17134.89 | NaN | NaN | 338004.0 | NaN | NaN | 338004.0 | NaN | NaN | megawatthours | megawatthours | MMBtu | MMBtu |
| 4995 | 4995 | 2001-04 | 54529 | Ridge | LFG | Municiapl Landfill Gas | FL | Florida | ALL | 296.0 | 317.77 | 18040.0 | MMBtu per Mcf | 10102.0 | 18040.0 | Mcf | 10102.0 | 0.56 | MMBtu per Mcf | megawatthours | megawatthours | MMBtu | MMBtu |
| 4996 | 4996 | 2001-04 | 54529 | Ridge | TDF | Other | FL | Florida | ALL | 6376.0 | 6844.94 | 7152.0 | MMBtu per short tons | 194534.0 | 7152.0 | short tons | 194534.0 | 27.20 | MMBtu per short tons | megawatthours | megawatthours | MMBtu | MMBtu |
| 4997 | 4997 | 2001-12 | 55323 | City of Tacoma Steam Plant | MSN | Other | WA | Washington | 0.0 | 0.00 | 0.0 | MMBtu per short tons | 0.0 | 0.0 | short tons | 0.0 | NaN | MMBtu per short tons | megawatthours | megawatthours | MMBtu | MMBtu | |
| 4998 | 4998 | 2001-12 | 55323 | City of Tacoma Steam Plant | NG | Natural Gas | WA | Washington | 0.0 | 0.00 | 0.0 | MMBtu per Mcf | 0.0 | 0.0 | Mcf | 0.0 | NaN | MMBtu per Mcf | megawatthours | megawatthours | MMBtu | MMBtu | |
| 4999 | 4999 | 2001-12 | 55323 | City of Tacoma Steam Plant | OBS | other renewables | WA | Washington | 0.0 | 0.00 | 0.0 | MMBtu per short tons | 0.0 | 0.0 | short tons | 0.0 | NaN | MMBtu per short tons | megawatthours | megawatthours | MMBtu | MMBtu |