Dataset statistics
Number of variables | 28 |
---|---|
Number of observations | 10000 |
Missing cells | 186 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.4 MiB |
Average record size in memory | 147.0 B |
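The percentage and per-record figures above follow from the raw counts. The sketch below reproduces that arithmetic under two assumptions: that the missing-cell percentage is missing cells divided by variables times observations, and that the average record size is total memory divided by the number of observations. The variable names are illustrative, and because 1.4 MiB is itself a rounded figure, the per-record result only approximately matches the reported 147.0 B.

```python
# Raw counts copied from the dataset statistics table.
n_variables = 28
n_observations = 10_000
missing_cells = 186
total_size_mib = 1.4  # already rounded in the report

# Assumption: a "cell" is one variable value in one observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.2f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Rounded to one decimal place, the missing-cell share matches the 0.1% shown in the table.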
Variable types
Categorical | 10 |
---|---|
Numeric | 6 |
Boolean | 12 |
Alerts
has_materialtype has constant value "True" | Constant |
material_id has a high cardinality: 1230 distinct values | High cardinality |
material_group has a high cardinality: 132 distinct values | High cardinality |
material_group.1 has a high cardinality: 149 distinct values | High cardinality |
part_desc has a high cardinality: 8301 distinct values | High cardinality |
Date has a high cardinality: 336 distinct values | High cardinality |
m_weight is highly correlated with area1 and 1 other fields | High correlation |
has_coatings.1 is highly correlated with has_matlspecs.1 | High correlation |
has_matlspecs.1 is highly correlated with has_coatings.1 | High correlation |
area1 is highly correlated with m_weight and 1 other fields | High correlation |
area2 is highly correlated with m_weight and 1 other fields | High correlation |
area3 is highly correlated with area4 | High correlation |
area4 is highly correlated with area3 | High correlation |
m_weight is highly correlated with area2 | High correlation |
has_coatings.1 is highly correlated with has_matlspecs.1 | High correlation |
has_matlspecs.1 is highly correlated with has_coatings.1 | High correlation |
area1 is highly correlated with area2 | High correlation |
area2 is highly correlated with m_weight and 1 other fields | High correlation |
area3 is highly correlated with area4 | High correlation |
area4 is highly correlated with area3 | High correlation |
has_coatings.1 is highly correlated with has_matlspecs.1 | High correlation |
has_matlspecs.1 is highly correlated with has_coatings.1 | High correlation |
area1 is highly correlated with area2 | High correlation |
area2 is highly correlated with area1 | High correlation |
area3 is highly correlated with area4 | High correlation |
area4 is highly correlated with area3 | High correlation |
has_coatings.1 is highly correlated with has_matlspecs.1 and 1 other fields | High correlation |
qty_replaced is highly correlated with has_materialtype | High correlation |
has_matlspecs.1 is highly correlated with has_coatings.1 and 1 other fields | High correlation |
has_qspecs is highly correlated with has_materialtype | High correlation |
has_weldspecs is highly correlated with has_materialtype | High correlation |
surface_matl is highly correlated with has_materialtype | High correlation |
surface_matl.1 is highly correlated with has_materialtype | High correlation |
rig_plant is highly correlated with has_materialtype | High correlation |
has_coatings is highly correlated with has_materialtype | High correlation |
has_qspecs.1 is highly correlated with has_materialtype | High correlation |
has_documents is highly correlated with has_materialtype | High correlation |
has_weldspecs.1 is highly correlated with has_materialtype | High correlation |
material_type is highly correlated with has_materialtype | High correlation |
has_materialtype is highly correlated with has_coatings.1 and 15 other fields | High correlation |
material_type.1 is highly correlated with has_materialtype | High correlation |
has_matlspecs is highly correlated with has_materialtype | High correlation |
has_documents.1 is highly correlated with has_materialtype | High correlation |
m_weight is highly correlated with area1 and 1 other fields | High correlation |
has_coatings.1 is highly correlated with has_matlspecs.1 | High correlation |
has_matlspecs.1 is highly correlated with has_coatings.1 | High correlation |
has_weldspecs.1 is highly correlated with area4 | High correlation |
area1 is highly correlated with m_weight and 1 other fields | High correlation |
area2 is highly correlated with m_weight and 1 other fields | High correlation |
area3 is highly correlated with area4 | High correlation |
area4 is highly correlated with has_weldspecs.1 and 1 other fields | High correlation |
part_desc is uniformly distributed | Uniform |
weight has 313 (3.1%) zeros | Zeros |
Reproduction
Analysis started | 2022-06-09 13:53:22.548177 |
---|---|
Analysis finished | 2022-06-09 13:53:31.909940 |
Duration | 9.36 seconds |
Software version | pandas-profiling v3.2.0 |
Distinct | 1230 |
---|---|
Distinct (%) | 12.3% |
Missing | 3 |
Missing (%) | < 0.1% |
Memory size | 78.2 KiB |
M1111114811 | 144 |
---|---|
M1111133284 | 135 |
M1111181242 | 101 |
M182878 | 97 |
M1111145389 | 95 |
Other values (1225) |
Length
Max length | 14 |
---|---|
Median length | 11 |
Mean length | 10.35460638 |
Min length | 7 |
Characters and Unicode
Total characters | 103515 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 272 |
---|---|
Unique (%) | 2.7% |
Sample
1st row | 18-151-111 |
---|---|
2nd row | 18-151-111 |
3rd row | 18-187-411 |
4th row | 18-187-411 |
5th row | 18-222-291 |
Common Values
Value | Count | Frequency (%) |
M1111114811 | 144 | 1.4% |
M1111133284 | 135 | 1.4% |
M1111181242 | 101 | 1.0% |
M182878 | 97 | 1.0% |
M1111145389 | 95 | 0.9% |
M1111126145 | 93 | 0.9% |
M151323 | 89 | 0.9% |
M1111142726 | 88 | 0.9% |
M1111161879 | 84 | 0.8% |
M1111131151 | 80 | 0.8% |
Other values (1220) | 8991 |
Length
Value | Count | Frequency (%) |
m1111114811 | 144 | 1.4% |
m1111133284 | 135 | 1.4% |
m1111181242 | 101 | 1.0% |
m182878 | 97 | 1.0% |
m1111145389 | 95 | 1.0% |
m1111126145 | 93 | 0.9% |
m151323 | 89 | 0.9% |
m1111142726 | 88 | 0.9% |
m1111161879 | 84 | 0.8% |
m1111131151 | 80 | 0.8% |
Other values (1220) | 8991 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 52052 | |
M | 9845 | 9.5% |
2 | 5848 | 5.6% |
4 | 5567 | 5.4% |
5 | 5191 | 5.0% |
7 | 5174 | 5.0% |
3 | 5127 | 5.0% |
8 | 4698 | 4.5% |
9 | 4581 | 4.4% |
6 | 4320 | 4.2% |
Other values (2) | 1112 | 1.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 92558 | |
Uppercase Letter | 9883 | 9.5% |
Dash Punctuation | 1074 | 1.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 52052 | |
2 | 5848 | 6.3% |
4 | 5567 | 6.0% |
5 | 5191 | 5.6% |
7 | 5174 | 5.6% |
3 | 5127 | 5.5% |
8 | 4698 | 5.1% |
9 | 4581 | 4.9% |
6 | 4320 | 4.7% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 9845 | |
D | 38 | 0.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1074 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 93632 | |
Latin | 9883 | 9.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 52052 | |
2 | 5848 | 6.2% |
4 | 5567 | 5.9% |
5 | 5191 | 5.5% |
7 | 5174 | 5.5% |
3 | 5127 | 5.5% |
8 | 4698 | 5.0% |
9 | 4581 | 4.9% |
6 | 4320 | 4.6% |
- | 1074 | 1.1% |
Latin
Value | Count | Frequency (%) |
M | 9845 | |
D | 38 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 103515 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 52052 | |
M | 9845 | 9.5% |
2 | 5848 | 5.6% |
4 | 5567 | 5.4% |
5 | 5191 | 5.0% |
7 | 5174 | 5.0% |
3 | 5127 | 5.0% |
8 | 4698 | 4.5% |
9 | 4581 | 4.4% |
6 | 4320 | 4.2% |
Other values (2) | 1112 | 1.1% |
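The character tables above group each character by its Unicode general category. As a rough illustration of how such counts could be derived, the sketch below applies Python's standard `unicodedata` module to two identifiers taken from the tables above. It is not pandas-profiling's own implementation, and `CATEGORY_NAMES` is a hypothetical helper that covers only the three categories seen here. Script and block counts are left out because `unicodedata` does not expose them.

```python
import unicodedata
from collections import Counter

# Two identifiers taken from the sample and common-values tables.
values = ["18-151-111", "M1111114811"]

# Hypothetical mapping from general-category codes to the names used in the report.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Pd": "Dash Punctuation",
}

# Count every character across all values.
char_counts = Counter(ch for value in values for ch in value)

# Count characters per general category, falling back to the raw code if unmapped.
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for value in values
    for ch in value
)

for name, count in category_counts.most_common():
    print(name, count)
```

On the full column the same tally would run over every non-missing value rather than two examples.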
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
ZW1S0 | |
---|---|
EWHG |
Common Values
Value | Count | Frequency (%) |
ZW1S0 | 8571 | |
EWHG | 1429 | 14.3% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
zw1s0 | 8571 | |
ewhg | 1429 | 14.3% |
Most occurring characters
Value | Count | Frequency (%) |
W | 10000 | |
Z | 8571 | |
1 | 8571 | |
S | 8571 | |
0 | 8571 | |
E | 1429 | 2.9% |
H | 1429 | 2.9% |
G | 1429 | 2.9% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 31429 | |
Decimal Number | 17142 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
W | 10000 | |
Z | 8571 | |
S | 8571 | |
E | 1429 | 4.5% |
H | 1429 | 4.5% |
G | 1429 | 4.5% |
Decimal Number
Value | Count | Frequency (%) |
1 | 8571 | |
0 | 8571 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 31429 | |
Common | 17142 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
W | 10000 | |
Z | 8571 | |
S | 8571 | |
E | 1429 | 4.5% |
H | 1429 | 4.5% |
G | 1429 | 4.5% |
Common
Value | Count | Frequency (%) |
1 | 8571 | |
0 | 8571 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 48571 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
W | 10000 | |
Z | 8571 | |
1 | 8571 | |
S | 8571 | |
0 | 8571 | |
E | 1429 | 2.9% |
H | 1429 | 2.9% |
G | 1429 | 2.9% |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
1 | |
---|---|
0 |
Common Values
Value | Count | Frequency (%) |
1 | 5656 | |
0 | 4344 |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
1 | 5656 | |
0 | 4344 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 5656 | |
0 | 4344 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 10000 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 5656 | |
0 | 4344 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 10000 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 5656 | |
0 | 4344 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 5656 | |
0 | 4344 |
Distinct | 973 |
---|---|
Distinct (%) | 9.8% |
Missing | 90 |
Missing (%) | 0.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 43144.16178 |
Minimum | 0 |
---|---|
Maximum | 325000 |
Zeros | 56 |
Zeros (%) | 0.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 127 |
Q1 | 1803 |
median | 14868.5 |
Q3 | 89722 |
95-th percentile | 103300 |
Maximum | 325000 |
Range | 325000 |
Interquartile range (IQR) | 87919 |
Descriptive statistics
Standard deviation | 44161.50452 |
---|---|
Coefficient of variation (CV) | 1.023580079 |
Kurtosis | -0.831130803 |
Mean | 43144.16178 |
Median Absolute Deviation (MAD) | 14770.5 |
Skewness | 0.4405112038 |
Sum | 427558643.3 |
Variance | 1950238482 |
Monotonicity | Not monotonic |
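Several of the figures above are derived from others in the same tables. The sketch below checks those relationships with the reported values, assuming the usual definitions: CV as standard deviation over mean, IQR as Q3 minus Q1, and variance as the square of the standard deviation. It also assumes the mean and sum are computed over non-missing values only, so that sum divided by mean recovers the non-missing count. Variable names are illustrative.

```python
import math

# Values copied from the quantile and descriptive statistics tables.
mean = 43144.16178
std = 44161.50452
q1, q3 = 1803, 89722
minimum, maximum = 0, 325000
total = 427558643.3
n_observations, n_missing = 10_000, 90

cv = std / mean                # coefficient of variation
iqr = q3 - q1                  # interquartile range
value_range = maximum - minimum
variance = std ** 2
n_non_missing = total / mean   # assumes missing values are excluded from sum and mean

print(f"CV: {cv:.9f}")
print(f"IQR: {iqr}")
print(f"Range: {value_range}")
print(f"Variance: {variance:.0f}")
print(f"Non-missing count: {n_non_missing:.1f}")

# The implied count should equal observations minus missing values.
assert math.isclose(n_non_missing, n_observations - n_missing, rel_tol=1e-6)
```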
Value | Count | Frequency (%) |
102000 | 259 | 2.6% |
95350 | 162 | 1.6% |
88900 | 144 | 1.4% |
97852 | 135 | 1.4% |
101500 | 119 | 1.2% |
68100 | 101 | 1.0% |
84000 | 97 | 1.0% |
107435 | 95 | 0.9% |
100210 | 95 | 0.9% |
105600 | 93 | 0.9% |
Other values (963) | 8610 | |
(Missing) | 90 | 0.9% |
Value | Count | Frequency (%) |
0 | 56 | |
2.2 | 3 | < 0.1% |
2.35 | 2 | < 0.1% |
4.3 | 1 | < 0.1% |
5 | 7 | 0.1% |
6 | 1 | < 0.1% |
8.7 | 2 | < 0.1% |
10 | 6 | 0.1% |
10.001 | 2 | < 0.1% |
10.6 | 3 | < 0.1% |
Value | Count | Frequency (%) |
325000 | 5 | 0.1% |
196297 | 9 | 0.1% |
140500 | 19 | 0.2% |
116569 | 31 | 0.3% |
116250 | 47 | |
107435 | 95 | |
105826 | 17 | 0.2% |
105600 | 93 | |
105464 | 15 | 0.1% |
104557 | 49 |
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
HALB | |
---|---|
(single space) | 90 |
ROH | 47 |
FERT | 32 |
ZCOP | 3 |
Common Values
Value | Count | Frequency (%) |
HALB | 9828 | |
(single space) | 90 | 0.9% |
ROH | 47 | 0.5% |
FERT | 32 | 0.3% |
ZCOP | 3 | < 0.1% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
halb | 9828 | |
roh | 47 | 0.5% |
fert | 32 | 0.3% |
zcop | 3 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
H | 9875 | |
A | 9828 | |
L | 9828 | |
B | 9828 | |
(space) | 90 | 0.2% |
R | 79 | 0.2% |
O | 50 | 0.1% |
F | 32 | 0.1% |
E | 32 | 0.1% |
T | 32 | 0.1% |
Other values (3) | 9 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 39593 | |
Space Separator | 90 | 0.2% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
H | 9875 | |
A | 9828 | |
L | 9828 | |
B | 9828 | |
R | 79 | 0.2% |
O | 50 | 0.1% |
F | 32 | 0.1% |
E | 32 | 0.1% |
T | 32 | 0.1% |
Z | 3 | < 0.1% |
Other values (2) | 6 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
(space) | 90 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 39593 | |
Common | 90 | 0.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
H | 9875 | |
A | 9828 | |
L | 9828 | |
B | 9828 | |
R | 79 | 0.2% |
O | 50 | 0.1% |
F | 32 | 0.1% |
E | 32 | 0.1% |
T | 32 | 0.1% |
Z | 3 | < 0.1% |
Other values (2) | 6 | < 0.1% |
Common
Value | Count | Frequency (%) |
(space) | 90 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 39683 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
H | 9875 | |
A | 9828 | |
L | 9828 | |
B | 9828 | |
(space) | 90 | 0.2% |
R | 79 | 0.2% |
O | 50 | 0.1% |
F | 32 | 0.1% |
E | 32 | 0.1% |
T | 32 | 0.1% |
Other values (3) | 9 | < 0.1% |
Distinct | 132 |
---|---|
Distinct (%) | 1.3% |
Missing | 90 |
Missing (%) | 0.9% |
Memory size | 78.2 KiB |
A-A05-SWA | |
---|---|
99 | |
A-S22-TRW | |
9999 | |
M-T03-UWA | |
Other values (127) |
Common Values
Value | Count | Frequency (%) |
A-A05-SWA | 2130 | |
99 | 1862 | |
A-S22-TRW | 803 | 8.0% |
9999 | 611 | 6.1% |
M-T03-UWA | 425 | 4.2% |
M-C02-00A | 353 | 3.5% |
A-S15-TRS | 314 | 3.1% |
A-T03-RUN | 266 | 2.7% |
M-H02-THA | 234 | 2.3% |
M-H03-HAA | 227 | 2.3% |
Other values (122) | 2685 |
Length
Value | Count | Frequency (%) |
a-a05-swa | 2130 | |
99 | 1862 | |
a-s22-trw | 803 | 8.1% |
9999 | 611 | 6.2% |
m-t03-uwa | 425 | 4.3% |
m-c02-00a | 353 | 3.6% |
a-s15-trs | 314 | 3.2% |
a-t03-run | 266 | 2.7% |
m-h02-tha | 234 | 2.4% |
m-h03-haa | 227 | 2.3% |
Other values (123) | 2688 |
Most occurring characters
Value | Count | Frequency (%) |
- | 14871 | |
A | 11555 | |
0 | 8462 | |
9 | 6171 | |
S | 4516 | 6.2% |
W | 3726 | 5.1% |
T | 3053 | 4.2% |
5 | 2640 | 3.6% |
M | 2509 | 3.4% |
2 | 2320 | 3.2% |
Other values (28) | 13275 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 34973 | |
Decimal Number | 23251 | |
Dash Punctuation | 14871 | |
Space Separator | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 11555 | |
S | 4516 | 12.9% |
W | 3726 | 10.7% |
T | 3053 | 8.7% |
M | 2509 | 7.2% |
H | 1895 | 5.4% |
R | 1790 | 5.1% |
C | 1109 | 3.2% |
U | 816 | 2.3% |
O | 727 | 2.1% |
Other values (16) | 3277 | 9.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 8462 | |
9 | 6171 | |
5 | 2640 | 11.4% |
2 | 2320 | 10.0% |
3 | 1729 | 7.4% |
1 | 1496 | 6.4% |
4 | 417 | 1.8% |
8 | 10 | < 0.1% |
6 | 5 | < 0.1% |
7 | 1 | < 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 14871 |
Space Separator
Value | Count | Frequency (%) |
(space) | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 38125 | |
Latin | 34973 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 11555 | |
S | 4516 | 12.9% |
W | 3726 | 10.7% |
T | 3053 | 8.7% |
M | 2509 | 7.2% |
H | 1895 | 5.4% |
R | 1790 | 5.1% |
C | 1109 | 3.2% |
U | 816 | 2.3% |
O | 727 | 2.1% |
Other values (16) | 3277 | 9.4% |
Common
Value | Count | Frequency (%) |
- | 14871 | |
0 | 8462 | |
9 | 6171 | |
5 | 2640 | 6.9% |
2 | 2320 | 6.1% |
3 | 1729 | 4.5% |
1 | 1496 | 3.9% |
4 | 417 | 1.1% |
8 | 10 | < 0.1% |
6 | 5 | < 0.1% |
Other values (2) | 4 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 73098 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 14871 | |
A | 11555 | |
0 | 8462 | |
9 | 6171 | |
S | 4516 | 6.2% |
W | 3726 | 5.1% |
T | 3053 | 4.2% |
5 | 2640 | 3.6% |
M | 2509 | 3.4% |
2 | 2320 | 3.2% |
Other values (28) | 13275 |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
True | |
---|---|
False |
Value | Count | Frequency (%) |
True | 5445 | |
False | 4555 |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 3 |
Missing (%) | < 0.1% |
Memory size | 78.2 KiB |
False | |
---|---|
True | |
(Missing) | 3 |
Value | Count | Frequency (%) |
False | 8625 | |
True | 1372 | 13.7% |
(Missing) | 3 | < 0.1% |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
True | |
---|---|
False | 518 |
Value | Count | Frequency (%) |
True | 9482 | |
False | 518 | 5.2% |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
False | |
---|---|
True | 28 |
Value | Count | Frequency (%) |
False | 9972 | |
True | 28 | 0.3% |
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
FALSE | |
---|---|
TRUE | 14 |
? | 2 |
Common Values
Value | Count | Frequency (%) |
FALSE | 9984 | |
TRUE | 14 | 0.1% |
? | 2 | < 0.1% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
false | 9984 | |
true | 14 | 0.1% |
(blank) | 2 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
E | 9998 | |
F | 9984 | |
A | 9984 | |
L | 9984 | |
S | 9984 | |
T | 14 | < 0.1% |
R | 14 | < 0.1% |
U | 14 | < 0.1% |
? | 2 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 49976 | |
Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
E | 9998 | |
F | 9984 | |
A | 9984 | |
L | 9984 | |
S | 9984 | |
T | 14 | < 0.1% |
R | 14 | < 0.1% |
U | 14 | < 0.1% |
Other Punctuation
Value | Count | Frequency (%) |
? | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 49976 | |
Common | 2 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
E | 9998 | |
F | 9984 | |
A | 9984 | |
L | 9984 | |
S | 9984 | |
T | 14 | < 0.1% |
R | 14 | < 0.1% |
U | 14 | < 0.1% |
Common
Value | Count | Frequency (%) |
? | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 49978 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
E | 9998 | |
F | 9984 | |
A | 9984 | |
L | 9984 | |
S | 9984 | |
T | 14 | < 0.1% |
R | 14 | < 0.1% |
U | 14 | < 0.1% |
? | 2 | < 0.1% |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
False | |
---|---|
True | 800 |
Value | Count | Frequency (%) |
False | 9200 | |
True | 800 | 8.0% |
Distinct | 629 |
---|---|
Distinct (%) | 6.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.9274899 |
Minimum | 0 |
---|---|
Maximum | 11355 |
Zeros | 313 |
Zeros (%) | 3.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.01 |
Q1 | 0.1 |
median | 0.3 |
Q3 | 2 |
95-th percentile | 95.3 |
Maximum | 11355 |
Range | 11355 |
Interquartile range (IQR) | 1.9 |
Descriptive statistics
Standard deviation | 299.3141348 |
---|---|
Coefficient of variation (CV) | 7.496442564 |
Kurtosis | 576.1314944 |
Mean | 39.9274899 |
Median Absolute Deviation (MAD) | 0.28 |
Skewness | 19.94402167 |
Sum | 399274.899 |
Variance | 89588.95127 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.1 | 1956 | |
0.01 | 561 | 5.6% |
0.2 | 509 | 5.1% |
0.5 | 436 | 4.4% |
1 | 428 | 4.3% |
0.22 | 389 | 3.9% |
0 | 313 | 3.1% |
0.3 | 208 | 2.1% |
2 | 200 | 2.0% |
0.4 | 188 | 1.9% |
Other values (619) | 4812 |
Value | Count | Frequency (%) |
0 | 313 | |
0.003 | 4 | < 0.1% |
0.004 | 2 | < 0.1% |
0.009 | 2 | < 0.1% |
0.01 | 561 | |
0.011 | 1 | < 0.1% |
0.02 | 119 | 1.2% |
0.025 | 3 | < 0.1% |
0.03 | 111 | 1.1% |
0.04 | 90 | 0.9% |
Value | Count | Frequency (%) |
11355 | 1 | |
10194 | 2 | |
8226 | 1 | |
5820 | 1 | |
4454 | 1 | |
4433 | 1 | |
4400 | 1 | |
4340 | 1 | |
4000 | 1 | |
3744 | 1 |
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
HALB | |
---|---|
ROH | 78 |
ZCOP | 35 |
FERT | 3 |
Common Values
Value | Count | Frequency (%) |
HALB | 9884 | |
ROH | 78 | 0.8% |
ZCOP | 35 | 0.4% |
FERT | 3 | < 0.1% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
halb | 9884 | |
roh | 78 | 0.8% |
zcop | 35 | 0.4% |
fert | 3 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
H | 9962 | |
A | 9884 | |
L | 9884 | |
B | 9884 | |
O | 113 | 0.3% |
R | 81 | 0.2% |
Z | 35 | 0.1% |
C | 35 | 0.1% |
P | 35 | 0.1% |
F | 3 | < 0.1% |
Other values (2) | 6 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 39922 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
H | 9962 | |
A | 9884 | |
L | 9884 | |
B | 9884 | |
O | 113 | 0.3% |
R | 81 | 0.2% |
Z | 35 | 0.1% |
C | 35 | 0.1% |
P | 35 | 0.1% |
F | 3 | < 0.1% |
Other values (2) | 6 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 39922 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
H | 9962 | |
A | 9884 | |
L | 9884 | |
B | 9884 | |
O | 113 | 0.3% |
R | 81 | 0.2% |
Z | 35 | 0.1% |
C | 35 | 0.1% |
P | 35 | 0.1% |
F | 3 | < 0.1% |
Other values (2) | 6 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 39922 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
H | 9962 | |
A | 9884 | |
L | 9884 | |
B | 9884 | |
O | 113 | 0.3% |
R | 81 | 0.2% |
Z | 35 | 0.1% |
C | 35 | 0.1% |
P | 35 | 0.1% |
F | 3 | < 0.1% |
Other values (2) | 6 | < 0.1% |
Distinct | 149 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
O-F03-000 | |
---|---|
O-S04-ST0 | |
O-F02-000 | |
F-S04-STA | |
O-M01-000 | 344 |
Other values (144) |
Common Values
Value | Count | Frequency (%) |
O-F03-000 | 2951 | |
O-S04-ST0 | 1054 | 10.5% |
O-F02-000 | 981 | 9.8% |
F-S04-STA | 460 | 4.6% |
O-M01-000 | 344 | 3.4% |
O-C08-000 | 326 | 3.3% |
M-L04-SS0 | 306 | 3.1% |
O-S05-000 | 248 | 2.5% |
O-C06-000 | 239 | 2.4% |
O-S04-ME0 | 239 | 2.4% |
Other values (139) | 2852 |
Length
Value | Count | Frequency (%) |
o-f03-000 | 2951 | |
o-s04-st0 | 1054 | 10.5% |
o-f02-000 | 981 | 9.8% |
f-s04-sta | 460 | 4.6% |
o-m01-000 | 344 | 3.4% |
o-c08-000 | 326 | 3.3% |
m-l04-ss0 | 306 | 3.1% |
o-s05-000 | 248 | 2.5% |
o-c06-000 | 239 | 2.4% |
o-s04-me0 | 239 | 2.4% |
Other values (139) | 2852 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 29321 | |
- | 19531 | |
O | 7429 | 8.4% |
S | 5634 | 6.4% |
F | 4718 | 5.3% |
M | 3732 | 4.2% |
3 | 3442 | 3.9% |
4 | 2815 | 3.2% |
T | 1848 | 2.1% |
2 | 1253 | 1.4% |
Other values (27) | 8790 | 9.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 39724 | |
Uppercase Letter | 29258 | |
Dash Punctuation | 19531 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
O | 7429 | |
S | 5634 | |
F | 4718 | |
M | 3732 | |
T | 1848 | 6.3% |
C | 1198 | 4.1% |
A | 1061 | 3.6% |
L | 532 | 1.8% |
G | 489 | 1.7% |
E | 440 | 1.5% |
Other values (16) | 2177 | 7.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 29321 | |
3 | 3442 | 8.7% |
4 | 2815 | 7.1% |
2 | 1253 | 3.2% |
1 | 1152 | 2.9% |
6 | 571 | 1.4% |
9 | 537 | 1.4% |
8 | 327 | 0.8% |
5 | 290 | 0.7% |
7 | 16 | < 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 19531 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 59255 | |
Latin | 29258 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
O | 7429 | |
S | 5634 | |
F | 4718 | |
M | 3732 | |
T | 1848 | 6.3% |
C | 1198 | 4.1% |
A | 1061 | 3.6% |
L | 532 | 1.8% |
G | 489 | 1.7% |
E | 440 | 1.5% |
Other values (16) | 2177 | 7.4% |
Common
Value | Count | Frequency (%) |
0 | 29321 | |
- | 19531 | |
3 | 3442 | 5.8% |
4 | 2815 | 4.8% |
2 | 1253 | 2.1% |
1 | 1152 | 1.9% |
6 | 571 | 1.0% |
9 | 537 | 0.9% |
8 | 327 | 0.6% |
5 | 290 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 88513 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 29321 | |
- | 19531 | |
O | 7429 | 8.4% |
S | 5634 | 6.4% |
F | 4718 | 5.3% |
M | 3732 | 4.2% |
3 | 3442 | 3.9% |
4 | 2815 | 3.2% |
T | 1848 | 2.1% |
2 | 1253 | 1.4% |
Other values (27) | 8790 | 9.9% |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
True | |
---|---|
False | 427 |
Value | Count | Frequency (%) |
True | 9573 | |
False | 427 | 4.3% |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
True |
---|
Value | Count | Frequency (%) |
True | 10000 |
has_coatings.1
Boolean
HIGH CORRELATION
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
False | |
---|---|
True |
Value | Count | Frequency (%) |
False | 6768 | |
True | 3232 |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
False | |
---|---|
True |
Value | Count | Frequency (%) |
False | 7581 | |
True | 2419 | 24.2% |
has_matlspecs.1
Boolean
HIGH CORRELATION
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
False | |
---|---|
True |
Value | Count | Frequency (%) |
False | 5943 | |
True | 4057 |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
False | |
---|---|
True | 125 |
Value | Count | Frequency (%) |
False | 9875 | |
True | 125 | 1.2% |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
False | |
---|---|
True | 272 |
Value | Count | Frequency (%) |
False | 9728 | |
True | 272 | 2.7% |
Distinct | 186 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 75.4982 |
Minimum | 1 |
---|---|
Maximum | 227 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 41 |
Q3 | 155 |
95-th percentile | 198 |
Maximum | 227 |
Range | 226 |
Interquartile range (IQR) | 154 |
Descriptive statistics
Standard deviation | 75.98282922 |
---|---|
Coefficient of variation (CV) | 1.006419083 |
Kurtosis | -1.448357323 |
Mean | 75.4982 |
Median Absolute Deviation (MAD) | 40 |
Skewness | 0.4084646976 |
Sum | 754982 |
Variance | 5773.390336 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3424 | |
3 | 495 | 5.0% |
166 | 408 | 4.1% |
168 | 368 | 3.7% |
145 | 335 | 3.4% |
34 | 263 | 2.6% |
123 | 243 | 2.4% |
32 | 213 | 2.1% |
41 | 179 | 1.8% |
167 | 175 | 1.8% |
Other values (176) | 3897 |
Value | Count | Frequency (%) |
1 | 3424 | |
2 | 26 | 0.3% |
3 | 495 | 5.0% |
4 | 5 | 0.1% |
5 | 23 | 0.2% |
6 | 6 | 0.1% |
8 | 13 | 0.1% |
9 | 2 | < 0.1% |
10 | 1 | < 0.1% |
11 | 21 | 0.2% |
Value | Count | Frequency (%) |
227 | 1 | < 0.1% |
226 | 5 | 0.1% |
225 | 3 | < 0.1% |
224 | 9 | 0.1% |
223 | 2 | < 0.1% |
222 | 40 | |
220 | 43 | |
219 | 4 | < 0.1% |
217 | 1 | < 0.1% |
216 | 18 |
Distinct | 41 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17.094 |
Minimum | 1 |
---|---|
Maximum | 43 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 17 |
Q3 | 31 |
95-th percentile | 38 |
Maximum | 43 |
Range | 42 |
Interquartile range (IQR) | 30 |
Descriptive statistics
Standard deviation | 14.50449593 |
---|---|
Coefficient of variation (CV) | 0.8485138601 |
Kurtosis | -1.560779662 |
Mean | 17.094 |
Median Absolute Deviation (MAD) | 16 |
Skewness | 0.1045796381 |
Sum | 170940 |
Variance | 210.380402 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3424 | |
33 | 1061 | 10.6% |
25 | 554 | 5.5% |
2 | 521 | 5.2% |
16 | 487 | 4.9% |
30 | 427 | 4.3% |
17 | 355 | 3.5% |
35 | 351 | 3.5% |
41 | 304 | 3.0% |
36 | 292 | 2.9% |
Other values (31) | 2224 |
Value | Count | Frequency (%) |
1 | 3424 | |
2 | 521 | 5.2% |
3 | 28 | 0.3% |
4 | 6 | 0.1% |
5 | 37 | 0.4% |
6 | 11 | 0.1% |
8 | 59 | 0.6% |
9 | 2 | < 0.1% |
10 | 1 | < 0.1% |
11 | 23 | 0.2% |
Value | Count | Frequency (%) |
43 | 60 | 0.6% |
42 | 81 | 0.8% |
41 | 304 | |
40 | 29 | 0.3% |
39 | 23 | 0.2% |
38 | 103 | 1.0% |
37 | 123 | 1.2% |
36 | 292 | |
35 | 351 | |
34 | 7 | 0.1% |
Distinct | 439 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 238.9678 |
Minimum | 1 |
---|---|
Maximum | 556 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 82 |
median | 218 |
Q3 | 422 |
95-th percentile | 509 |
Maximum | 556 |
Range | 555 |
Interquartile range (IQR) | 340 |
Descriptive statistics
Standard deviation | 178.5097755 |
---|---|
Coefficient of variation (CV) | 0.7470034685 |
Kurtosis | -1.373218026 |
Mean | 238.9678 |
Median Absolute Deviation (MAD) | 150 |
Skewness | 0.3276961124 |
Sum | 2389678 |
Variance | 31865.73994 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
106 | 887 | 8.9% |
496 | 450 | 4.5% |
71 | 430 | 4.3% |
60 | 372 | 3.7% |
2 | 355 | 3.5% |
509 | 271 | 2.7% |
3 | 258 | 2.6% |
224 | 248 | 2.5% |
111 | 174 | 1.7% |
105 | 173 | 1.7% |
Other values (429) | 6382 |
Value | Count | Frequency (%) |
1 | 3 | < 0.1% |
2 | 355 | |
3 | 258 | |
4 | 64 | 0.6% |
5 | 1 | < 0.1% |
6 | 44 | 0.4% |
7 | 12 | 0.1% |
8 | 71 | 0.7% |
9 | 28 | 0.3% |
10 | 29 | 0.3% |
Value | Count | Frequency (%) |
556 | 1 | < 0.1% |
549 | 10 | 0.1% |
548 | 6 | 0.1% |
546 | 60 | |
545 | 51 | |
544 | 6 | 0.1% |
543 | 8 | 0.1% |
542 | 13 | 0.1% |
541 | 2 | < 0.1% |
540 | 2 | < 0.1% |
Distinct | 93 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 45.5133 |
Minimum | 1 |
---|---|
Maximum | 101 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 21 |
median | 44 |
Q3 | 74 |
95-th percentile | 88 |
Maximum | 101 |
Range | 100 |
Interquartile range (IQR) | 53 |
Descriptive statistics
Standard deviation | 29.21666279 |
---|---|
Coefficient of variation (CV) | 0.6419368138 |
Kurtosis | -1.318299814 |
Mean | 45.5133 |
Median Absolute Deviation (MAD) | 23 |
Skewness | 0.1786769073 |
Sum | 455133 |
Variance | 853.6133844 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
24 | 1041 | 10.4% |
21 | 863 | 8.6% |
88 | 772 | 7.7% |
1 | 616 | 6.2% |
86 | 453 | 4.5% |
19 | 429 | 4.3% |
23 | 293 | 2.9% |
46 | 251 | 2.5% |
10 | 250 | 2.5% |
2 | 249 | 2.5% |
Other values (83) | 4783 |
Value | Count | Frequency (%) |
1 | 616 | |
2 | 249 | |
6 | 7 | 0.1% |
7 | 43 | 0.4% |
8 | 15 | 0.1% |
9 | 66 | 0.7% |
10 | 250 | |
12 | 36 | 0.4% |
13 | 20 | 0.2% |
14 | 6 | 0.1% |
Value | Count | Frequency (%) |
101 | 1 | < 0.1% |
97 | 156 | |
96 | 18 | 0.2% |
95 | 20 | 0.2% |
94 | 10 | 0.1% |
93 | 27 | 0.3% |
92 | 3 | < 0.1% |
91 | 18 | 0.2% |
90 | 2 | < 0.1% |
89 | 11 | 0.1% |
Distinct | 8301 |
---|---|
Distinct (%) | 83.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
ltr product guaranteed shorthole max change tilting bracket enclosure chassis actual functional variety Casing head or Wellhead | 3 |
---|---|
date difference truck the mobilizing verticallyup Drill string | 3 |
combustion levers superior rear system level actuation springloaded mowing products fluids | 3 |
stateoftheart cooled independent repair like strategically low motorplanetary joystick blast spooling allow opened shifts metric optimal switch page Blowout preventer (BOP) Pipe ram & blind ram | 3 |
stable rods making comfort multiple along mounting alters Drill string | 3 |
Other values (8296) | 9985 |
Length
Max length | 242 |
---|---|
Median length | 160 |
Mean length | 113.4972 |
Min length | 30 |
Characters and Unicode
Total characters | 1134972 |
---|---|
Distinct characters | 45 |
Distinct categories | 7 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 6704 |
---|---|
Unique (%) | 67.0% |
Sample
1st row | ltr product guaranteed shorthole max change tilting bracket enclosure chassis actual functional variety Casing head or Wellhead |
---|---|
2nd row | beacon carried diesel support center induction directs rotating horizontal locations places quality offers Casing head or Wellhead |
3rd row | driller direct performer work trusted major risk tight hard even operational traverse www Drill floor |
4th row | center tine strategically sabotagefree premium travel brake length oil comls near fingerboard Rotary table |
5th row | ltr product guaranteed shorthole max change tilting bracket enclosure chassis actual functional variety Casing head or Wellhead |
Common Values
Value | Count | Frequency (%) |
ltr product guaranteed shorthole max change tilting bracket enclosure chassis actual functional variety Casing head or Wellhead | 3 | < 0.1% |
date difference truck the mobilizing verticallyup Drill string | 3 | < 0.1% |
combustion levers superior rear system level actuation springloaded mowing products fluids | 3 | < 0.1% |
stateoftheart cooled independent repair like strategically low motorplanetary joystick blast spooling allow opened shifts metric optimal switch page Blowout preventer (BOP) Pipe ram & blind ram | 3 | < 0.1% |
stable rods making comfort multiple along mounting alters Drill string | 3 | < 0.1% |
operate functionality ever configured kit equipped enough chassis core rigid stainless expectations trademark provides position options Bell nipple | 3 | < 0.1% |
wash modular crown markets fully mud stro keeping Drill floor | 3 | < 0.1% |
times relatively shortly challenge staff vertically needs machines after string fitted complete layout emptying | 3 | < 0.1% |
maneuverability tramming remembers handle incorporates modular decades | 3 | < 0.1% |
requires balance contents electric list sensing secionts gauge hoist lifted are shift environmental width | 3 | < 0.1% |
Other values (8291) | 9970 | 99.7% |
Words
Value | Count | Frequency (%) |
drill | 2643 | 1.8% |
pipe | 1243 | 0.8% |
ram | 1166 | 0.8% |
bop | 1109 | 0.7% |
preventer | 1109 | 0.7% |
blowout | 1109 | 0.7% |
floor | 1103 | 0.7% |
or | 762 | 0.5% |
string | 749 | 0.5% |
head | 738 | 0.5% |
Other values (1341) | 137053 | 92.1% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 140666 | 12.4% |
e | 112827 | 9.9% |
i | 79735 | 7.0% |
r | 76494 | 6.7% |
a | 74084 | 6.5% |
t | 73584 | 6.5% |
n | 67322 | 5.9% |
l | 66118 | 5.8% |
o | 65353 | 5.8% |
s | 58132 | 5.1% |
Other values (35) | 320657 | 28.3% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 975059 | 85.9% |
Space Separator | 140666 | 12.4% |
Uppercase Letter | 13740 | 1.2% |
Open Punctuation | 2342 | 0.2% |
Close Punctuation | 2342 | 0.2% |
Other Punctuation | 583 | 0.1% |
Dash Punctuation | 240 | < 0.1% |
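The category names in this table correspond to Unicode general categories. How the profiling tool derives them is not shown in this report; as a minimal sketch, Python's standard `unicodedata` module returns the two-letter codes (`Ll` lowercase letter, `Lu` uppercase letter, `Zs` space separator, `Ps` open punctuation, `Pe` close punctuation, `Po` other punctuation, `Pd` dash punctuation). The helper name `category_counts` is hypothetical, and the sample string is the tail of one of the `part_desc` values listed above.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category code (e.g. 'Ll', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "Blowout preventer (BOP) Pipe ram & blind ram"
counts = category_counts(sample)

# List categories from most to least frequent, as the report table does.
for code, n in counts.most_common():
    print(code, n)
```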
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 112827 | 11.6% |
i | 79735 | 8.2% |
r | 76494 | 7.8% |
a | 74084 | 7.6% |
t | 73584 | 7.5% |
n | 67322 | 6.9% |
l | 66118 | 6.8% |
o | 65353 | 6.7% |
s | 58132 | 6.0% |
c | 41317 | 4.2% |
Other values (16) | 260093 | 26.7% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 3344 | 24.3% |
D | 2201 | 16.0% |
P | 1692 | 12.3% |
S | 1472 | 10.7% |
R | 1171 | 8.5% |
O | 1109 | 8.1% |
C | 775 | 5.6% |
W | 640 | 4.7% |
A | 526 | 3.8% |
M | 336 | 2.4% |
Other values (4) | 474 | 3.4% |
Space Separator
Value | Count | Frequency (%) |
(space) | 140666 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 2342 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 2342 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
& | 583 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 240 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 988799 | 87.1% |
Common | 146173 | 12.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 112827 | 11.4% |
i | 79735 | 8.1% |
r | 76494 | 7.7% |
a | 74084 | 7.5% |
t | 73584 | 7.4% |
n | 67322 | 6.8% |
l | 66118 | 6.7% |
o | 65353 | 6.6% |
s | 58132 | 5.9% |
c | 41317 | 4.2% |
Other values (30) | 273833 | 27.7% |
Common
Value | Count | Frequency (%) |
(space) | 140666 | 96.2% |
( | 2342 | 1.6% |
) | 2342 | 1.6% |
& | 583 | 0.4% |
- | 240 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1134972 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 140666 | 12.4% |
e | 112827 | 9.9% |
i | 79735 | 7.0% |
r | 76494 | 6.7% |
a | 74084 | 6.5% |
t | 73584 | 6.5% |
n | 67322 | 5.9% |
l | 66118 | 5.8% |
o | 65353 | 5.8% |
s | 58132 | 5.1% |
Other values (35) | 320657 | 28.3% |
Distinct | 336 |
---|---|
Distinct (%) | 3.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
12-11-2012 | 47 |
---|---|
9-22-2012 | 45 |
3-3-2012 | 44 |
2-18-2012 | 43 |
10-10-2012 | 42 |
Other values (331) | 9779 |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 8.9293 |
Min length | 8 |
Characters and Unicode
Total characters | 89293 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11-5-2012 |
---|---|
2nd row | 12-23-2012 |
3rd row | 11-14-2012 |
4th row | 9-15-2012 |
5th row | 5-22-2012 |
Common Values
Value | Count | Frequency (%) |
12-11-2012 | 47 | 0.5% |
9-22-2012 | 45 | 0.4% |
3-3-2012 | 44 | 0.4% |
2-18-2012 | 43 | 0.4% |
10-10-2012 | 42 | 0.4% |
2-15-2012 | 42 | 0.4% |
9-18-2012 | 42 | 0.4% |
10-5-2012 | 41 | 0.4% |
6-16-2012 | 41 | 0.4% |
2-1-2012 | 41 | 0.4% |
Other values (326) | 9572 | 95.7% |
Words
Value | Count | Frequency (%) |
12-11-2012 | 47 | 0.5% |
9-22-2012 | 45 | 0.4% |
3-3-2012 | 44 | 0.4% |
2-18-2012 | 43 | 0.4% |
10-10-2012 | 42 | 0.4% |
2-15-2012 | 42 | 0.4% |
9-18-2012 | 42 | 0.4% |
2-1-2012 | 41 | 0.4% |
7-7-2012 | 41 | 0.4% |
6-16-2012 | 41 | 0.4% |
Other values (326) | 9572 | 95.7% |
Most occurring characters
Value | Count | Frequency (%) |
2 | 26003 | 29.1% |
- | 20000 | 22.4% |
1 | 18847 | 21.1% |
0 | 11548 | 12.9% |
6 | 1947 | 2.2% |
3 | 1923 | 2.2% |
5 | 1907 | 2.1% |
4 | 1891 | 2.1% |
8 | 1890 | 2.1% |
7 | 1851 | 2.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 69293 | 77.6% |
Dash Punctuation | 20000 | 22.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 26003 | 37.5% |
1 | 18847 | 27.2% |
0 | 11548 | 16.7% |
6 | 1947 | 2.8% |
3 | 1923 | 2.8% |
5 | 1907 | 2.8% |
4 | 1891 | 2.7% |
8 | 1890 | 2.7% |
7 | 1851 | 2.7% |
9 | 1486 | 2.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 20000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 89293 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 26003 | 29.1% |
- | 20000 | 22.4% |
1 | 18847 | 21.1% |
0 | 11548 | 12.9% |
6 | 1947 | 2.2% |
3 | 1923 | 2.2% |
5 | 1907 | 2.1% |
4 | 1891 | 2.1% |
8 | 1890 | 2.1% |
7 | 1851 | 2.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 89293 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 26003 | 29.1% |
- | 20000 | 22.4% |
1 | 18847 | 21.1% |
0 | 11548 | 12.9% |
6 | 1947 | 2.2% |
3 | 1923 | 2.2% |
5 | 1907 | 2.1% |
4 | 1891 | 2.1% |
8 | 1890 | 2.1% |
7 | 1851 | 2.1% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, and therefore captures nonlinear monotonic relationships better than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
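The calculation described above can be sketched in plain Python: rank both variables, then divide the covariance of the ranks by the product of their standard deviations. This is an illustrative sketch, not the code the report generator runs; the helper names `average_ranks`, `correlation`, and `spearman_rho` are assumptions, and ties are given the average of their positions, which is one common convention.

```python
from statistics import mean, pstdev

def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared
        start = end + 1
    return ranks

def correlation(x, y):
    """Population covariance divided by the product of the standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (pstdev(x) * pstdev(y))

def spearman_rho(x, y):
    """Apply the same ratio to the rank variables of x and y."""
    return correlation(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]   # monotonic in x, but not linear

rho = spearman_rho(x, y)  # equals 1 up to floating-point rounding
r = correlation(x, y)     # the same ratio on the raw values is below 1
print(rho, r)
```

For this toy input the rank-based coefficient reaches 1 while the raw-value ratio is roughly 0.94, which illustrates why ρ is the better detector of nonlinear monotonic relationships.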
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a perfectly linear relationship the steepness of the line does not affect r (only the sign of the slope does). To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
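The invariance property can be checked numerically. The sketch below is illustrative only: the function name `pearson_r` and the toy data are assumptions, not values from this dataset. It computes r, then recomputes it after shifting and rescaling each variable with a positive factor, and after negating one variable.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations.

    The 1/n factors in the covariance and the standard deviations cancel,
    so plain sums of products are used.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

x = [2.0, 4.0, 5.0, 7.0, 9.0]
y = [1.0, 3.0, 2.0, 6.0, 8.0]

r_base = pearson_r(x, y)
# Separate changes of location and (positive) scale leave r unchanged.
r_rescaled = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
# Negating one variable flips the sign of r but not its magnitude.
r_flipped = pearson_r(x, [-b for b in y])
print(r_base, r_rescaled, r_flipped)
```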
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
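The pair-counting definition above translates directly into code. The following sketch implements exactly that formula (often called τ-a), in which tied pairs count as neither concordant nor discordant. Library implementations frequently use a tie-adjusted variant instead, so results can differ when ties are present. The function name `kendall_tau` and the toy data are assumptions for illustration.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # A pair is concordant when x and y move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]

tau = kendall_tau(x, y)   # 7 concordant and 3 discordant pairs out of 10
print(tau)
```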
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a correlation coefficient that works consistently across categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient for a bivariate normal input distribution. Extensive documentation for φk is available.
First rows
| | material_id | rig_plant | qty_replaced | m_weight | material_type | material_group | surface_matl | has_coatings | has_documents | has_matlspecs | has_weldspecs | has_qspecs | weight | material_type.1 | material_group.1 | surface_matl.1 | has_materialtype | has_coatings.1 | has_documents.1 | has_matlspecs.1 | has_weldspecs.1 | has_qspecs.1 | area1 | area2 | area3 | area4 | part_desc | Date |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 18-151-111 | ZW1S0 | 1 | 121.0 | HALB | M-T03-UWA | True | False | True | False | FALSE | False | 0.200 | HALB | O-S05-000 | True | True | False | False | False | False | False | 42 | 17 | 4 | 2 | ltr product guaranteed shorthole max change tilting bracket enclosure chassis actual functional variety Casing head or Wellhead | 11-5-2012 |
1 | 18-151-111 | ZW1S0 | 1 | 121.0 | HALB | M-T03-UWA | True | False | True | False | FALSE | False | 0.030 | HALB | O-S05-000 | True | True | False | False | False | False | False | 42 | 17 | 435 | 75 | beacon carried diesel support center induction directs rotating horizontal locations places quality offers Casing head or Wellhead | 12-23-2012 |
2 | 18-187-411 | ZW1S0 | 1 | 480.0 | HALB | 99 | True | False | True | False | FALSE | False | 0.900 | HALB | M-L04-SS0 | True | True | True | False | True | False | False | 44 | 18 | 495 | 85 | driller direct performer work trusted major risk tight hard even operational traverse www Drill floor | 11-14-2012 |
3 | 18-187-411 | ZW1S0 | 1 | 480.0 | HALB | 99 | True | False | True | False | FALSE | False | 5.700 | HALB | O-F03-000 | True | True | False | False | False | False | False | 44 | 18 | 325 | 63 | center tine strategically sabotagefree premium travel brake length oil comls near fingerboard Rotary table | 9-15-2012 |
4 | 18-222-291 | EWHG | 0 | 2650.0 | HALB | 9999 | True | False | True | False | FALSE | False | 4.700 | HALB | O-S04-ST0 | True | True | True | False | True | False | False | 51 | 18 | 248 | 49 | ltr product guaranteed shorthole max change tilting bracket enclosure chassis actual functional variety Casing head or Wellhead | 5-22-2012 |
5 | 18-222-291 | ZW1S0 | 1 | 2650.0 | HALB | 9999 | True | False | True | False | FALSE | False | 8.500 | HALB | M-L04-SS0 | True | True | True | False | True | False | False | 51 | 18 | 199 | 42 | opposed regressed purge unattended maneuverability within variable major utilizing toughest from pending components mobilizing decrease centralizer contact range representation Bell nipple | 7-25-2012 |
6 | 18-222-291 | ZW1S0 | 1 | 2650.0 | HALB | 9999 | True | False | True | False | FALSE | False | 85.001 | HALB | 99 | True | True | True | True | True | False | False | 51 | 18 | 530 | 93 | adjust fluids energy positioning working upgrade staff hold injuries from support brake ring shift rotary northern aqhq holes Drill floor | 6-21-2012 |
7 | 18-222-291 | ZW1S0 | 1 | 2650.0 | HALB | 9999 | True | False | True | False | FALSE | False | 4.700 | HALB | O-S04-ST0 | True | True | True | False | True | False | False | 51 | 18 | 248 | 49 | transport lcs well hoist special featuring professional brakes torque can larger super raises easy circuit ensuring been umx | 8-28-2012 |
8 | 18-316-151 | ZW1S0 | 1 | 440.0 | HALB | 9999 | True | False | True | False | FALSE | False | 1.000 | HALB | O-S05-000 | True | True | True | False | True | False | False | 127 | 27 | 429 | 74 | raise announce tractor functionality gives innovative eliminating reach fluids norac increase breached bodies shortly casing forward metric Bell nipple | 2-25-2012 |
9 | 18-316-161 | ZW1S0 | 1 | 1460.0 | HALB | M-T03-XTA | False | False | True | False | FALSE | False | 0.100 | HALB | O-F03-000 | True | True | True | False | True | False | False | 127 | 27 | 106 | 24 | norac production expanded spt combined cable point presenting tractors actuator tricone comlx sabotagefree jaw maneuverability following lattice further braking Bell nipple | 4-3-2012 |
Last rows
| | material_id | rig_plant | qty_replaced | m_weight | material_type | material_group | surface_matl | has_coatings | has_documents | has_matlspecs | has_weldspecs | has_qspecs | weight | material_type.1 | material_group.1 | surface_matl.1 | has_materialtype | has_coatings.1 | has_documents.1 | has_matlspecs.1 | has_weldspecs.1 | has_qspecs.1 | area1 | area2 | area3 | area4 | part_desc | Date |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
9990 | M7111114166 | ZW1S0 | 0 | 7959.0 | HALB | 99 | True | False | True | False | FALSE | False | 1.19 | HALB | F-S04-STA | True | True | False | True | False | False | False | 99 | 25 | 509 | 88 | replacement dramatically spaces remain warranty amongst however each uniquely insert allowing Casing head or Wellhead | 4-11-2012 |
9991 | M7111114166 | ZW1S0 | 0 | 7959.0 | HALB | 99 | True | False | True | False | FALSE | False | 2.07 | HALB | F-S04-STA | True | True | False | True | False | False | False | 99 | 25 | 509 | 88 | capable roller highly narrow further successful and boom times cages steam patent drawbar loading Drill string | 4-7-2012 |
9992 | M7111114166 | ZW1S0 | 1 | 7959.0 | HALB | 99 | True | False | True | False | FALSE | False | 0.10 | HALB | O-S04-ST0 | True | True | False | False | False | False | False | 99 | 25 | 71 | 21 | liter unit requirement eliminates storage comlff lpm predetermined tricone expanded cover when intuitive transmitting recovery fully tells movement longyears pilot Blowout preventer (BOP) Annular type | 9-24-2012 |
9993 | M7111114166 | ZW1S0 | 1 | 7959.0 | HALB | 99 | True | False | True | False | FALSE | False | 0.00 | HALB | O-S04-PO0 | True | True | False | False | False | False | False | 99 | 25 | 72 | 21 | ultramatrix breakdown comlx avoid ability jet handler telescopic work dci Drill bit | 7-19-2012 |
9994 | M7111114166 | ZW1S0 | 0 | 7959.0 | HALB | 99 | True | False | True | False | FALSE | False | 1.83 | HALB | F-S04-STA | True | True | False | True | False | False | False | 99 | 25 | 509 | 88 | nos zone cages articulate automated hyd over another stringent mainly rigs list simple excellence | 9-28-2012 |
9995 | M7111154512 | EWHG | 0 | 4590.0 | HALB | 99 | True | False | True | False | FALSE | True | 0.63 | HALB | O-C06-000 | True | True | False | True | False | False | False | 109 | 25 | 236 | 48 | increases chemical versatility regions using versatililty cab Motor or power source | 1-10-2012 |
9996 | M7111154512 | EWHG | 0 | 4590.0 | HALB | 99 | True | False | True | False | FALSE | True | 0.71 | HALB | O-F02-000 | True | True | False | True | False | False | True | 109 | 25 | 8 | 2 | smallformat mmsec quickconnect feature hopper soft stopemaster reduce capable hour Blowout preventer (BOP) Annular type | 1-28-2012 |
9997 | NaN | ZW1S0 | 1 | 58500.0 | HALB | 99 | False | False | True | False | ? | False | 0.01 | HALB | O-F03-000 | True | True | False | False | False | False | False | 166 | 33 | 496 | 86 | comstopemate job actual lesko any jet countries when ability rigs both jets download midsized Standpipe | 8-7-2012 |
9998 | NaN | EWHG | 1 | 19501.0 | HALB | O-C04-000 | False | False | True | False | FALSE | False | 0.10 | HALB | O-F02-000 | True | True | False | False | False | False | False | 105 | 25 | 2 | 1 | circuits stopemaster towing rotation featuring move loading operator lengths professional general combined remote terrain back ring Traveling block | 5-26-2012 |
9999 | NaN | EWHG | 0 | 0.0 | HALB | A-S22-TRW | True | False | True | False | FALSE | False | 1.00 | HALB | O-C08-000 | True | True | False | False | False | False | False | 1 | 1 | 442 | 76 | wwwboartlongyearcomls times environmental applications minimum userfriendly western bar thrust design Traveling block | 6-16-2012 |