Dataset statistics
Number of variables | 46 |
---|---|
Number of observations | 98053 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 34.4 MiB |
Average record size in memory | 368.0 B |
Variable types
Numeric | 12 |
---|---|
Categorical | 30 |
Boolean | 4 |
examide has constant value "False" | Constant |
citoglipton has constant value "False" | Constant |
metformin-rosiglitazone has constant value "False" | Constant |
diag_1 has a high cardinality: 713 distinct values | High cardinality |
diag_2 has a high cardinality: 740 distinct values | High cardinality |
diag_3 has a high cardinality: 786 distinct values | High cardinality |
tolbutamide is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
insulin is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
A1Cresult is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
diabetesMed is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
max_glu_serum is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
glyburide is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
metformin is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
metformin-pioglitazone is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
metformin-rosiglitazone is highly correlated with tolbutamide and 29 other fields | High correlation |
race is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
miglitol is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
gender is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
chlorpropamide is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
repaglinide is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
examide is highly correlated with tolbutamide and 29 other fields | High correlation |
acetohexamide is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
age is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
troglitazone is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
change is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
glipizide is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
readmitted is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
acarbose is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
glimepiride-pioglitazone is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
pioglitazone is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
glimepiride is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
citoglipton is highly correlated with tolbutamide and 29 other fields | High correlation |
glipizide-metformin is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
glyburide-metformin is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
tolazamide is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
rosiglitazone is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
nateglinide is highly correlated with metformin-rosiglitazone and 2 other fields | High correlation |
number_emergency is highly skewed (γ1 = 22.71034016) | Skewed |
df_index has unique values | Unique |
num_procedures has 44574 (45.5%) zeros | Zeros |
number_outpatient has 81680 (83.3%) zeros | Zeros |
number_emergency has 86846 (88.6%) zeros | Zeros |
number_inpatient has 64634 (65.9%) zeros | Zeros |
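The zero-share warnings above can be re-derived from the counts alone. A minimal sketch, assuming only the observation count (98053) and the zero counts listed in the warnings; the helper name `zero_share` is illustrative and is not part of pandas-profiling:

```python
# Observation count and per-variable zero counts, copied from the report.
N_OBSERVATIONS = 98053
ZERO_COUNTS = {
    "num_procedures": 44574,
    "number_outpatient": 81680,
    "number_emergency": 86846,
    "number_inpatient": 64634,
}


def zero_share(zeros: int, total: int = N_OBSERVATIONS) -> float:
    """Return the percentage of rows equal to zero, rounded to one decimal."""
    return round(100.0 * zeros / total, 1)


# Hypothetical helper output: variable name -> percentage of zeros.
zero_shares = {name: zero_share(count) for name, count in ZERO_COUNTS.items()}

for name, share in zero_shares.items():
    print(f"{name}: {share}% zeros")
```

The rounded shares should agree with the percentages quoted in the warnings.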
Reproduction
Analysis started | 2021-05-05 21:22:20.387978 |
---|---|
Analysis finished | 2021-05-05 21:23:16.876767 |
Duration | 56.49 seconds |
Software version | pandas-profiling v2.11.0 |
Download configuration | config.yaml |
df_index
Real number (ℝ≥0)
Distinct | 98053 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 51115.56242 |
---|---|
Minimum | 1 |
Maximum | 101765 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5180.6 |
Q1 | 25575 |
median | 51369 |
Q3 | 76379 |
95-th percentile | 96683.4 |
Maximum | 101765 |
Range | 101764 |
Interquartile range (IQR) | 50804 |
Descriptive statistics
Standard deviation | 29307.25248 |
---|---|
Coefficient of variation (CV) | 0.573352832 |
Kurtosis | -1.191414224 |
Mean | 51115.56242 |
Median Absolute Deviation (MAD) | 25399 |
Skewness | -0.01478077774 |
Sum | 5012034242 |
Variance | 858915047.7 |
Monotonicity | Strictly increasing |
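Several of the figures in this block are functions of the others, which gives a quick consistency check. A minimal sketch, using only numbers copied from the tables above; the dictionary name `derived` is illustrative:

```python
import math

# Figures copied from the statistics tables for this variable.
n = 98053
total = 5012034242
mean = 51115.56242
std = 29307.25248
variance = 858915047.7
cv = 0.573352832
q1, q3 = 25575, 76379
minimum, maximum = 1, 101765

derived = {
    "mean": total / n,           # Sum divided by the number of observations
    "variance": std ** 2,        # square of the standard deviation
    "cv": std / mean,            # coefficient of variation
    "iqr": q3 - q1,              # interquartile range
    "range": maximum - minimum,  # Maximum minus Minimum
}

for name, value in derived.items():
    print(f"{name}: {value}")

# Compare with a relative tolerance, since the report rounds its inputs.
print(math.isclose(derived["cv"], cv, rel_tol=1e-7))
```

Small differences in the last digits are expected because the reported values are themselves rounded.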
Value | Count | Frequency (%) |
2047 | 1 | < 0.1% |
80562 | 1 | < 0.1% |
29339 | 1 | < 0.1% |
19100 | 1 | < 0.1% |
17053 | 1 | < 0.1% |
23198 | 1 | < 0.1% |
21151 | 1 | < 0.1% |
101028 | 1 | < 0.1% |
98981 | 1 | < 0.1% |
76464 | 1 | < 0.1% |
Other values (98043) | 98043 | > 99.9% |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
2 | 1 | < 0.1% |
3 | 1 | < 0.1% |
4 | 1 | < 0.1% |
5 | 1 | < 0.1% |
Value | Count | Frequency (%) |
101765 | 1 | < 0.1% |
101764 | 1 | < 0.1% |
101763 | 1 | < 0.1% |
101762 | 1 | < 0.1% |
101761 | 1 | < 0.1% |
race
Categorical
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 766.2 KiB |
Caucasian | 75079 |
---|---|
AfricanAmerican | 18881 |
Hispanic | 1984 |
Other | 1484 |
Asian | 625 |
Length
Max length | 15 |
---|---|
Median length | 9 |
Mean length | 10.0490857 |
Min length | 5 |
Characters and Unicode
Total characters | 985343 |
---|---|
Distinct characters | 17 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Caucasian |
---|---|
2nd row | AfricanAmerican |
3rd row | Caucasian |
4th row | Caucasian |
5th row | Caucasian |
Value | Count | Frequency (%) |
Caucasian | 75079 | 76.6% |
AfricanAmerican | 18881 | 19.3% |
Hispanic | 1984 | 2.0% |
Other | 1484 | 1.5% |
Asian | 625 | 0.6% |
Value | Count | Frequency (%) |
caucasian | 75079 | 76.6% |
africanamerican | 18881 | 19.3% |
hispanic | 1984 | 2.0% |
other | 1484 | 1.5% |
asian | 625 | 0.6% |
Most occurring characters
Value | Count | Frequency (%) |
a | 265608 | |
i | 117434 | |
n | 115450 | |
c | 114825 | |
s | 77688 | 7.9% |
C | 75079 | 7.6% |
u | 75079 | 7.6% |
r | 39246 | 4.0% |
A | 38387 | 3.9% |
e | 20365 | 2.1% |
Other values (7) | 46182 | 4.7% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 868409 | |
Uppercase Letter | 116934 | 11.9% |
Most frequent character per category
Value | Count | Frequency (%) |
a | 265608 | |
i | 117434 | |
n | 115450 | |
c | 114825 | |
s | 77688 | 8.9% |
u | 75079 | 8.6% |
r | 39246 | 4.5% |
e | 20365 | 2.3% |
f | 18881 | 2.2% |
m | 18881 | 2.2% |
Other values (3) | 4952 | 0.6% |
Value | Count | Frequency (%) |
C | 75079 | |
A | 38387 | |
H | 1984 | 1.7% |
O | 1484 | 1.3% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 985343 |
Most frequent character per script
Value | Count | Frequency (%) |
a | 265608 | |
i | 117434 | |
n | 115450 | |
c | 114825 | |
s | 77688 | 7.9% |
C | 75079 | 7.6% |
u | 75079 | 7.6% |
r | 39246 | 4.0% |
A | 38387 | 3.9% |
e | 20365 | 2.1% |
Other values (7) | 46182 | 4.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 985343 |
Most frequent character per block
Value | Count | Frequency (%) |
a | 265608 | |
i | 117434 | |
n | 115450 | |
c | 114825 | |
s | 77688 | 7.9% |
C | 75079 | 7.6% |
u | 75079 | 7.6% |
r | 39246 | 4.0% |
A | 38387 | 3.9% |
e | 20365 | 2.1% |
Other values (7) | 46182 | 4.7% |
gender
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 766.2 KiB |
Female | 52833 |
---|---|
Male | 45219 |
Unknown/Invalid | 1 |
Length
Max length | 15 |
---|---|
Median length | 6 |
Mean length | 5.077753868 |
Min length | 4 |
Characters and Unicode
Total characters | 497889 |
---|---|
Distinct characters | 16 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Female |
---|---|
2nd row | Female |
3rd row | Male |
4th row | Male |
5th row | Male |
Value | Count | Frequency (%) |
Female | 52833 | 53.9% |
Male | 45219 | 46.1% |
Unknown/Invalid | 1 | < 0.1% |
Value | Count | Frequency (%) |
female | 52833 | 53.9% |
male | 45219 | 46.1% |
unknown/invalid | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
e | 150885 | |
a | 98053 | |
l | 98053 | |
F | 52833 | 10.6% |
m | 52833 | 10.6% |
M | 45219 | 9.1% |
n | 4 | < 0.1% |
U | 1 | < 0.1% |
k | 1 | < 0.1% |
o | 1 | < 0.1% |
Other values (6) | 6 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 399834 | |
Uppercase Letter | 98054 | 19.7% |
Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Value | Count | Frequency (%) |
e | 150885 | |
a | 98053 | |
l | 98053 | |
m | 52833 | 13.2% |
n | 4 | < 0.1% |
k | 1 | < 0.1% |
o | 1 | < 0.1% |
w | 1 | < 0.1% |
v | 1 | < 0.1% |
i | 1 | < 0.1% |
Value | Count | Frequency (%) |
F | 52833 | |
M | 45219 | |
U | 1 | < 0.1% |
I | 1 | < 0.1% |
Value | Count | Frequency (%) |
/ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 497888 | |
Common | 1 | < 0.1% |
Most frequent character per script
Value | Count | Frequency (%) |
e | 150885 | |
a | 98053 | |
l | 98053 | |
F | 52833 | 10.6% |
m | 52833 | 10.6% |
M | 45219 | 9.1% |
n | 4 | < 0.1% |
U | 1 | < 0.1% |
k | 1 | < 0.1% |
o | 1 | < 0.1% |
Other values (5) | 5 | < 0.1% |
Value | Count | Frequency (%) |
/ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 497889 |
Most frequent character per block
Value | Count | Frequency (%) |
e | 150885 | |
a | 98053 | |
l | 98053 | |
F | 52833 | 10.6% |
m | 52833 | 10.6% |
M | 45219 | 9.1% |
n | 4 | < 0.1% |
U | 1 | < 0.1% |
k | 1 | < 0.1% |
o | 1 | < 0.1% |
Other values (6) | 6 | < 0.1% |
age
Categorical
Distinct | 10 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 766.2 KiB |
[70-80) | 25306 |
---|---|
[60-70) | 21809 |
[80-90) | 16702 |
[50-60) | 16697 |
[40-50) | 9265 |
Other values (5) | 8274 |
Length
Max length | 8 |
---|---|
Median length | 7 |
Mean length | 7.027046597 |
Min length | 6 |
Characters and Unicode
Total characters | 689023 |
---|---|
Distinct characters | 13 |
Distinct categories | 4 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | [10-20) |
---|---|
2nd row | [20-30) |
3rd row | [30-40) |
4th row | [40-50) |
5th row | [50-60) |
Value | Count | Frequency (%) |
[70-80) | 25306 | 25.8% |
[60-70) | 21809 | 22.2% |
[80-90) | 16702 | 17.0% |
[50-60) | 16697 | 17.0% |
[40-50) | 9265 | 9.4% |
[30-40) | 3548 | 3.6% |
[90-100) | 2717 | 2.8% |
[20-30) | 1478 | 1.5% |
[10-20) | 466 | 0.5% |
[0-10) | 65 | 0.1% |
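Because the values here are half-open bracket labels rather than numbers, any numeric summary needs a parsing step first. A minimal sketch, assuming the labels always follow the `[low-high)` pattern shown in the table; the names `bracket_bounds` and `midpoint` are illustrative, and using bracket midpoints is only a rough approximation of the underlying ages:

```python
import re
from typing import Tuple

# Bracket labels and counts copied from the frequency table above.
AGE_COUNTS = {
    "[70-80)": 25306,
    "[60-70)": 21809,
    "[80-90)": 16702,
    "[50-60)": 16697,
    "[40-50)": 9265,
    "[30-40)": 3548,
    "[90-100)": 2717,
    "[20-30)": 1478,
    "[10-20)": 466,
    "[0-10)": 65,
}

BRACKET = re.compile(r"\[(\d+)-(\d+)\)")


def bracket_bounds(label: str) -> Tuple[int, int]:
    """Parse a label such as '[70-80)' into its (low, high) bounds."""
    match = BRACKET.fullmatch(label)
    if match is None:
        raise ValueError(f"unexpected bracket label: {label!r}")
    return int(match.group(1)), int(match.group(2))


def midpoint(label: str) -> float:
    """Return the midpoint of a bracket, used as a stand-in for its members."""
    low, high = bracket_bounds(label)
    return (low + high) / 2


total = sum(AGE_COUNTS.values())
approx_mean_age = sum(midpoint(k) * c for k, c in AGE_COUNTS.items()) / total
print(f"rows: {total}, approximate mean age: {approx_mean_age:.1f}")
```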
Value | Count | Frequency (%) |
70-80 | 25306 | 25.8% |
60-70 | 21809 | 22.2% |
80-90 | 16702 | 17.0% |
50-60 | 16697 | 17.0% |
40-50 | 9265 | 9.4% |
30-40 | 3548 | 3.6% |
90-100 | 2717 | 2.8% |
20-30 | 1478 | 1.5% |
10-20 | 466 | 0.5% |
0-10 | 65 | 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 198823 | |
[ | 98053 | |
- | 98053 | |
) | 98053 | |
7 | 47115 | 6.8% |
8 | 42008 | 6.1% |
6 | 38506 | 5.6% |
5 | 25962 | 3.8% |
9 | 19419 | 2.8% |
4 | 12813 | 1.9% |
Other values (3) | 10218 | 1.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 394864 | |
Open Punctuation | 98053 | 14.2% |
Dash Punctuation | 98053 | 14.2% |
Close Punctuation | 98053 | 14.2% |
Most frequent character per category
Value | Count | Frequency (%) |
0 | 198823 | |
7 | 47115 | 11.9% |
8 | 42008 | 10.6% |
6 | 38506 | 9.8% |
5 | 25962 | 6.6% |
9 | 19419 | 4.9% |
4 | 12813 | 3.2% |
3 | 5026 | 1.3% |
1 | 3248 | 0.8% |
2 | 1944 | 0.5% |
Value | Count | Frequency (%) |
[ | 98053 |
Value | Count | Frequency (%) |
- | 98053 |
Value | Count | Frequency (%) |
) | 98053 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 689023 |
Most frequent character per script
Value | Count | Frequency (%) |
0 | 198823 | |
[ | 98053 | |
- | 98053 | |
) | 98053 | |
7 | 47115 | 6.8% |
8 | 42008 | 6.1% |
6 | 38506 | 5.6% |
5 | 25962 | 3.8% |
9 | 19419 | 2.8% |
4 | 12813 | 1.9% |
Other values (3) | 10218 | 1.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 689023 |
Most frequent character per block
Value | Count | Frequency (%) |
0 | 198823 | |
[ | 98053 | |
- | 98053 | |
) | 98053 | |
7 | 47115 | 6.8% |
8 | 42008 | 6.1% |
6 | 38506 | 5.6% |
5 | 25962 | 3.8% |
9 | 19419 | 2.8% |
4 | 12813 | 1.9% |
Other values (3) | 10218 | 1.5% |
admission_type_id
Real number (ℝ≥0)
Distinct | 8 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.025812571 |
---|---|
Minimum | 1 |
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 3 |
95-th percentile | 6 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.450117239 |
---|---|
Coefficient of variation (CV) | 0.7158200418 |
Kurtosis | 1.912034312 |
Mean | 2.025812571 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.587030686 |
Sum | 198637 |
Variance | 2.102840007 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 52178 | 53.2% |
3 | 18194 | 18.6% |
2 | 17543 | 17.9% |
6 | 5135 | 5.2% |
5 | 4661 | 4.8% |
8 | 312 | 0.3% |
7 | 20 | < 0.1% |
4 | 10 | < 0.1% |
Value | Count | Frequency (%) |
1 | 52178 | 53.2% |
2 | 17543 | 17.9% |
3 | 18194 | 18.6% |
4 | 10 | < 0.1% |
5 | 4661 | 4.8% |
Value | Count | Frequency (%) |
8 | 312 | 0.3% |
7 | 20 | < 0.1% |
6 | 5135 | 5.2% |
5 | 4661 | 4.8% |
4 | 10 | < 0.1% |
discharge_disposition_id
Real number (ℝ≥0)
Distinct | 26 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.753368076 |
---|---|
Minimum | 1 |
Maximum | 28 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 4 |
95-th percentile | 18 |
Maximum | 28 |
Range | 27 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 5.309391793 |
---|---|
Coefficient of variation (CV) | 1.414567313 |
Kurtosis | 5.838277851 |
Mean | 3.753368076 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.534770768 |
Sum | 368029 |
Variance | 28.18964121 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 57610 | 58.8% |
3 | 13564 | 13.8% |
6 | 12626 | 12.9% |
18 | 3624 | 3.7% |
2 | 2049 | 2.1% |
22 | 1970 | 2.0% |
11 | 1606 | 1.6% |
5 | 1127 | 1.1% |
25 | 941 | 1.0% |
4 | 756 | 0.8% |
Other values (16) | 2180 | 2.2% |
Value | Count | Frequency (%) |
1 | 57610 | 58.8% |
2 | 2049 | 2.1% |
3 | 13564 | 13.8% |
4 | 756 | 0.8% |
5 | 1127 | 1.1% |
Value | Count | Frequency (%) |
28 | 137 | 0.1% |
27 | 5 | < 0.1% |
25 | 941 | 1.0% |
24 | 48 | < 0.1% |
23 | 400 | 0.4% |
admission_source_id
Real number (ℝ≥0)
Distinct | 17 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.776692197 |
---|---|
Minimum | 1 |
Maximum | 25 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 7 |
Q3 | 7 |
95-th percentile | 17 |
Maximum | 25 |
Range | 24 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 4.071639573 |
---|---|
Coefficient of variation (CV) | 0.7048392807 |
Kurtosis | 1.733779297 |
Mean | 5.776692197 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.027278586 |
Sum | 566422 |
Variance | 16.57824881 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7 | 55951 | 57.1% |
1 | 28356 | 28.9% |
17 | 6602 | 6.7% |
4 | 2945 | 3.0% |
6 | 1893 | 1.9% |
2 | 1031 | 1.1% |
5 | 846 | 0.9% |
3 | 179 | 0.2% |
20 | 160 | 0.2% |
9 | 49 | < 0.1% |
Other values (7) | 41 | < 0.1% |
Value | Count | Frequency (%) |
1 | 28356 | 28.9% |
2 | 1031 | 1.1% |
3 | 179 | 0.2% |
4 | 2945 | 3.0% |
5 | 846 | 0.9% |
Value | Count | Frequency (%) |
25 | 2 | < 0.1% |
22 | 12 | < 0.1% |
20 | 160 | 0.2% |
17 | 6602 | 6.7% |
14 | 2 | < 0.1% |
time_in_hospital
Real number (ℝ≥0)
Distinct | 14 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.42197587 |
---|---|
Minimum | 1 |
Maximum | 14 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 4 |
Q3 | 6 |
95-th percentile | 11 |
Maximum | 14 |
Range | 13 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.993074463 |
---|---|
Coefficient of variation (CV) | 0.6768635902 |
Kurtosis | 0.817949426 |
Mean | 4.42197587 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 1.123569649 |
Sum | 433588 |
Variance | 8.958494742 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 17049 | 17.4% |
2 | 16441 | 16.8% |
1 | 13490 | 13.8% |
4 | 13434 | 13.7% |
5 | 9699 | 9.9% |
6 | 7320 | 7.5% |
7 | 5694 | 5.8% |
8 | 4276 | 4.4% |
9 | 2928 | 3.0% |
10 | 2287 | 2.3% |
Other values (4) | 5435 | 5.5% |
Value | Count | Frequency (%) |
1 | 13490 | 13.8% |
2 | 16441 | 16.8% |
3 | 17049 | 17.4% |
4 | 13434 | 13.7% |
5 | 9699 | 9.9% |
Value | Count | Frequency (%) |
14 | 1017 | 1.0% |
13 | 1185 | 1.2% |
12 | 1424 | 1.5% |
11 | 1809 | 1.8% |
10 | 2287 | 2.3% |
num_lab_procedures
Real number (ℝ≥0)
Distinct | 118 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 43.14807298 |
---|---|
Minimum | 1 |
Maximum | 132 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4 |
Q1 | 31 |
median | 44 |
Q3 | 57 |
95-th percentile | 73 |
Maximum | 132 |
Range | 131 |
Interquartile range (IQR) | 26 |
Descriptive statistics
Standard deviation | 19.71203294 |
---|---|
Coefficient of variation (CV) | 0.4568461945 |
Kurtosis | -0.2451976515 |
Mean | 43.14807298 |
Median Absolute Deviation (MAD) | 13 |
Skewness | -0.2355346172 |
Sum | 4230798 |
Variance | 388.5642427 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3096 | 3.2% |
43 | 2724 | 2.8% |
44 | 2414 | 2.5% |
45 | 2306 | 2.4% |
38 | 2131 | 2.2% |
46 | 2120 | 2.2% |
40 | 2113 | 2.2% |
41 | 2046 | 2.1% |
42 | 2031 | 2.1% |
47 | 2028 | 2.1% |
Other values (108) | 75044 | 76.5% |
Value | Count | Frequency (%) |
1 | 3096 | 3.2% |
2 | 1062 | 1.1% |
3 | 647 | 0.7% |
4 | 364 | 0.4% |
5 | 277 | 0.3% |
Value | Count | Frequency (%) |
132 | 1 | < 0.1% |
129 | 1 | < 0.1% |
126 | 1 | < 0.1% |
121 | 1 | < 0.1% |
120 | 1 | < 0.1% |
num_procedures
Real number (ℝ≥0)
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.350749085 |
---|---|
Minimum | 0 |
Maximum | 6 |
Zeros | 44574 |
Zeros (%) | 45.5% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 2 |
95-th percentile | 5 |
Maximum | 6 |
Range | 6 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.708505881 |
---|---|
Coefficient of variation (CV) | 1.264858071 |
Kurtosis | 0.8236555027 |
Mean | 1.350749085 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.303916989 |
Sum | 132445 |
Variance | 2.918992346 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 44574 | 45.5% |
1 | 20029 | 20.4% |
2 | 12383 | 12.6% |
3 | 9210 | 9.4% |
6 | 4811 | 4.9% |
4 | 4076 | 4.2% |
5 | 2970 | 3.0% |
Value | Count | Frequency (%) |
0 | 44574 | 45.5% |
1 | 20029 | 20.4% |
2 | 12383 | 12.6% |
3 | 9210 | 9.4% |
4 | 4076 | 4.2% |
Value | Count | Frequency (%) |
6 | 4811 | 4.9% |
5 | 2970 | 3.0% |
4 | 4076 | 4.2% |
3 | 9210 | 9.4% |
2 | 12383 | 12.6% |
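Since this variable takes only seven distinct values and all of them appear in the frequency table, the reported sum, mean, and variance can be rebuilt from the counts. A minimal sketch, assuming the counts above belong to `num_procedures` (the zero count matches the warning for that variable) and that the report uses the sample variance with an `n - 1` denominator, an assumption that appears to reproduce the reported figure:

```python
# Value -> count, copied from the frequency table above.
FREQUENCIES = {0: 44574, 1: 20029, 2: 12383, 3: 9210, 4: 4076, 5: 2970, 6: 4811}

n = sum(FREQUENCIES.values())
total = sum(value * count for value, count in FREQUENCIES.items())
mean = total / n

# Sum of squared deviations from the mean, weighted by each value's count.
squared_deviations = sum(
    count * (value - mean) ** 2 for value, count in FREQUENCIES.items()
)

# Assumption: sample variance (n - 1 in the denominator).
variance = squared_deviations / (n - 1)
std = variance ** 0.5

print(f"n={n} sum={total} mean={mean:.9f} variance={variance:.9f} std={std:.9f}")
```

The same approach does not work for variables whose tables collapse the tail into an "Other values" row, because the individual values are then no longer recoverable.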
num_medications
Real number (ℝ≥0)
Distinct | 75 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.11964958 |
---|---|
Minimum | 1 |
Maximum | 81 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 6 |
Q1 | 11 |
median | 15 |
Q3 | 20 |
95-th percentile | 31 |
Maximum | 81 |
Range | 80 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 8.108475918 |
---|---|
Coefficient of variation (CV) | 0.5030181257 |
Kurtosis | 3.493505174 |
Mean | 16.11964958 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 1.332695065 |
Sum | 1580580 |
Variance | 65.74738171 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
13 | 5885 | 6.0% |
12 | 5816 | 5.9% |
15 | 5621 | 5.7% |
11 | 5592 | 5.7% |
14 | 5520 | 5.6% |
16 | 5271 | 5.4% |
10 | 5167 | 5.3% |
17 | 4783 | 4.9% |
9 | 4711 | 4.8% |
18 | 4399 | 4.5% |
Other values (65) | 45288 | 46.2% |
Value | Count | Frequency (%) |
1 | 236 | 0.2% |
2 | 397 | 0.4% |
3 | 785 | 0.8% |
4 | 1269 | 1.3% |
5 | 1835 | 1.9% |
Value | Count | Frequency (%) |
81 | 1 | < 0.1% |
79 | 1 | < 0.1% |
75 | 2 | < 0.1% |
74 | 1 | < 0.1% |
72 | 3 | < 0.1% |
number_outpatient
Real number (ℝ≥0)
Distinct | 39 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.3763780812 |
---|---|
Minimum | 0 |
Maximum | 42 |
Zeros | 81680 |
Zeros (%) | 83.3% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2 |
Maximum | 42 |
Range | 42 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.28335944 |
---|---|
Coefficient of variation (CV) | 3.409761365 |
Kurtosis | 145.5912818 |
Mean | 0.3763780812 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 8.78170539 |
Sum | 36905 |
Variance | 1.647011452 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 81680 | 83.3% |
1 | 8340 | 8.5% |
2 | 3514 | 3.6% |
3 | 2005 | 2.0% |
4 | 1078 | 1.1% |
5 | 521 | 0.5% |
6 | 297 | 0.3% |
7 | 153 | 0.2% |
8 | 98 | 0.1% |
9 | 83 | 0.1% |
Other values (29) | 284 | 0.3% |
Value | Count | Frequency (%) |
0 | 81680 | 83.3% |
1 | 8340 | 8.5% |
2 | 3514 | 3.6% |
3 | 2005 | 2.0% |
4 | 1078 | 1.1% |
Value | Count | Frequency (%) |
42 | 1 | < 0.1% |
40 | 1 | < 0.1% |
39 | 1 | < 0.1% |
38 | 1 | < 0.1% |
37 | 1 | < 0.1% |
number_emergency
Real number (ℝ≥0)
Distinct | 33 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.2024619339 |
---|---|
Minimum | 0 |
Maximum | 76 |
Zeros | 86846 |
Zeros (%) | 88.6% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1 |
Maximum | 76 |
Range | 76 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.94289229 |
---|---|
Coefficient of variation (CV) | 4.657133675 |
Kurtosis | 1171.637565 |
Mean | 0.2024619339 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 22.71034016 |
Sum | 19852 |
Variance | 0.8890458704 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 86846 | 88.6% |
1 | 7550 | 7.7% |
2 | 2011 | 2.1% |
3 | 716 | 0.7% |
4 | 372 | 0.4% |
5 | 190 | 0.2% |
6 | 93 | 0.1% |
7 | 72 | 0.1% |
8 | 50 | 0.1% |
10 | 34 | < 0.1% |
Other values (23) | 119 | 0.1% |
Value | Count | Frequency (%) |
0 | 86846 | 88.6% |
1 | 7550 | 7.7% |
2 | 2011 | 2.1% |
3 | 716 | 0.7% |
4 | 372 | 0.4% |
Value | Count | Frequency (%) |
76 | 1 | < 0.1% |
64 | 1 | < 0.1% |
63 | 1 | < 0.1% |
54 | 1 | < 0.1% |
46 | 1 | < 0.1% |
number_inpatient
Real number (ℝ≥0)
Distinct | 20 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.6468644509 |
---|---|
Minimum | 0 |
Maximum | 21 |
Zeros | 64634 |
Zeros (%) | 65.9% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 3 |
Maximum | 21 |
Range | 21 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.271020492 |
---|---|
Coefficient of variation (CV) | 1.964894639 |
Kurtosis | 19.94332538 |
Mean | 0.6468644509 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.554829592 |
Sum | 63427 |
Variance | 1.61549309 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 64634 | 65.9% |
1 | 19067 | 19.4% |
2 | 7421 | 7.6% |
3 | 3346 | 3.4% |
4 | 1597 | 1.6% |
5 | 802 | 0.8% |
6 | 474 | 0.5% |
7 | 266 | 0.3% |
8 | 147 | 0.1% |
9 | 111 | 0.1% |
Other values (10) | 188 | 0.2% |
Value | Count | Frequency (%) |
0 | 64634 | 65.9% |
1 | 19067 | 19.4% |
2 | 7421 | 7.6% |
3 | 3346 | 3.4% |
4 | 1597 | 1.6% |
Value | Count | Frequency (%) |
21 | 1 | < 0.1% |
19 | 2 | < 0.1% |
18 | 1 | < 0.1% |
16 | 5 | < 0.1% |
15 | 8 | < 0.1% |
diag_1
Categorical
Distinct | 713 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 766.2 KiB |
428 | 6730 |
---|---|
414 | 6374 |
786 | 3900 |
410 | 3514 |
486 | 3412 |
Other values (708) | 74123 |
Length
Max length | 6 |
---|---|
Median length | 3 |
Mean length | 3.162167399 |
Min length | 1 |
Characters and Unicode
Total characters | 310060 |
---|---|
Distinct characters | 13 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 87 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 276 |
---|---|
2nd row | 648 |
3rd row | 8 |
4th row | 197 |
5th row | 414 |
Value | Count | Frequency (%) |
428 | 6730 | 6.9% |
414 | 6374 | 6.5% |
786 | 3900 | 4.0% |
410 | 3514 | 3.6% |
486 | 3412 | 3.5% |
427 | 2701 | 2.8% |
491 | 2210 | 2.3% |
715 | 2073 | 2.1% |
434 | 1983 | 2.0% |
780 | 1976 | 2.0% |
Other values (703) | 63180 | 64.4% |
Value | Count | Frequency (%) |
428 | 6730 | 6.9% |
414 | 6374 | 6.5% |
786 | 3900 | 4.0% |
410 | 3514 | 3.6% |
486 | 3412 | 3.5% |
427 | 2701 | 2.8% |
491 | 2210 | 2.3% |
715 | 2073 | 2.1% |
434 | 1983 | 2.0% |
780 | 1976 | 2.0% |
Other values (703) | 63180 | 64.4% |
Most occurring characters
Value | Count | Frequency (%) |
4 | 53841 | |
2 | 37924 | |
8 | 36767 | |
5 | 35509 | |
7 | 27739 | |
1 | 26791 | |
0 | 23459 | |
6 | 22453 | |
9 | 19352 | 6.2% |
3 | 16850 | 5.4% |
Other values (3) | 9375 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 300685 | |
Other Punctuation | 7774 | 2.5% |
Uppercase Letter | 1601 | 0.5% |
Most frequent character per category
Value | Count | Frequency (%) |
4 | 53841 | |
2 | 37924 | |
8 | 36767 | |
5 | 35509 | |
7 | 27739 | |
1 | 26791 | |
0 | 23459 | |
6 | 22453 | |
9 | 19352 | 6.4% |
3 | 16850 | 5.6% |
Value | Count | Frequency (%) |
V | 1600 | |
E | 1 | 0.1% |
Value | Count | Frequency (%) |
. | 7774 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 308459 | |
Latin | 1601 | 0.5% |
Most frequent character per script
Value | Count | Frequency (%) |
4 | 53841 | |
2 | 37924 | |
8 | 36767 | |
5 | 35509 | |
7 | 27739 | |
1 | 26791 | |
0 | 23459 | |
6 | 22453 | |
9 | 19352 | 6.3% |
3 | 16850 | 5.5% |
Value | Count | Frequency (%) |
V | 1600 | |
E | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 310060 |
Most frequent character per block
Value | Count | Frequency (%) |
4 | 53841 | |
2 | 37924 | |
8 | 36767 | |
5 | 35509 | |
7 | 27739 | |
1 | 26791 | |
0 | 23459 | |
6 | 22453 | |
9 | 19352 | 6.2% |
3 | 16850 | 5.4% |
Other values (3) | 9375 | 3.0% |
diag_2
Categorical
Distinct | 740 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 766.2 KiB |
428 | 6517 |
---|---|
276 | 6513 |
250 | 5412 |
427 | 4919 |
401 | 3613 |
Other values (735) | 71079 |
Length
Max length | 6 |
---|---|
Median length | 3 |
Mean length | 3.172274178 |
Min length | 1 |
Characters and Unicode
Total characters | 311051 |
---|---|
Distinct characters | 13 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 120 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 250.01 |
---|---|
2nd row | 250 |
3rd row | 250.43 |
4th row | 157 |
5th row | 411 |
Value | Count | Frequency (%) |
428 | 6517 | 6.6% |
276 | 6513 | 6.6% |
250 | 5412 | 5.5% |
427 | 4919 | 5.0% |
401 | 3613 | 3.7% |
496 | 3233 | 3.3% |
599 | 3225 | 3.3% |
403 | 2781 | 2.8% |
414 | 2574 | 2.6% |
411 | 2496 | 2.5% |
Other values (730) | 56770 | 57.9% |
Value | Count | Frequency (%) |
428 | 6517 | 6.6% |
276 | 6513 | 6.6% |
250 | 5412 | 5.5% |
427 | 4919 | 5.0% |
401 | 3613 | 3.7% |
496 | 3233 | 3.3% |
599 | 3225 | 3.3% |
403 | 2781 | 2.8% |
414 | 2574 | 2.6% |
411 | 2496 | 2.5% |
Other values (730) | 56770 | 57.9% |
Most occurring characters
Value | Count | Frequency (%) |
4 | 49919 | |
2 | 47802 | |
5 | 36582 | |
0 | 32414 | |
8 | 27942 | |
7 | 27749 | |
1 | 25358 | |
9 | 21289 | |
6 | 19412 | 6.2% |
3 | 13683 | 4.4% |
Other values (3) | 8901 | 2.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 302150 | |
Other Punctuation | 6450 | 2.1% |
Uppercase Letter | 2451 | 0.8% |
Most frequent character per category
Value | Count | Frequency (%) |
4 | 49919 | |
2 | 47802 | |
5 | 36582 | |
0 | 32414 | |
8 | 27942 | |
7 | 27749 | |
1 | 25358 | |
9 | 21289 | |
6 | 19412 | 6.4% |
3 | 13683 | 4.5% |
Value | Count | Frequency (%) |
V | 1735 | |
E | 716 |
Value | Count | Frequency (%) |
. | 6450 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 308600 | |
Latin | 2451 | 0.8% |
Most frequent character per script
Value | Count | Frequency (%) |
4 | 49919 | |
2 | 47802 | |
5 | 36582 | |
0 | 32414 | |
8 | 27942 | |
7 | 27749 | |
1 | 25358 | |
9 | 21289 | |
6 | 19412 | 6.3% |
3 | 13683 | 4.4% |
Value | Count | Frequency (%) |
V | 1735 | |
E | 716 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 311051 |
Most frequent character per block
Value | Count | Frequency (%) |
4 | 49919 | |
2 | 47802 | |
5 | 36582 | |
0 | 32414 | |
8 | 27942 | |
7 | 27749 | |
1 | 25358 | |
9 | 21289 | |
6 | 19412 | 6.2% |
3 | 13683 | 4.4% |
Other values (3) | 8901 | 2.9% |
diag_3
Categorical
Distinct | 786 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 766.2 KiB |
250 | 11208 |
---|---|
401 | 8090 |
276 | 5097 |
428 | 4491 |
427 | 3865 |
Other values (781) | 65302 |
Length
Max length | 6 |
---|---|
Median length | 3 |
Mean length | 3.142188408 |
Min length | 1 |
Characters and Unicode
Total characters | 308101 |
---|---|
Distinct characters | 13 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 126 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 255 |
---|---|
2nd row | V27 |
3rd row | 403 |
4th row | 250 |
5th row | 250 |
Value | Count | Frequency (%) |
250 | 11208 | 11.4% |
401 | 8090 | 8.3% |
276 | 5097 | 5.2% |
428 | 4491 | 4.6% |
427 | 3865 | 3.9% |
414 | 3567 | 3.6% |
496 | 2552 | 2.6% |
403 | 2322 | 2.4% |
585 | 1949 | 2.0% |
272 | 1910 | 1.9% |
Other values (776) | 53002 | 54.1% |
Value | Count | Frequency (%) |
250 | 11208 | 11.4% |
401 | 8090 | 8.3% |
276 | 5097 | 5.2% |
428 | 4491 | 4.6% |
427 | 3865 | 3.9% |
414 | 3567 | 3.6% |
496 | 2552 | 2.6% |
403 | 2322 | 2.4% |
585 | 1949 | 2.0% |
272 | 1910 | 1.9% |
Other values (776) | 53002 | 54.1% |
Most occurring characters
Value | Count | Frequency (%) |
2 | 50082 | |
4 | 48157 | |
5 | 40276 | |
0 | 38773 | |
7 | 25936 | |
1 | 24108 | |
8 | 23281 | |
9 | 16938 | 5.5% |
6 | 16112 | 5.2% |
3 | 13976 | 4.5% |
Other values (3) | 10462 | 3.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 297639 | |
Other Punctuation | 5488 | 1.8% |
Uppercase Letter | 4974 | 1.6% |
Most frequent character per category
Value | Count | Frequency (%) |
2 | 50082 | |
4 | 48157 | |
5 | 40276 | |
0 | 38773 | |
7 | 25936 | |
1 | 24108 | |
8 | 23281 | |
9 | 16938 | 5.7% |
6 | 16112 | 5.4% |
3 | 13976 | 4.7% |
Value | Count | Frequency (%) |
V | 3757 | |
E | 1217 | 24.5% |
Value | Count | Frequency (%) |
. | 5488 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 303127 | |
Latin | 4974 | 1.6% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 50082 | 16.5% |
4 | 48157 | 15.9% |
5 | 40276 | 13.3% |
0 | 38773 | 12.8% |
7 | 25936 | 8.6% |
1 | 24108 | 8.0% |
8 | 23281 | 7.7% |
9 | 16938 | 5.6% |
6 | 16112 | 5.3% |
3 | 13976 | 4.6% |
Latin
Value | Count | Frequency (%) |
V | 3757 | 75.5% |
E | 1217 | 24.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 308101 | 100.0% |
Most frequent character per block
Value | Count | Frequency (%) |
2 | 50082 | 16.3% |
4 | 48157 | 15.6% |
5 | 40276 | 13.1% |
0 | 38773 | 12.6% |
7 | 25936 | 8.4% |
1 | 24108 | 7.8% |
8 | 23281 | 7.6% |
9 | 16938 | 5.5% |
6 | 16112 | 5.2% |
3 | 13976 | 4.5% |
Other values (3) | 10462 | 3.4% |
number_diagnoses
Real number (ℝ≥0)
Distinct | 14 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.512059804 |
---|---|
Minimum | 3 |
Maximum | 16 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 766.2 KiB |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 4 |
Q1 | 6 |
median | 8 |
Q3 | 9 |
95-th percentile | 9 |
Maximum | 16 |
Range | 13 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 1.832496822 |
---|---|
Coefficient of variation (CV) | 0.2439406593 |
Kurtosis | -0.3451201144 |
Mean | 7.512059804 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.8175023377 |
Sum | 736580 |
Variance | 3.358044602 |
Monotonicity | Not monotonic |
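The descriptive statistics above are tied together by simple identities (mean = sum / count, CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, range = maximum - minimum). The following is a minimal sketch that re-derives them from the figures printed in this report; the variable names are illustrative, and it does not claim to reproduce how the profiling tool computes them internally.

```python
# Figures copied from the number_diagnoses section of this report.
n_obs = 98053            # Number of observations
total = 736580           # Sum
std = 1.832496822        # Standard deviation
q1, q3 = 6, 9            # Q1 and Q3
minimum, maximum = 3, 16

# Derived quantities, using the textbook definitions.
mean = total / n_obs
cv = std / mean          # coefficient of variation
variance = std ** 2
iqr = q3 - q1
value_range = maximum - minimum

print(f"mean={mean:.9f} cv={cv:.10f} variance={variance:.9f}")
print(f"iqr={iqr} range={value_range}")
```

Within rounding, the derived values agree with the reported mean (7.512059804), CV (0.2439406593), variance (3.358044602), IQR (3), and range (13).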
Value | Count | Frequency (%) |
9 | 48687 | 49.7% |
5 | 10592 | 10.8% |
8 | 10388 | 10.6% |
7 | 10179 | 10.4% |
6 | 9988 | 10.2% |
4 | 5361 | 5.5% |
3 | 2751 | 2.8% |
16 | 40 | < 0.1% |
13 | 16 | < 0.1% |
10 | 16 | < 0.1% |
Other values (4) | 35 | < 0.1% |
Value | Count | Frequency (%) |
3 | 2751 | 2.8% |
4 | 5361 | 5.5% |
5 | 10592 | 10.8% |
6 | 9988 | 10.2% |
7 | 10179 | 10.4% |
Value | Count | Frequency (%) |
16 | 40 | < 0.1% |
15 | 8 | < 0.1% |
14 | 7 | < 0.1% |
13 | 16 | < 0.1% |
12 | 9 | < 0.1% |
max_glu_serum
Categorical
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 766.2 KiB |
None | 92845 |
---|---|
Norm | 2532 |
>200 | 1449 |
>300 | 1227 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Characters and Unicode
Total characters | 392212 |
---|---|
Distinct characters | 10 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | None |
---|---|
2nd row | None |
3rd row | None |
4th row | None |
5th row | None |
Value | Count | Frequency (%) |
None | 92845 | 94.7% |
Norm | 2532 | 2.6% |
>200 | 1449 | 1.5% |
>300 | 1227 | 1.3% |
Value | Count | Frequency (%) |
none | 92845 | 94.7% |
norm | 2532 | 2.6% |
200 | 1449 | 1.5% |
300 | 1227 | 1.3% |
Most occurring characters
Value | Count | Frequency (%) |
N | 95377 | 24.3% |
o | 95377 | 24.3% |
n | 92845 | 23.7% |
e | 92845 | 23.7% |
0 | 5352 | 1.4% |
> | 2676 | 0.7% |
r | 2532 | 0.6% |
m | 2532 | 0.6% |
2 | 1449 | 0.4% |
3 | 1227 | 0.3% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 286131 | 73.0% |
Uppercase Letter | 95377 | 24.3% |
Decimal Number | 8028 | 2.0% |
Math Symbol | 2676 | 0.7% |
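The category totals above can be cross-checked from the value counts alone. The sketch below assumes the categories correspond to Unicode general categories as returned by Python's standard `unicodedata.category` (`Ll`, `Lu`, `Nd`, `Sm`); the value counts are copied from this report, and the variable names are illustrative.

```python
from collections import Counter
import unicodedata

# Value counts copied from the frequency table for this variable.
value_counts = {"None": 92845, "Norm": 2532, ">200": 1449, ">300": 1227}

# Weight each character's Unicode general category by the value's count.
category_totals = Counter()
for value, count in value_counts.items():
    for ch in value:
        category_totals[unicodedata.category(ch)] += count

total_chars = sum(category_totals.values())
for category, n in category_totals.most_common():
    print(f"{category}: {n} ({100 * n / total_chars:.1f}%)")
```

Under that assumption, `Ll` (Lowercase Letter), `Lu` (Uppercase Letter), `Nd` (Decimal Number), and `Sm` (Math Symbol) come out to 286131, 95377, 8028, and 2676 characters, matching the table, and the total matches the reported 392212 characters.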
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 95377 | 33.3% |
n | 92845 | 32.4% |
e | 92845 | 32.4% |
r | 2532 | 0.9% |
m | 2532 | 0.9% |
Decimal Number
Value | Count | Frequency (%) |
0 | 5352 | 66.7% |
2 | 1449 | 18.0% |
3 | 1227 | 15.3% |
Uppercase Letter
Value | Count | Frequency (%) |
N | 95377 | 100.0% |
Math Symbol
Value | Count | Frequency (%) |
> | 2676 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 381508 | 97.3% |
Common | 10704 | 2.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
N | 95377 | 25.0% |
o | 95377 | 25.0% |
n | 92845 | 24.3% |
e | 92845 | 24.3% |
r | 2532 | 0.7% |
m | 2532 | 0.7% |
Common
Value | Count | Frequency (%) |
0 | 5352 | 50.0% |
> | 2676 | 25.0% |
2 | 1449 | 13.5% |
3 | 1227 | 11.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 392212 | 100.0% |
Most frequent character per block
Value | Count | Frequency (%) |
N | 95377 | 24.3% |
o | 95377 | 24.3% |
n | 92845 | 23.7% |
e | 92845 | 23.7% |
0 | 5352 | 1.4% |
> | 2676 | 0.7% |
r | 2532 | 0.6% |
m | 2532 | 0.6% |
2 | 1449 | 0.4% |
3 | 1227 | 0.3% |
A1Cresult
Categorical
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 766.2 KiB |
None | 81860 |
---|---|
>8 | 7631 |
Norm | 4854 |
>7 | 3708 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.768716918 |
Min length | 2 |
Characters and Unicode
Total characters | 369534 |
---|---|
Distinct characters | 9 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
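The length and character totals above follow directly from the value counts of this variable. As a minimal sketch (the counts are copied from this report; the variable names are illustrative), the string-length statistics can be recomputed as follows:

```python
import statistics

# Value counts copied from the frequency table for this variable.
value_counts = {"None": 81860, ">8": 7631, "Norm": 4854, ">7": 3708}

# One length entry per observation.
lengths = []
for value, count in value_counts.items():
    lengths.extend([len(value)] * count)

total_characters = sum(lengths)
mean_length = total_characters / len(lengths)
median_length = statistics.median(lengths)

print(f"total={total_characters} mean={mean_length:.9f} "
      f"median={median_length} min={min(lengths)} max={max(lengths)}")
```

This reproduces the reported total of 369534 characters, a mean length of about 3.7687, a median and maximum of 4, and a minimum of 2.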
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | None |
---|---|
2nd row | None |
3rd row | None |
4th row | None |
5th row | None |
Value | Count | Frequency (%) |
None | 81860 | 83.5% |
>8 | 7631 | 7.8% |
Norm | 4854 | 5.0% |
>7 | 3708 | 3.8% |