Overview

Dataset statistics

Number of variables: 8
Number of observations: 1480
Missing cells: 7
Missing cells (%): 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 97.0 KiB
Average record size in memory: 67.1 B
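
The percentage and per-record figures follow from the raw counts. The sketch below recomputes them, assuming that 1 KiB is 1024 bytes and that the average record size is the total size divided by the number of observations; the variable names are illustrative only.

```python
# Figures copied from the dataset statistics.
n_variables = 8
n_observations = 1480
missing_cells = 7
total_size_kib = 97.0

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes, averaged over all observations.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Seven missing cells out of 11840 is about 0.06%, which rounds to the reported 0.1%.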

Variable types

Categorical: 3
Text: 2
Numeric: 3

Dataset

Description: 중앙분리대 개구부 현황 (status of median barrier openings)
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2655

Alerts

분리대폭(m) is highly overall correlated with 노선 (High correlation)
분리대높이(m) is highly overall correlated with 형태 (High correlation)
본부 is highly overall correlated with 노선 (High correlation)
노선 is highly overall correlated with 분리대폭(m) and 1 other field (High correlation)
형태 is highly overall correlated with 분리대높이(m) (High correlation)
분리대높이(m) is highly skewed (γ1 = 38.03685176) (Skewed)
분리대폭(m) has 81 (5.5%) zeros (Zeros)
분리대높이(m) has 35 (2.4%) zeros (Zeros)
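
A skewness as large as the one reported for 분리대높이(m) is typical when a single extreme value sits far from a tightly clustered bulk. The sketch below illustrates this with hypothetical data (not the dataset itself) and the simple moment-based skewness g1 = m3 / m2^1.5. The report's γ1 may use a bias-corrected estimator, so the numbers are not expected to match.

```python
from statistics import fmean


def skewness(values):
    """Moment-based (population) skewness g1 = m3 / m2**1.5."""
    n = len(values)
    mean = fmean(values)
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical heights in metres: most values at 1.27, a few lower ones.
typical = [1.27] * 80 + [0.81] * 10 + [0.8] * 6 + [0.0] * 4
# The same sample with one extreme value appended.
with_outlier = typical + [111.0]

print(skewness(typical))       # negative: the tail points toward the zeros
print(skewness(with_outlier))  # strongly positive: one value dominates
```

A value such as 111 in a height column measured in metres may be a data-entry error and is worth checking before modelling.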

Reproduction

Analysis started: 2024-01-09 20:05:36.623954
Analysis finished: 2024-01-09 20:05:38.555187
Duration: 1.93 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

본부 (regional headquarters)
Categorical

HIGH CORRELATION

Distinct: 8
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 11.7 KiB

부산경남본부: 219
대구경북본부: 219
강원본부: 211
충북본부: 188
수도권본부: 172
Other values (3): 471

Length

Max length: 6
Median length: 6
Mean length: 5.1527027
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 부산경남본부
2nd row: 부산경남본부
3rd row: 부산경남본부
4th row: 부산경남본부
5th row: 부산경남본부

Common Values

Value: Count (Frequency %)
부산경남본부: 219 (14.8%)
대구경북본부: 219 (14.8%)
강원본부: 211 (14.3%)
충북본부: 188 (12.7%)
수도권본부: 172 (11.6%)
대전충남본부: 170 (11.5%)
광주전남본부: 159 (10.7%)
전북본부: 142 (9.6%)
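
The frequency and length figures for 본부 can be recomputed from the counts alone. The following is a minimal sketch using only the Python standard library; it is not how the report itself was generated, and the variable names are illustrative.

```python
from statistics import fmean, median

# Counts copied from the Common Values table for 본부.
counts = {
    "부산경남본부": 219, "대구경북본부": 219, "강원본부": 211, "충북본부": 188,
    "수도권본부": 172, "대전충남본부": 170, "광주전남본부": 159, "전북본부": 142,
}

total = sum(counts.values())

# Share of each category, rounded to one decimal as in the table.
shares = {name: round(100 * c / total, 1) for name, c in counts.items()}

# One string length per record, to reproduce the Length statistics.
lengths = [len(name) for name, c in counts.items() for _ in range(c)]
mean_length = fmean(lengths)
median_length = median(lengths)

print(total, shares["부산경남본부"], round(mean_length, 7), median_length)
```

The counts sum to 1480, so the eight categories cover every observation.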

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed in the Common Values table]

지사 (branch office)
Text

Distinct: 56
Distinct (%): 3.8%
Missing: 0
Missing (%): 0.0%
Memory size: 11.7 KiB

Length

Max length: 6
Median length: 4
Mean length: 4.1310811
Min length: 4

Characters and Unicode

Total characters: 6114
Distinct characters: 60
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서울산지사
2nd row: 서울산지사
3rd row: 서울산지사
4th row: 서울산지사
5th row: 서울산지사

Value: Count (Frequency %)
서울산지사: 56 (3.8%)
대관령지사: 54 (3.6%)
군위지사: 45 (3.0%)
원주지사: 43 (2.9%)
대구지사: 42 (2.8%)
산청지사: 40 (2.7%)
충주지사: 39 (2.6%)
무주지사: 39 (2.6%)
진천지사: 38 (2.6%)
순천지사: 38 (2.6%)
Other values (46): 1046 (70.7%)

Most occurring characters

ValueCountFrequency (%)
1480
24.2%
1480
24.2%
353
 
5.8%
234
 
3.8%
145
 
2.4%
121
 
2.0%
107
 
1.8%
103
 
1.7%
100
 
1.6%
90
 
1.5%
Other values (50) 1901
31.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6114
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1480
24.2%
1480
24.2%
353
 
5.8%
234
 
3.8%
145
 
2.4%
121
 
2.0%
107
 
1.8%
103
 
1.7%
100
 
1.6%
90
 
1.5%
Other values (50) 1901
31.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6114
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1480
24.2%
1480
24.2%
353
 
5.8%
234
 
3.8%
145
 
2.4%
121
 
2.0%
107
 
1.8%
103
 
1.7%
100
 
1.6%
90
 
1.5%
Other values (50) 1901
31.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6114
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1480
24.2%
1480
24.2%
353
 
5.8%
234
 
3.8%
145
 
2.4%
121
 
2.0%
107
 
1.8%
103
 
1.7%
100
 
1.6%
90
 
1.5%
Other values (50) 1901
31.1%

노선 (route)
Categorical

HIGH CORRELATION

Distinct: 38
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 11.7 KiB

경부선: 153
통영대전선,중부선: 152
중앙선: 130
영동선: 124
서해안선: 121
Other values (33): 800

Length

Max length: 13
Median length: 11
Mean length: 5.9094595
Min length: 3

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 경부선
2nd row: 경부선
3rd row: 경부선
4th row: 경부선
5th row: 경부선

Common Values

Value: Count (Frequency %)
경부선: 153 (10.3%)
통영대전선,중부선: 152 (10.3%)
중앙선: 130 (8.8%)
영동선: 124 (8.4%)
서해안선: 121 (8.2%)
중부내륙선: 113 (7.6%)
논산천안선,호남선: 71 (4.8%)
남해선(순천부산): 49 (3.3%)
동해선: 47 (3.2%)
광주대구선: 46 (3.1%)
Other values (28): 474 (32.0%)

Length

[Figure: histogram of lengths of the category]

Value: Count (Frequency %)
경부선: 153 (10.0%)
통영대전선,중부선: 152 (9.9%)
중앙선: 130 (8.5%)
영동선: 124 (8.1%)
서해안선: 121 (7.9%)
중부내륙선: 113 (7.4%)
논산천안선,호남선: 71 (4.6%)
지선: 52 (3.4%)
남해선(순천부산: 49 (3.2%)
동해선: 47 (3.1%)
Other values (29): 520 (33.9%)

이정(km) (milepost, km)
Real number (ℝ)

Distinct: 1298
Distinct (%): 87.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 127.72258
Minimum: 0.3
Maximum: 414.8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 13.1 KiB

Quantile statistics

Minimum: 0.3
5th percentile: 6.2
Q1: 36.6775
Median: 100.715
Q3: 198.6
95th percentile: 336.02
Maximum: 414.8
Range: 414.5
Interquartile range (IQR): 161.9225

Descriptive statistics

Standard deviation: 105.74237
Coefficient of variation (CV): 0.82790657
Kurtosis: -0.53255635
Mean: 127.72258
Median Absolute Deviation (MAD): 73.9085
Skewness: 0.73422728
Sum: 189029.42
Variance: 11181.448
Monotonicity: Not monotonic
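
Several of the 이정(km) statistics are derived from one another, which gives a quick consistency check. The sketch below uses only the values printed in this report; the tolerances are assumptions chosen to absorb the rounding of the displayed figures.

```python
import math

# Values copied from the 이정(km) statistics.
n = 1480
minimum, maximum = 0.3, 414.8
q1, q3 = 36.6775, 198.6
mean, std = 127.72258, 105.74237
total = 189029.42

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance

assert math.isclose(value_range, 414.5, abs_tol=1e-9)
assert math.isclose(iqr, 161.9225, abs_tol=1e-9)
assert math.isclose(cv, 0.82790657, abs_tol=1e-6)
assert math.isclose(variance, 11181.448, abs_tol=1e-2)
assert math.isclose(total / n, mean, abs_tol=1e-4)
print("reported statistics are mutually consistent")
```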
[Figure: histogram with fixed size bins (bins=50)]

Common Values

Value: Count (Frequency %)
8.3: 4 (0.3%)
149.6: 4 (0.3%)
6.2: 4 (0.3%)
16.5: 4 (0.3%)
18.3: 3 (0.2%)
12.3: 3 (0.2%)
48.2: 3 (0.2%)
13.9: 3 (0.2%)
79.8: 3 (0.2%)
6.8: 3 (0.2%)
Other values (1288): 1446 (97.7%)

Minimum 10 values

Value: Count (Frequency %)
0.3: 1 (0.1%)
0.6: 1 (0.1%)
0.68: 1 (0.1%)
0.7: 2 (0.1%)
0.9: 1 (0.1%)
1.0: 2 (0.1%)
1.2: 2 (0.1%)
1.26: 1 (0.1%)
1.3: 1 (0.1%)
1.5: 2 (0.1%)

Maximum 10 values

Value: Count (Frequency %)
414.8: 1 (0.1%)
413.4: 1 (0.1%)
411.8: 1 (0.1%)
410.2: 1 (0.1%)
408.6: 1 (0.1%)
407.2: 1 (0.1%)
405.0: 1 (0.1%)
401.6: 1 (0.1%)
393.4: 1 (0.1%)
390.1: 1 (0.1%)

분리대폭(m) (median barrier width, m)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 29
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.5038851
Minimum: 0
Maximum: 111
Zeros: 81
Zeros (%): 5.5%
Negative: 0
Negative (%): 0.0%
Memory size: 13.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0.5
Median: 0.75
Q3: 3
95th percentile: 4
Maximum: 111
Range: 111
Interquartile range (IQR): 2.5

Descriptive statistics

Standard deviation: 6.7073981
Coefficient of variation (CV): 2.6787962
Kurtosis: 156.59758
Mean: 2.5038851
Median Absolute Deviation (MAD): 0.75
Skewness: 11.476002
Sum: 3705.75
Variance: 44.989189
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=29)]

Common Values

Value: Count (Frequency %)
3.0: 505 (34.1%)
0.5: 178 (12.0%)
0.61: 121 (8.2%)
0.6: 121 (8.2%)
0.3: 110 (7.4%)
0.0: 81 (5.5%)
4.0: 58 (3.9%)
0.75: 47 (3.2%)
1.0: 42 (2.8%)
0.65: 35 (2.4%)
Other values (19): 182 (12.3%)

Minimum 10 values

Value: Count (Frequency %)
0.0: 81 (5.5%)
0.1: 9 (0.6%)
0.25: 9 (0.6%)
0.3: 110 (7.4%)
0.4: 9 (0.6%)
0.45: 26 (1.8%)
0.5: 178 (12.0%)
0.58: 26 (1.8%)
0.6: 121 (8.2%)
0.61: 121 (8.2%)

Maximum 10 values

Value: Count (Frequency %)
111.0: 1 (0.1%)
93.0: 5 (0.3%)
20.0: 1 (0.1%)
19.0: 1 (0.1%)
17.0: 18 (1.2%)
16.0: 25 (1.7%)
15.0: 2 (0.1%)
12.0: 9 (0.6%)
10.0: 2 (0.1%)
6.0: 4 (0.3%)
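
The largest 분리대폭(m) values sit far above the bulk of the distribution. One common way to flag them, which the report itself does not apply, is Tukey's rule: treat anything beyond Q3 + 1.5 × IQR as a candidate outlier. The sketch below applies that rule to the quartiles and the ten largest values reported for this variable; the variable names are illustrative.

```python
# Quartiles copied from the quantile statistics for 분리대폭(m).
q1, q3 = 0.5, 3.0
iqr = q3 - q1

# Tukey's upper fence (a swapped-in technique, not part of the report).
upper_fence = q3 + 1.5 * iqr

# Ten largest distinct values and their counts, copied from the report.
largest = {111.0: 1, 93.0: 5, 20.0: 1, 19.0: 1, 17.0: 18,
           16.0: 25, 15.0: 2, 12.0: 9, 10.0: 2, 6.0: 4}

flagged = {value: count for value, count in largest.items()
           if value > upper_fence}
n_flagged = sum(flagged.values())

print(f"upper fence = {upper_fence} m, flagged records = {n_flagged}")
```

Because 6.0 is the tenth-largest distinct value and lies below the fence, the flagged rows here are all the records above the fence. Whether widths of 93 m or 111 m are genuine or entry errors cannot be decided from the report alone.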

분리대높이(m) (median barrier height, m)
Real number (ℝ)

HIGH CORRELATION, SKEWED, ZEROS

Distinct: 22
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.2462804
Minimum: 0
Maximum: 111
Zeros: 35
Zeros (%): 2.4%
Negative: 0
Negative (%): 0.0%
Memory size: 13.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 0.8
Q1: 1.27
Median: 1.27
Q3: 1.27
95th percentile: 1.27
Maximum: 111
Range: 111
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 2.8655914
Coefficient of variation (CV): 2.2993151
Kurtosis: 1457.8717
Mean: 1.2462804
Median Absolute Deviation (MAD): 0
Skewness: 38.036852
Sum: 1844.495
Variance: 8.211614
Monotonicity: Not monotonic
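
An IQR and MAD of exactly 0 follow from one value, 1.27, accounting for most of the records. The sketch below rebuilds a partial sample from the ten most frequent 분리대높이(m) values reported for this variable; the 25 records grouped as "Other values" are left out, which is an assumption of this example and does not change the median.

```python
from statistics import median

# Ten most frequent values and their counts, copied from the report.
common = {1.27: 1182, 0.81: 99, 0.8: 55, 0.0: 35, 1.2: 24,
          1.21: 22, 0.775: 17, 1.35: 8, 0.87: 7, 0.5: 6}

# Expand the counts into one entry per record (1455 of 1480 records).
values = [v for v, count in common.items() for _ in range(count)]

med = median(values)
# MAD: median of absolute deviations from the median.
mad = median(abs(v - med) for v in values)

print(med, mad)
```

With more than half of the records equal to the median, the robust spread measures collapse to zero, so the standard deviation here is driven almost entirely by the single 111.0 value.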
[Figure: histogram with fixed size bins (bins=22)]

Common Values

Value: Count (Frequency %)
1.27: 1182 (79.9%)
0.81: 99 (6.7%)
0.8: 55 (3.7%)
0.0: 35 (2.4%)
1.2: 24 (1.6%)
1.21: 22 (1.5%)
0.775: 17 (1.1%)
1.35: 8 (0.5%)
0.87: 7 (0.5%)
0.5: 6 (0.4%)
Other values (12): 25 (1.7%)

Minimum 10 values

Value: Count (Frequency %)
0.0: 35 (2.4%)
0.2: 2 (0.1%)
0.35: 1 (0.1%)
0.5: 6 (0.4%)
0.76: 6 (0.4%)
0.77: 1 (0.1%)
0.775: 17 (1.1%)
0.8: 55 (3.7%)
0.81: 99 (6.7%)
0.83: 3 (0.2%)

Maximum 10 values

Value: Count (Frequency %)
111.0: 1 (0.1%)
1.35: 8 (0.5%)
1.27: 1182 (79.9%)
1.25: 1 (0.1%)
1.24: 1 (0.1%)
1.21: 22 (1.5%)
1.2: 24 (1.6%)
1.17: 1 (0.1%)
1.0: 4 (0.3%)
0.9: 1 (0.1%)

형태 (barrier type)
Categorical

HIGH CORRELATION

Distinct: 11
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 11.7 KiB

가드레일-양면: 554
가드레일 개량형-양면: 507
가드레일: 229
기타: 78
가드레일-기타: 44
Other values (6): 68

Length

Max length: 16
Median length: 11
Mean length: 7.8060811
Min length: 2

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 가드레일-양면
2nd row: 가드레일 개량형-양면
3rd row: 가드레일 개량형-양면
4th row: 가드레일 개량형-양면
5th row: 가드레일 개량형-양면

Common Values

Value: Count (Frequency %)
가드레일-양면: 554 (37.4%)
가드레일 개량형-양면: 507 (34.3%)
가드레일: 229 (15.5%)
기타: 78 (5.3%)
가드레일-기타: 44 (3.0%)
선형 분리구간, 차단문: 27 (1.8%)
가드레일 특수형: 24 (1.6%)
PC방호벽: 11 (0.7%)
가드레일-단면: 3 (0.2%)
콘크리트방호벽: 2 (0.1%)

Length

[Figure: histogram of lengths of the category]

Value: Count (Frequency %)
가드레일: 760 (36.8%)
가드레일-양면: 554 (26.8%)
개량형-양면: 507 (24.6%)
기타: 78 (3.8%)
가드레일-기타: 44 (2.1%)
선형: 27 (1.3%)
분리구간,: 27 (1.3%)
차단문: 27 (1.3%)
특수형: 24 (1.2%)
pc방호벽: 11 (0.5%)
Other values (3): 6 (0.3%)

Distinct: 179
Distinct (%): 12.2%
Missing: 7
Missing (%): 0.5%
Memory size: 11.7 KiB

Length

Max length: 40
Median length: 32
Mean length: 11.202308
Min length: 2

Characters and Unicode

Total characters: 16501
Distinct characters: 159
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 62
Unique (%): 4.2%

Sample

1st row: 볼트풀기-형식2
2nd row: 볼트풀기-형식2
3rd row: 슬라이딩-형식1
4th row: 볼트풀기-형식1
5th row: 볼트풀기-형식2
ValueCountFrequency (%)
가드레일 563
 
16.7%
철거 338
 
10.0%
230
 
6.8%
지주 110
 
3.3%
지주철거 98
 
2.9%
슬라이딩 90
 
2.7%
볼트해체 83
 
2.5%
볼트 81
 
2.4%
볼트해체, 71
 
2.1%
레일 64
 
1.9%
Other values (182) 1642
48.7%

Most occurring characters

ValueCountFrequency (%)
1909
 
11.6%
876
 
5.3%
867
 
5.3%
741
 
4.5%
721
 
4.4%
702
 
4.3%
669
 
4.1%
4 663
 
4.0%
437
 
2.6%
414
 
2.5%
Other values (149) 8502
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11908
72.2%
Space Separator 1909
 
11.6%
Other Punctuation 955
 
5.8%
Decimal Number 891
 
5.4%
Open Punctuation 273
 
1.7%
Close Punctuation 271
 
1.6%
Dash Punctuation 95
 
0.6%
Math Symbol 86
 
0.5%
Uppercase Letter 78
 
0.5%
Lowercase Letter 33
 
0.2%
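
These category names are the Unicode general categories of each character. The sketch below shows how Python's standard `unicodedata` module assigns them, using one token from this variable. The trailing semicolon in the token is an assumption: the report shows the token as "볼트해체&#44", and the nearly equal counts of "&", "#" and ";" in the character tables suggest that commas in the source may have been stored as the HTML entity `&#44;`.

```python
import html
import unicodedata
from collections import Counter

# Assumed raw form of the token; the ";" is a reconstruction.
token = "볼트해체&#44;"

# Two-letter general category per character:
# Lo = Other Letter, Po = Other Punctuation, Nd = Decimal Number.
categories = Counter(unicodedata.category(ch) for ch in token)

# Decoding the entity recovers the comma that was probably intended.
decoded = html.unescape(token)

print(dict(categories))
print(decoded)
```

If that reading is right, unescaping the column before profiling would remove most of the Other Punctuation and Decimal Number counts that come from the entity rather than from the text.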

Most frequent character per category

Other Letter
ValueCountFrequency (%)
876
 
7.4%
867
 
7.3%
741
 
6.2%
721
 
6.1%
702
 
5.9%
669
 
5.6%
437
 
3.7%
414
 
3.5%
412
 
3.5%
409
 
3.4%
Other values (111) 5660
47.5%
Lowercase Letter
ValueCountFrequency (%)
g 6
18.2%
m 4
12.1%
i 4
12.1%
e 4
12.1%
t 4
12.1%
a 4
12.1%
n 2
 
6.1%
d 2
 
6.1%
l 2
 
6.1%
x 1
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
T 27
34.6%
C 18
23.1%
P 15
19.2%
X 11
14.1%
R 2
 
2.6%
J 2
 
2.6%
S 2
 
2.6%
A 1
 
1.3%
Decimal Number
ValueCountFrequency (%)
4 663
74.4%
2 100
 
11.2%
1 78
 
8.8%
6 29
 
3.3%
0 10
 
1.1%
3 6
 
0.7%
5 5
 
0.6%
Other Punctuation
ValueCountFrequency (%)
& 314
32.9%
; 311
32.6%
# 311
32.6%
* 11
 
1.2%
. 8
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 272
99.6%
[ 1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 270
99.6%
] 1
 
0.4%
Space Separator
ValueCountFrequency (%)
1909
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 95
100.0%
Math Symbol
ValueCountFrequency (%)
+ 86
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11908
72.2%
Common 4482
 
27.2%
Latin 111
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
876
 
7.4%
867
 
7.3%
741
 
6.2%
721
 
6.1%
702
 
5.9%
669
 
5.6%
437
 
3.7%
414
 
3.5%
412
 
3.5%
409
 
3.4%
Other values (111) 5660
47.5%
Common
ValueCountFrequency (%)
1909
42.6%
4 663
 
14.8%
& 314
 
7.0%
; 311
 
6.9%
# 311
 
6.9%
( 272
 
6.1%
) 270
 
6.0%
2 100
 
2.2%
- 95
 
2.1%
+ 86
 
1.9%
Other values (10) 151
 
3.4%
Latin
ValueCountFrequency (%)
T 27
24.3%
C 18
16.2%
P 15
13.5%
X 11
9.9%
g 6
 
5.4%
m 4
 
3.6%
i 4
 
3.6%
e 4
 
3.6%
t 4
 
3.6%
a 4
 
3.6%
Other values (8) 14
12.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11908
72.2%
ASCII 4593
 
27.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1909
41.6%
4 663
 
14.4%
& 314
 
6.8%
; 311
 
6.8%
# 311
 
6.8%
( 272
 
5.9%
) 270
 
5.9%
2 100
 
2.2%
- 95
 
2.1%
+ 86
 
1.9%
Other values (28) 262
 
5.7%
Hangul
ValueCountFrequency (%)
876
 
7.4%
867
 
7.3%
741
 
6.2%
721
 
6.1%
702
 
5.9%
669
 
5.6%
437
 
3.7%
414
 
3.5%
412
 
3.5%
409
 
3.4%
Other values (111) 5660
47.5%

Interactions

[Figure: interaction plots between numeric variables]