Overview

Dataset statistics

Number of variables: 13
Number of observations: 572
Missing cells: 114
Missing cells (%): 1.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 58.8 KiB
Average record size in memory: 105.2 B
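
The missing-cell percentage can be checked from the other figures in this list. The sketch below assumes the percentage is taken over all cells (variables times observations); the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
n_variables = 13
n_observations = 572
missing_cells = 114

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Under that assumption the result agrees with the 1.5% shown above.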

Variable types

Numeric: 1
Text: 3
Categorical: 8
DateTime: 1

Dataset

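Description: Facilities under the city's jurisdiction pursuant to the Facility Safety Act (시설물안전법), provided as data broken down by class (종별), grade (등급), and other attributes.
Author: 광주광역시 (Gwangju Metropolitan City)
URL: https://www.data.go.kr/data/15088717/fileData.do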

Alerts

시도 has constant value "광주광역시" [Constant]
데이터기준일자 has a constant value [Constant]
시설물구분 is highly overall correlated with 시설물종류 [High correlation]
시설물종류 is highly overall correlated with 시설물구분 [High correlation]
종별 is highly overall correlated with 등급 [High correlation]
등급 is highly overall correlated with 종별 [High correlation]
관리주체구분 is highly imbalanced (75.2%) [Imbalance]
읍면동 has 114 (19.9%) missing values [Missing]
연번 has unique values [Unique]
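
The 75.2% imbalance figure for 관리주체구분 can be reproduced from the class counts reported for that variable (532, 37 and 3). The sketch below uses one minus the normalised Shannon entropy; that this is the exact formula used by the profiling tool is an assumption, and imbalance_score is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum possible entropy) for class counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    max_entropy = math.log2(len(counts))
    # A single class has no entropy to normalise against; treat it as 0.
    return 1 - entropy / max_entropy if max_entropy > 0 else 0.0

# Class counts for 관리주체구분 as listed in its Common Values table.
score = imbalance_score([532, 37, 3])
print(f"{score:.1%}")
```

A perfectly balanced variable scores 0 under this definition, and a variable dominated by one class approaches 1.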

Reproduction

Analysis started: 2023-12-12 17:04:45.317573
Analysis finished: 2023-12-12 17:04:46.502563
Duration: 1.18 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
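
The duration follows directly from the two timestamps. A minimal check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-12 17:04:45.317573")
finished = datetime.fromisoformat("2023-12-12 17:04:46.502563")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```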

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 572
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 286.5
Minimum: 1
Maximum: 572
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 29.55
Q1: 143.75
Median: 286.5
Q3: 429.25
95-th percentile: 543.45
Maximum: 572
Range: 571
Interquartile range (IQR): 285.5
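
These quantiles are what linear interpolation gives for a gap-free sequence 1 to 572. That 연번 is exactly this sequence is an inference from the reported minimum, maximum, distinct count and sum, not something the report states outright. A sketch with the standard library, where the "inclusive" method interpolates between the observed minimum and maximum:

```python
import statistics

# Assumption: 연번 is the integer sequence 1..572 with no gaps.
values = list(range(1, 573))

# "inclusive" interpolates linearly between the observed min and max.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]
iqr = q3 - q1

print(p5, q1, median, q3, p95, iqr)
```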

Descriptive statistics

Standard deviation: 165.26645
Coefficient of variation (CV): 0.57684625
Kurtosis: -1.2
Mean: 286.5
Median Absolute Deviation (MAD): 143
Skewness: 0
Sum: 163878
Variance: 27313
Monotonicity: Strictly increasing
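
Several of these values can be re-derived by hand. The sketch below again assumes 연번 is the sequence 1 to 572, and it assumes the variance uses the sample (n - 1) denominator, which is the choice that matches the reported 27313.

```python
import statistics

# Assumption: 연번 is the integer sequence 1..572 with no gaps.
values = list(range(1, 573))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std, 5), round(cv, 8), mad)
```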
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.2%
386 | 1 | 0.2%
380 | 1 | 0.2%
381 | 1 | 0.2%
382 | 1 | 0.2%
383 | 1 | 0.2%
384 | 1 | 0.2%
385 | 1 | 0.2%
387 | 1 | 0.2%
378 | 1 | 0.2%
Other values (562) | 562 | 98.3%

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Value | Count | Frequency (%)
572 | 1 | 0.2%
571 | 1 | 0.2%
570 | 1 | 0.2%
569 | 1 | 0.2%
568 | 1 | 0.2%
567 | 1 | 0.2%
566 | 1 | 0.2%
565 | 1 | 0.2%
564 | 1 | 0.2%
563 | 1 | 0.2%

Distinct: 565
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 28
Median length: 24
Mean length: 7.0979021
Min length: 2

Characters and Unicode

Total characters: 4060
Distinct characters: 297
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
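
As an illustration of this kind of character-property analysis, the sketch below counts Unicode general categories over the five sample rows listed for this variable. It uses Python's standard unicodedata module, which returns two-letter category codes (Lo for Other Letter, Nd for Decimal Number, Ps and Pe for Open and Close Punctuation, and so on). It is not the profiling tool's own implementation.

```python
import unicodedata
from collections import Counter

# The five sample rows shown for this variable.
samples = ["(구)산동교", "5.18기념문화관", "5.18민주화운동기록관", "518자유관", "B(과)선계제2"]

# Tally the general category of every character across all sample rows.
category_counts = Counter(
    unicodedata.category(ch) for text in samples for ch in text
)

print(category_counts.most_common())
```

Hangul syllables fall under Lo (Other Letter), which is why that category dominates both this small sample and the full column.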

Unique

Unique: 558
Unique (%): 97.6%

Sample

1st row: (구)산동교
2nd row: 5.18기념문화관
3rd row: 5.18민주화운동기록관
4th row: 518자유관
5th row: B(과)선계제2

Value | Count | Frequency (%)
광주도시철도 | 19 | 2.6%
터널 | 11 | 1.5%
광주광역시 | 9 | 1.3%
1호선 | 7 | 1.0%
절토사면 | 6 | 0.8%
역사 | 6 | 0.8%
광주광역시청소년수련원 | 4 | 0.6%
좌로 | 4 | 0.6%
배수통문 | 4 | 0.6%
행정복지센터 | 4 | 0.6%
Other values (610) | 644 | 89.7%

Most occurring characters

ValueCountFrequency (%)
297
 
7.3%
146
 
3.6%
109
 
2.7%
109
 
2.7%
1 101
 
2.5%
89
 
2.2%
85
 
2.1%
83
 
2.0%
71
 
1.7%
68
 
1.7%
Other values (287) 2902
71.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3412 | 84.0%
Decimal Number | 285 | 7.0%
Space Separator | 146 | 3.6%
Close Punctuation | 48 | 1.2%
Open Punctuation | 48 | 1.2%
Uppercase Letter | 40 | 1.0%
Dash Punctuation | 28 | 0.7%
Other Punctuation | 24 | 0.6%
Math Symbol | 20 | 0.5%
Lowercase Letter | 9 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
297
 
8.7%
109
 
3.2%
109
 
3.2%
89
 
2.6%
85
 
2.5%
83
 
2.4%
71
 
2.1%
68
 
2.0%
63
 
1.8%
61
 
1.8%
Other values (252) 2377
69.7%
Decimal Number
Value | Count | Frequency (%)
1 | 101 | 35.4%
2 | 56 | 19.6%
0 | 22 | 7.7%
3 | 22 | 7.7%
6 | 16 | 5.6%
8 | 16 | 5.6%
5 | 16 | 5.6%
7 | 14 | 4.9%
4 | 14 | 4.9%
9 | 8 | 2.8%
Uppercase Letter
Value | Count | Frequency (%)
C | 5 | 12.5%
T | 5 | 12.5%
A | 5 | 12.5%
B | 4 | 10.0%
P | 4 | 10.0%
I | 4 | 10.0%
K | 4 | 10.0%
M | 4 | 10.0%
E | 3 | 7.5%
R | 2 | 5.0%
Lowercase Letter
Value | Count | Frequency (%)
s | 2 | 22.2%
t | 2 | 22.2%
a | 2 | 22.2%
e | 1 | 11.1%
p | 1 | 11.1%
y | 1 | 11.1%
Other Punctuation
Value | Count | Frequency (%)
. | 15 | 62.5%
, | 8 | 33.3%
/ | 1 | 4.2%
Math Symbol
Value | Count | Frequency (%)
+ | 12 | 60.0%
~ | 8 | 40.0%
Space Separator
Value | Count | Frequency (%)
(space) | 146 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 48 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 48 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 28 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3412 | 84.0%
Common | 599 | 14.8%
Latin | 49 | 1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
297
 
8.7%
109
 
3.2%
109
 
3.2%
89
 
2.6%
85
 
2.5%
83
 
2.4%
71
 
2.1%
68
 
2.0%
63
 
1.8%
61
 
1.8%
Other values (252) 2377
69.7%
Common
Value | Count | Frequency (%)
(space) | 146 | 24.4%
1 | 101 | 16.9%
2 | 56 | 9.3%
) | 48 | 8.0%
( | 48 | 8.0%
- | 28 | 4.7%
0 | 22 | 3.7%
3 | 22 | 3.7%
6 | 16 | 2.7%
8 | 16 | 2.7%
Other values (9) | 96 | 16.0%
Latin
Value | Count | Frequency (%)
C | 5 | 10.2%
T | 5 | 10.2%
A | 5 | 10.2%
B | 4 | 8.2%
P | 4 | 8.2%
I | 4 | 8.2%
K | 4 | 8.2%
M | 4 | 8.2%
E | 3 | 6.1%
s | 2 | 4.1%
Other values (6) | 9 | 18.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3412 | 84.0%
ASCII | 648 | 16.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
297
 
8.7%
109
 
3.2%
109
 
3.2%
89
 
2.6%
85
 
2.5%
83
 
2.4%
71
 
2.1%
68
 
2.0%
63
 
1.8%
61
 
1.8%
Other values (252) 2377
69.7%
ASCII
Value | Count | Frequency (%)
(space) | 146 | 22.5%
1 | 101 | 15.6%
2 | 56 | 8.6%
) | 48 | 7.4%
( | 48 | 7.4%
- | 28 | 4.3%
0 | 22 | 3.4%
3 | 22 | 3.4%
6 | 16 | 2.5%
8 | 16 | 2.5%
Other values (25) | 145 | 22.4%

시설물구분
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB
교량: 286
건축물: 139
하천: 66
터널: 43
옹벽: 16
Other values (5): 22

Length

Max length: 4
Median length: 2
Mean length: 2.3041958
Min length: 1

Unique

Unique: 2
Unique (%): 0.3%

Sample

1st row: 교량
2nd row: 건축물
3rd row: 건축물
4th row: 건축물
5th row: 교량

Common Values

Value | Count | Frequency (%)
교량 | 286 | 50.0%
건축물 | 139 | 24.3%
하천 | 66 | 11.5%
터널 | 43 | 7.5%
옹벽 | 16 | 2.8%
상하수도 | 11 | 1.9%
절토사면 | 7 | 1.2%
(no label) | 2 | 0.3%
공동구 | 1 | 0.2%
기타 | 1 | 0.2%

Length

Histogram of lengths of the category

Common Values (Plot)


시설물종류
Categorical

HIGH CORRELATION 

Distinct: 22
Distinct (%): 3.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB
도로교량: 224
다중이용건축물: 99
육교: 53
수문 및 통문: 51
철도역시설: 20
Other values (17): 125

Length

Max length: 8
Median length: 4
Mean length: 4.6800699
Min length: 2

Unique

Unique: 3
Unique (%): 0.5%

Sample

1st row: 도로교량
2nd row: 다중이용건축물
3rd row: 다중이용건축물
4th row: 다중이용건축물
5th row: 도로교량

Common Values

Value | Count | Frequency (%)
도로교량 | 224 | 39.2%
다중이용건축물 | 99 | 17.3%
육교 | 53 | 9.3%
수문 및 통문 | 51 | 8.9%
철도역시설 | 20 | 3.5%
도로터널 | 16 | 2.8%
지하차도 | 15 | 2.6%
도로옹벽 | 15 | 2.6%
철도터널 | 12 | 2.1%
기타 | 11 | 1.9%
Other values (12) | 56 | 9.8%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
도로교량 | 224 | 33.2%
다중이용건축물 | 99 | 14.7%
육교 | 53 | 7.9%
수문 | 51 | 7.6%
및 | 51 | 7.6%
통문 | 51 | 7.6%
철도역시설 | 20 | 3.0%
도로터널 | 16 | 2.4%
지하차도 | 15 | 2.2%
도로옹벽 | 15 | 2.2%
Other values (14) | 79 | 11.7%
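
The percentages in this last table are lower than in the Common Values table because it appears to count whitespace-separated words rather than whole values: the 51 rows holding "수문 및 통문" contribute three words each. The arithmetic below is a sketch of that reading. It assumes every other value is a single word, which is consistent with the counts in the table summing to 674.

```python
# Counts taken from the 시설물종류 tables above.
n_rows = 572
multiword_rows = 51      # rows whose value is the three-word "수문 및 통문"
top_count = 224          # rows (and words) equal to 도로교량

# Assumption: all other values are single words, so each multi-word row
# adds two extra words on top of the one word per row.
n_words = n_rows + 2 * multiword_rows

pct_of_rows = 100 * top_count / n_rows
pct_of_words = 100 * top_count / n_words

print(n_words, round(pct_of_rows, 1), round(pct_of_words, 1))
```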

종별
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB
3종: 230
2종: 223
1종: 118
기타: 1

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 1
Unique (%): 0.2%

Sample

1st row: 기타
2nd row: 2종
3rd row: 2종
4th row: 3종
5th row: 2종

Common Values

Value | Count | Frequency (%)
3종 | 230 | 40.2%
2종 | 223 | 39.0%
1종 | 118 | 20.6%
기타 | 1 | 0.2%

Length

Histogram of lengths of the category

Common Values (Plot)


등급
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB
B등급: 390
C등급: 94
A등급: 85
D등급: 2
E등급: 1

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 1
Unique (%): 0.2%

Sample

1st row: E등급
2nd row: B등급
3rd row: B등급
4th row: B등급
5th row: B등급

Common Values

Value | Count | Frequency (%)
B등급 | 390 | 68.2%
C등급 | 94 | 16.4%
A등급 | 85 | 14.9%
D등급 | 2 | 0.3%
E등급 | 1 | 0.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
b등급 | 390 | 68.2%
c등급 | 94 | 16.4%
a등급 | 85 | 14.9%
d등급 | 2 | 0.3%
e등급 | 1 | 0.2%

Distinct: 76
Distinct (%): 13.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 19
Median length: 17
Mean length: 8.8653846
Min length: 5

Characters and Unicode

Total characters: 5071
Distinct characters: 133
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 40
Unique (%): 7.0%

Sample

1st row: 북구건설과도로계
2nd row: 광주광역시5.18기념문화센터
3rd row: 5.18민주화운동기록관
4th row: 광주광역시5.18기념문화센터
5th row: 광산구청 건설과 도로관리팀

Value | Count | Frequency (%)
종합건설본부 | 184 | 21.0%
건설과 | 105 | 12.0%
광산구청 | 73 | 8.3%
남구청 | 44 | 5.0%
동구청 | 37 | 4.2%
도로관리팀 | 35 | 4.0%
서구청 | 33 | 3.8%
북구건설과도로계 | 32 | 3.7%
도로관리계 | 31 | 3.5%
광주광역시 | 29 | 3.3%
Other values (72) | 272 | 31.1%

Most occurring characters

ValueCountFrequency (%)
345
 
6.8%
340
 
6.7%
327
 
6.4%
303
 
6.0%
253
 
5.0%
227
 
4.5%
223
 
4.4%
209
 
4.1%
207
 
4.1%
200
 
3.9%
Other values (123) 2437
48.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4663 | 92.0%
Space Separator | 303 | 6.0%
Decimal Number | 59 | 1.2%
Close Punctuation | 22 | 0.4%
Open Punctuation | 21 | 0.4%
Other Punctuation | 3 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
345
 
7.4%
340
 
7.3%
327
 
7.0%
253
 
5.4%
227
 
4.9%
223
 
4.8%
209
 
4.5%
207
 
4.4%
200
 
4.3%
196
 
4.2%
Other values (114) 2136
45.8%
Decimal Number
Value | Count | Frequency (%)
1 | 28 | 47.5%
2 | 17 | 28.8%
3 | 8 | 13.6%
8 | 3 | 5.1%
5 | 3 | 5.1%
Space Separator
Value | Count | Frequency (%)
(space) | 303 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 22 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 21 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
. | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4663 | 92.0%
Common | 408 | 8.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
345
 
7.4%
340
 
7.3%
327
 
7.0%
253
 
5.4%
227
 
4.9%
223
 
4.8%
209
 
4.5%
207
 
4.4%
200
 
4.3%
196
 
4.2%
Other values (114) 2136
45.8%
Common
Value | Count | Frequency (%)
(space) | 303 | 74.3%
1 | 28 | 6.9%
) | 22 | 5.4%
( | 21 | 5.1%
2 | 17 | 4.2%
3 | 8 | 2.0%
8 | 3 | 0.7%
. | 3 | 0.7%
5 | 3 | 0.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4663 | 92.0%
ASCII | 408 | 8.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
345
 
7.4%
340
 
7.3%
327
 
7.0%
253
 
5.4%
227
 
4.9%
223
 
4.8%
209
 
4.5%
207
 
4.4%
200
 
4.3%
196
 
4.2%
Other values (114) 2136
45.8%
ASCII
Value | Count | Frequency (%)
(space) | 303 | 74.3%
1 | 28 | 6.9%
) | 22 | 5.4%
( | 21 | 5.1%
2 | 17 | 4.2%
3 | 8 | 2.0%
8 | 3 | 0.7%
. | 3 | 0.7%
5 | 3 | 0.7%

관리주체구분
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB
지자체: 532
공공기관: 37
공공: 3

Length

Max length: 4
Median length: 3
Mean length: 3.0594406
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지자체
2nd row: 지자체
3rd row: 공공기관
4th row: 지자체
5th row: 지자체

Common Values

Value | Count | Frequency (%)
지자체 | 532 | 93.0%
공공기관 | 37 | 6.5%
공공 | 3 | 0.5%

Length

Histogram of lengths of the category

Common Values (Plot)


시도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB
광주광역시: 572

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 광주광역시
2nd row: 광주광역시
3rd row: 광주광역시
4th row: 광주광역시
5th row: 광주광역시

Common Values

Value | Count | Frequency (%)
광주광역시 | 572 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)


시군구
Categorical

Distinct: 5
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB
광산구: 169
북구: 134
서구: 109
동구: 83
남구: 77

Length

Max length: 3
Median length: 2
Mean length: 2.2954545
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 북구
2nd row: 서구
3rd row: 동구
4th row: 서구
5th row: 광산구

Common Values

Value | Count | Frequency (%)
광산구 | 169 | 29.5%
북구 | 134 | 23.4%
서구 | 109 | 19.1%
동구 | 83 | 14.5%
남구 | 77 | 13.5%

Length

Histogram of lengths of the category

Common Values (Plot)


읍면동
Text

MISSING 

Distinct: 193
Distinct (%): 42.1%
Missing: 114
Missing (%): 19.9%
Memory size: 4.6 KiB
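
The two percentages here use different denominators. The missing share is taken over all 572 rows, while the distinct share matches 193 divided by the 458 non-missing rows. The second point is inferred from the numbers rather than stated in the report. A quick check:

```python
# Figures copied from the 읍면동 summary above.
n_rows = 572
n_missing = 114
n_distinct = 193

missing_pct = 100 * n_missing / n_rows
# Assumption: distinct (%) is computed over non-missing rows only.
distinct_pct = 100 * n_distinct / (n_rows - n_missing)

print(round(missing_pct, 1), round(distinct_pct, 1))
```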