Overview

Dataset statistics

Number of variables31
Number of observations1773
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory448.6 KiB
Average record size in memory259.1 B

Variable types

Categorical22
DateTime1
Text2
Numeric6

Dataset

Description교통 안전 데이터로서 어린이 등하교 및 초등학교 주변 사각지대, 위험요소 데이터 제공합니다. (출처: 공공데이터포털, https://www.data.go.kr/data/15076627/fileData.do)
Author백수빈
URLhttps://www.jejudatahub.net/data/view/data/875

Alerts

사고번호 has constant value ""Constant
피해운전자 has constant value ""Constant
데이터기준일자 has constant value ""Constant
사망자수 is highly imbalanced (84.6%)Imbalance
부상신고자 is highly imbalanced (61.0%)Imbalance
법규위반 is highly imbalanced (56.9%)Imbalance
노면상태 is highly imbalanced (75.3%)Imbalance
기상상태 is highly imbalanced (66.1%)Imbalance
가해운전자 is highly imbalanced (59.6%)Imbalance
가해운전_부상 is highly imbalanced (82.9%)Imbalance
경도 has unique valuesUnique
가해운전_나이 has 61 (3.4%) zerosZeros

Reproduction

Analysis started2023-12-11 20:04:35.713814
Analysis finished2023-12-11 20:04:36.215324
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사고번호
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
2020000000000000
1773 

Length

Max length16
Median length16
Mean length16
Min length16

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020000000000000
2nd row2020000000000000
3rd row2020000000000000
4th row2020000000000000
5th row2020000000000000

Common Values

ValueCountFrequency (%)
2020000000000000 1773
100.0%

Length

2023-12-12T05:04:36.307617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:36.409431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020000000000000 1773
100.0%
Distinct662
Distinct (%)37.3%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
Minimum2018-01-01 00:00:00
Maximum2019-12-31 00:00:00
2023-12-12T05:04:36.498021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T05:04:36.616132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

발생시간
Categorical

Distinct24
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
18:00
172 
19:00
152 
17:00
125 
20:00
 
106
21:00
 
102
Other values (19)
1116 

Length

Max length5
Median length5
Mean length4.7828539
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row6:00
2nd row20:00
3rd row3:00
4th row8:00
5th row14:00

Common Values

ValueCountFrequency (%)
18:00 172
 
9.7%
19:00 152
 
8.6%
17:00 125
 
7.1%
20:00 106
 
6.0%
21:00 102
 
5.8%
14:00 97
 
5.5%
15:00 95
 
5.4%
16:00 92
 
5.2%
11:00 89
 
5.0%
8:00 82
 
4.6%
Other values (14) 661
37.3%

Length

2023-12-12T05:04:36.734562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
18:00 172
 
9.7%
19:00 152
 
8.6%
17:00 125
 
7.1%
20:00 106
 
6.0%
21:00 102
 
5.8%
14:00 97
 
5.5%
15:00 95
 
5.4%
16:00 92
 
5.2%
11:00 89
 
5.0%
8:00 82
 
4.6%
Other values (14) 661
37.3%

발생요일
Categorical

Distinct7
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
284 
280 
273 
257 
253 
Other values (2)
426 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
284
16.0%
280
15.8%
273
15.4%
257
14.5%
253
14.3%
239
13.5%
187
10.5%

Length

2023-12-12T05:04:36.830315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:36.930012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
284
16.0%
280
15.8%
273
15.4%
257
14.5%
253
14.3%
239
13.5%
187
10.5%


Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
제주시
1329 
서귀포시
444 

Length

Max length4
Median length3
Mean length3.250423
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서귀포시
2nd row서귀포시
3rd row서귀포시
4th row서귀포시
5th row서귀포시

Common Values

ValueCountFrequency (%)
제주시 1329
75.0%
서귀포시 444
 
25.0%

Length

2023-12-12T05:04:37.246194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:37.323333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제주시 1329
75.0%
서귀포시 444
 
25.0%
Distinct68
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
2023-12-12T05:04:37.502820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.2639594
Min length2

Characters and Unicode

Total characters5787
Distinct characters72
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.3%

Sample

1st row하효동
2nd row동홍동
3rd row서귀동
4th row안덕면
5th row서귀동
ValueCountFrequency (%)
연동 214
 
12.1%
노형동 147
 
8.3%
이도이동 128
 
7.2%
서귀동 122
 
6.9%
일도이동 94
 
5.3%
애월읍 69
 
3.9%
이도일동 67
 
3.8%
동홍동 65
 
3.7%
한림읍 52
 
2.9%
삼도일동 50
 
2.8%
Other values (58) 765
43.1%
2023-12-12T05:04:37.809919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1489
25.7%
565
 
9.8%
507
 
8.8%
430
 
7.4%
273
 
4.7%
214
 
3.7%
151
 
2.6%
147
 
2.5%
147
 
2.5%
122
 
2.1%
Other values (62) 1742
30.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5787
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1489
25.7%
565
 
9.8%
507
 
8.8%
430
 
7.4%
273
 
4.7%
214
 
3.7%
151
 
2.6%
147
 
2.5%
147
 
2.5%
122
 
2.1%
Other values (62) 1742
30.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5787
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1489
25.7%
565
 
9.8%
507
 
8.8%
430
 
7.4%
273
 
4.7%
214
 
3.7%
151
 
2.6%
147
 
2.5%
147
 
2.5%
122
 
2.1%
Other values (62) 1742
30.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5787
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1489
25.7%
565
 
9.8%
507
 
8.8%
430
 
7.4%
273
 
4.7%
214
 
3.7%
151
 
2.6%
147
 
2.5%
147
 
2.5%
122
 
2.1%
Other values (62) 1742
30.1%

사고내용
Categorical

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
경상사고
869 
중상사고
711 
부상신고사고
124 
사망사고
 
69

Length

Max length6
Median length4
Mean length4.1398759
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사망사고
2nd row경상사고
3rd row중상사고
4th row사망사고
5th row경상사고

Common Values

ValueCountFrequency (%)
경상사고 869
49.0%
중상사고 711
40.1%
부상신고사고 124
 
7.0%
사망사고 69
 
3.9%

Length

2023-12-12T05:04:38.015637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:38.139893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상사고 869
49.0%
중상사고 711
40.1%
부상신고사고 124
 
7.0%
사망사고 69
 
3.9%

사망자수
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
0
1704 
1
 
67
2
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
0 1704
96.1%
1 67
 
3.8%
2 2
 
0.1%

Length

2023-12-12T05:04:38.274424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:38.469960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 1704
96.1%
1 67
 
3.8%
2 2
 
0.1%

중상자수
Categorical

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
0
1057 
1
702 
2
 
14

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 1057
59.6%
1 702
39.6%
2 14
 
0.8%

Length

2023-12-12T05:04:38.689542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:38.887924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 1057
59.6%
1 702
39.6%
2 14
 
0.8%

경상자수
Categorical

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
0
876 
1
867 
2
 
29
3
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
0 876
49.4%
1 867
48.9%
2 29
 
1.6%
3 1
 
0.1%

Length

2023-12-12T05:04:39.072257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:39.222245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 876
49.4%
1 867
48.9%
2 29
 
1.6%
3 1
 
0.1%

부상신고자
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
0
1637 
1
 
136

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
0 1637
92.3%
1 136
 
7.7%

Length

2023-12-12T05:04:39.397751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:39.527126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 1637
92.3%
1 136
 
7.7%

사고유형
Categorical

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
기타
892 
횡단중
663 
차도통행중
108 
길가장자리구역통행중
 
64
보도통행중
 
46

Length

Max length10
Median length2
Mean length2.9232939
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row횡단중
2nd row기타
3rd row기타
4th row기타
5th row횡단중

Common Values

ValueCountFrequency (%)
기타 892
50.3%
횡단중 663
37.4%
차도통행중 108
 
6.1%
길가장자리구역통행중 64
 
3.6%
보도통행중 46
 
2.6%

Length

2023-12-12T05:04:39.707765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:39.875395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 892
50.3%
횡단중 663
37.4%
차도통행중 108
 
6.1%
길가장자리구역통행중 64
 
3.6%
보도통행중 46
 
2.6%

법규위반
Categorical

IMBALANCE 

Distinct10
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
안전운전불이행
1162 
보행자보호의무위반
444 
기타
 
61
신호위반
 
53
중앙선침범
 
27
Other values (5)
 
26

Length

Max length9
Median length7
Mean length7.178229
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안전운전불이행
2nd row안전운전불이행
3rd row안전운전불이행
4th row안전운전불이행
5th row보행자보호의무위반

Common Values

ValueCountFrequency (%)
안전운전불이행 1162
65.5%
보행자보호의무위반 444
 
25.0%
기타 61
 
3.4%
신호위반 53
 
3.0%
중앙선침범 27
 
1.5%
과속 9
 
0.5%
교차로운행방법위반 8
 
0.5%
미분류 4
 
0.2%
불법유턴 3
 
0.2%
안전거리미확보 2
 
0.1%

Length

2023-12-12T05:04:40.082926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:40.315089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안전운전불이행 1162
65.5%
보행자보호의무위반 444
 
25.0%
기타 61
 
3.4%
신호위반 53
 
3.0%
중앙선침범 27
 
1.5%
과속 9
 
0.5%
교차로운행방법위반 8
 
0.5%
미분류 4
 
0.2%
불법유턴 3
 
0.2%
안전거리미확보 2
 
0.1%

노면상태
Categorical

IMBALANCE 

Distinct7
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
포장 - 건조
1538 
포장 - 젖음/습기
184 
포장 - 기타
 
42
포장 - 서리/결빙
 
4
포장 - 적설
 
3
Other values (2)
 
2

Length

Max length11
Median length7
Mean length7.320925
Min length7

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row포장 - 건조
2nd row포장 - 건조
3rd row포장 - 건조
4th row포장 - 젖음/습기
5th row포장 - 젖음/습기

Common Values

ValueCountFrequency (%)
포장 - 건조 1538
86.7%
포장 - 젖음/습기 184
 
10.4%
포장 - 기타 42
 
2.4%
포장 - 서리/결빙 4
 
0.2%
포장 - 적설 3
 
0.2%
비포장 - 건조 1
 
0.1%
비포장 - 젖음/습기 1
 
0.1%

Length

2023-12-12T05:04:40.534706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:40.688233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1773
33.3%
포장 1771
33.3%
건조 1539
28.9%
젖음/습기 185
 
3.5%
기타 42
 
0.8%
서리/결빙 4
 
0.1%
적설 3
 
0.1%
비포장 2
 
< 0.1%

기상상태
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
맑음
1496 
 
145
흐림
 
87
기타
 
34
 
8

Length

Max length2
Median length2
Mean length1.9137056
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row맑음
2nd row맑음
3rd row맑음
4th row
5th row

Common Values

ValueCountFrequency (%)
맑음 1496
84.4%
145
 
8.2%
흐림 87
 
4.9%
기타 34
 
1.9%
8
 
0.5%
안개 3
 
0.2%

Length

2023-12-12T05:04:40.867095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:41.001446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
맑음 1496
84.4%
145
 
8.2%
흐림 87
 
4.9%
기타 34
 
1.9%
8
 
0.5%
안개 3
 
0.2%

도로형태
Categorical

Distinct8
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size14.0 KiB
단일로 - 기타
956 
교차로 - 교차로횡단보도내
270 
교차로 - 교차로안
239 
교차로 - 교차로부근
181 
기타 - 기타
102 
Other values (3)
 
25

Length

Max length15
Median length8
Mean length9.4495206
Min length7

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row교차로 - 교차로부근
2nd row교차로 - 교차로부근
3rd row단일로 - 기타
4th row교차로 - 교차로안
5th row교차로 - 교차로횡단보도내

Common Values

ValueCountFrequency (%)
단일로 - 기타 956
53.9%
교차로 - 교차로횡단보도내 270
 
15.2%
교차로 - 교차로안 239
 
13.5%
교차로 - 교차로부근 181
 
10.2%
기타 - 기타 102
 
5.8%
주차장 - 주차장 23
 
1.3%
단일로 - 지하차도(도로)내 1
 
0.1%
미분류 - 미분류 1
 
0.1%

Length

2023-12-12T05:04:41.166322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:41.312258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/