Overview

Dataset statistics

Number of variables7
Number of observations4561
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory262.9 KiB
Average record size in memory59.0 B

Variable types

Text2
Numeric3
Categorical1
DateTime1

Dataset

Description교통 안전 데이터로서 어린이 등하교 및 초등학교 주변 사각지대, 위험요소 데이터 제공합니다. (출처: 공공데이터포털, https://www.data.go.kr/data/15076627/fileData.do)
Author백수빈
URLhttps://www.jejudatahub.net/data/view/data/875

Alerts

데이터기준일자 has constant value ""Constant
요청사항 is highly imbalanced (91.6%)Imbalance
분석아이디 has unique valuesUnique

Reproduction

Analysis started2023-12-11 20:04:20.033270
Analysis finished2023-12-11 20:04:21.653623
Duration1.62 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct114
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size35.8 KiB
2023-12-12T05:04:21.898456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length6
Mean length6.2587152
Min length6

Characters and Unicode

Total characters28546
Distinct characters122
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가마초등학교
2nd row가마초등학교
3rd row가마초등학교
4th row가마초등학교
5th row가마초등학교
ValueCountFrequency (%)
이도초등학교 210
 
4.6%
외도초등학교 207
 
4.5%
한라초등학교 126
 
2.8%
도남초등학교 121
 
2.7%
아라초등학교 114
 
2.5%
인화초등학교 112
 
2.5%
노형초등학교 111
 
2.4%
삼양초등학교 110
 
2.4%
도련초등학교 110
 
2.4%
동홍초등학교 108
 
2.4%
Other values (104) 3232
70.9%
2023-12-12T05:04:22.377064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4674
16.4%
4561
16.0%
4503
15.8%
4503
15.8%
828
 
2.9%
402
 
1.4%
400
 
1.4%
400
 
1.4%
381
 
1.3%
356
 
1.2%
Other values (112) 7538
26.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28546
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4674
16.4%
4561
16.0%
4503
15.8%
4503
15.8%
828
 
2.9%
402
 
1.4%
400
 
1.4%
400
 
1.4%
381
 
1.3%
356
 
1.2%
Other values (112) 7538
26.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28546
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4674
16.4%
4561
16.0%
4503
15.8%
4503
15.8%
828
 
2.9%
402
 
1.4%
400
 
1.4%
400
 
1.4%
381
 
1.3%
356
 
1.2%
Other values (112) 7538
26.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28546
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4674
16.4%
4561
16.0%
4503
15.8%
4503
15.8%
828
 
2.9%
402
 
1.4%
400
 
1.4%
400
 
1.4%
381
 
1.3%
356
 
1.2%
Other values (112) 7538
26.4%

위도
Real number (ℝ)

Distinct4546
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.439092
Minimum33.219069
Maximum33.557238
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size40.2 KiB
2023-12-12T05:04:22.549274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33.219069
5-th percentile33.249662
Q133.40911
median33.487379
Q333.502918
95-th percentile33.521416
Maximum33.557238
Range0.33816896
Interquartile range (IQR)0.09380799

Descriptive statistics

Standard deviation0.099295111
Coefficient of variation (CV)0.0029694321
Kurtosis-0.36549453
Mean33.439092
Median Absolute Deviation (MAD)0.02275053
Skewness-1.1559232
Sum152515.7
Variance0.009859519
MonotonicityNot monotonic
2023-12-12T05:04:22.775747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33.25231406 2
 
< 0.1%
33.51672455 2
 
< 0.1%
33.24595759 2
 
< 0.1%
33.48901358 2
 
< 0.1%
33.48774941 2
 
< 0.1%
33.49026752 2
 
< 0.1%
33.48992898 2
 
< 0.1%
33.47617214 2
 
< 0.1%
33.24453267 2
 
< 0.1%
33.24887938 2
 
< 0.1%
Other values (4536) 4541
99.6%
ValueCountFrequency (%)
33.21906913 1
< 0.1%
33.21950208 1
< 0.1%
33.21999467 1
< 0.1%
33.22112218 1
< 0.1%
33.2213515 1
< 0.1%
33.22136545 1
< 0.1%
33.22203926 1
< 0.1%
33.22249 1
< 0.1%
33.22271129 1
< 0.1%
33.22292619 1
< 0.1%
ValueCountFrequency (%)
33.55723809 1
< 0.1%
33.55665126 1
< 0.1%
33.55575726 1
< 0.1%
33.55573332 1
< 0.1%
33.55567405 1
< 0.1%
33.5556326 1
< 0.1%
33.55548159 1
< 0.1%
33.55528065 1
< 0.1%
33.55519153 1
< 0.1%
33.55516719 1
< 0.1%

경도
Real number (ℝ)

Distinct4540
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.52566
Minimum124.84923
Maximum126.93521
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size40.2 KiB
2023-12-12T05:04:22.950134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum124.84923
5-th percentile126.28745
Q1126.47431
median126.52833
Q3126.571
95-th percentile126.80111
Maximum126.93521
Range2.085974
Interquartile range (IQR)0.0966907

Descriptive statistics

Standard deviation0.13117854
Coefficient of variation (CV)0.0010367742
Kurtosis7.3638489
Mean126.52566
Median Absolute Deviation (MAD)0.0482002
Skewness-0.015860374
Sum577083.53
Variance0.017207809
MonotonicityNot monotonic
2023-12-12T05:04:23.186877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.4322932 2
 
< 0.1%
126.5190899 2
 
< 0.1%
126.4740787 2
 
< 0.1%
126.3923306 2
 
< 0.1%
126.5329081 2
 
< 0.1%
126.5131744 2
 
< 0.1%
126.1782584 2
 
< 0.1%
126.56323 2
 
< 0.1%
126.5358743 2
 
< 0.1%
126.5165245 2
 
< 0.1%
Other values (4530) 4541
99.6%
ValueCountFrequency (%)
124.8492344 1
< 0.1%
126.1729124 1
< 0.1%
126.1762601 1
< 0.1%
126.1769475 1
< 0.1%
126.1781194 1
< 0.1%
126.1781636 1
< 0.1%
126.1781782 1
< 0.1%
126.1782075 1
< 0.1%
126.1782298 1
< 0.1%
126.1782584 2
< 0.1%
ValueCountFrequency (%)
126.9352084 1
< 0.1%
126.9348646 1
< 0.1%
126.9347839 1
< 0.1%
126.9345749 1
< 0.1%
126.9342019 1
< 0.1%
126.9339396 1
< 0.1%
126.9334162 1
< 0.1%
126.932437 1
< 0.1%
126.932203 1
< 0.1%
126.9319849 1
< 0.1%
Distinct228
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size35.8 KiB
2023-12-12T05:04:23.613376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length146
Median length4
Mean length4.7853541
Min length1

Characters and Unicode

Total characters21826
Distinct characters384
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique208 ?
Unique (%)4.6%

Sample

1st row해당없음
2nd row해당없음
3rd row해당없음
4th row해당없음
5th row해당없음
ValueCountFrequency (%)
해당없음 4236
78.4%
신호등 65
 
1.2%
좋겠다 31
 
0.6%
신호등이 28
 
0.5%
횡단보도가 21
 
0.4%
없음 19
 
0.4%
많이 19
 
0.4%
횡단보도 18
 
0.3%
차가 18
 
0.3%
횡단보도에 14
 
0.3%
Other values (604) 935
 
17.3%
2023-12-12T05:04:24.239161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4289
19.7%
4276
19.6%
4266
19.5%
4238
19.4%
844
 
3.9%
162
 
0.7%
141
 
0.6%
134
 
0.6%
126
 
0.6%
126
 
0.6%
Other values (374) 3224
14.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20876
95.6%
Space Separator 844
 
3.9%
Other Punctuation 66
 
0.3%
Lowercase Letter 24
 
0.1%
Uppercase Letter 8
 
< 0.1%
Decimal Number 5
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4289
20.5%
4276
20.5%
4266
20.4%
4238
20.3%
162
 
0.8%
141
 
0.7%
134
 
0.6%
126
 
0.6%
126
 
0.6%
126
 
0.6%
Other values (345) 2992
14.3%
Lowercase Letter
ValueCountFrequency (%)
c 4
16.7%
u 3
12.5%
l 3
12.5%
e 2
8.3%
w 2
8.3%
t 2
8.3%
h 1
 
4.2%
s 1
 
4.2%
i 1
 
4.2%
g 1
 
4.2%
Other values (4) 4
16.7%
Uppercase Letter
ValueCountFrequency (%)
C 3
37.5%
G 1
 
12.5%
S 1
 
12.5%
T 1
 
12.5%
V 1
 
12.5%
K 1
 
12.5%
Decimal Number
ValueCountFrequency (%)
9 3
60.0%
5 1
 
20.0%
2 1
 
20.0%
Other Punctuation
ValueCountFrequency (%)
. 65
98.5%
! 1
 
1.5%
Space Separator
ValueCountFrequency (%)
844
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20876
95.6%
Common 918
 
4.2%
Latin 32
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4289
20.5%
4276
20.5%
4266
20.4%
4238
20.3%
162
 
0.8%
141
 
0.7%
134
 
0.6%
126
 
0.6%
126
 
0.6%
126
 
0.6%
Other values (345) 2992
14.3%
Latin
ValueCountFrequency (%)
c 4
 
12.5%
u 3
 
9.4%
C 3
 
9.4%
l 3
 
9.4%
e 2
 
6.2%
w 2
 
6.2%
t 2
 
6.2%
G 1
 
3.1%
S 1
 
3.1%
T 1
 
3.1%
Other values (10) 10
31.2%
Common
ValueCountFrequency (%)
844
91.9%
. 65
 
7.1%
9 3
 
0.3%
) 1
 
0.1%
5 1
 
0.1%
2 1
 
0.1%
- 1
 
0.1%
( 1
 
0.1%
! 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20872
95.6%
ASCII 950
 
4.4%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4289
20.5%
4276
20.5%
4266
20.4%
4238
20.3%
162
 
0.8%
141
 
0.7%
134
 
0.6%
126
 
0.6%
126
 
0.6%
126
 
0.6%
Other values (342) 2988
14.3%
ASCII
ValueCountFrequency (%)
844
88.8%
. 65
 
6.8%
c 4
 
0.4%
u 3
 
0.3%
9 3
 
0.3%
C 3
 
0.3%
l 3
 
0.3%
e 2
 
0.2%
w 2
 
0.2%
t 2
 
0.2%
Other values (19) 19
 
2.0%
Compat Jamo
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

요청사항
Categorical

IMBALANCE 

Distinct39
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size35.8 KiB
해당없음
4330 
신호등
 
113
횡단보도
 
37
인도
 
16
가로등
 
8
Other values (34)
 
57

Length

Max length11
Median length4
Mean length3.984214
Min length2

Unique

Unique22 ?
Unique (%)0.5%

Sample

1st row해당없음
2nd row해당없음
3rd row해당없음
4th row해당없음
5th row해당없음

Common Values

ValueCountFrequency (%)
해당없음 4330
94.9%
신호등 113
 
2.5%
횡단보도 37
 
0.8%
인도 16
 
0.4%
가로등 8
 
0.2%
과속방지턱 8
 
0.2%
CCTV 4
 
0.1%
자전거 도로 4
 
0.1%
신호등.횡단보도 3
 
0.1%
주차단속 2
 
< 0.1%
Other values (29) 36
 
0.8%

Length

2023-12-12T05:04:24.421366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
해당없음 4330
94.6%
신호등 113
 
2.5%
횡단보도 37
 
0.8%
인도 16
 
0.3%
가로등 8
 
0.2%
과속방지턱 8
 
0.2%
cctv 4
 
0.1%
자전거 4
 
0.1%
도로 4
 
0.1%
신호등.횡단보도 3
 
0.1%
Other values (38) 49
 
1.1%

분석아이디
Real number (ℝ)

UNIQUE 

Distinct4561
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8438.85
Minimum6050
Maximum11149
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size40.2 KiB
2023-12-12T05:04:24.559263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6050
5-th percentile6283
Q17199
median8344
Q39637
95-th percentile10921
Maximum11149
Range5099
Interquartile range (IQR)2438

Descriptive statistics

Standard deviation1458.6588
Coefficient of variation (CV)0.17285042
Kurtosis-1.0686369
Mean8438.85
Median Absolute Deviation (MAD)1219
Skewness0.20140302
Sum38489595
Variance2127685.5
MonotonicityNot monotonic
2023-12-12T05:04:24.720468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7287 1
 
< 0.1%
7149 1
 
< 0.1%
7141 1
 
< 0.1%
7142 1
 
< 0.1%
7143 1
 
< 0.1%
7144 1
 
< 0.1%
7145 1
 
< 0.1%
7147 1
 
< 0.1%
7150 1
 
< 0.1%
7139 1
 
< 0.1%
Other values (4551) 4551
99.8%
ValueCountFrequency (%)
6050 1
< 0.1%
6051 1
< 0.1%
6055 1
< 0.1%
6057 1
< 0.1%
6058 1
< 0.1%
6059 1
< 0.1%
6060 1
< 0.1%
6061 1
< 0.1%
6062 1
< 0.1%
6063 1
< 0.1%
ValueCountFrequency (%)
11149 1
< 0.1%
11148 1
< 0.1%
11147 1
< 0.1%
11146 1
< 0.1%
11145 1
< 0.1%
11144 1
< 0.1%
11143 1
< 0.1%
11142 1
< 0.1%
11141 1
< 0.1%
11140 1
< 0.1%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size35.8 KiB
Minimum2021-01-25 00:00:00
Maximum2021-01-25 00:00:00
2023-12-12T05:04:24.858193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T05:04:24.959188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T05:04:21.102349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T05:04:20.534551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T05:04:20.798806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/