Overview

Dataset statistics

Number of variables8
Number of observations843
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)0.2%
Total size in memory57.8 KiB
Average record size in memory70.2 B

Variable types

Text1
Numeric6
Categorical1

Dataset

Description교통 안전 데이터로서 어린이 등하교 및 초등학교 주변 사각지대, 위험요소 데이터 제공합니다. (출처: 공공데이터포털, https://www.data.go.kr/data/15076627/fileData.do)
Author백수빈
URLhttps://www.jejudatahub.net/data/view/data/875

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 2 (0.2%) duplicate rowsDuplicates
위험요소 준경험 선택건수_초등학교 전체 is highly overall correlated with 사각지대분석지정점수High correlation
위험지역선택건수_초등학교 전체 is highly overall correlated with 위험지역선택건수_전연령 and 1 other fieldsHigh correlation
위험지역선택건수_전연령 is highly overall correlated with 위험지역선택건수_초등학교 전체 and 1 other fieldsHigh correlation
사각지대분석지정점수 is highly overall correlated with 위험요소 준경험 선택건수_초등학교 전체 and 2 other fieldsHigh correlation
위험지역선택건수_초등학교 전체 has 560 (66.4%) zerosZeros
위험지역선택건수_전연령 has 560 (66.4%) zerosZeros

Reproduction

Analysis started2023-12-11 20:04:09.856291
Analysis finished2023-12-11 20:04:13.945713
Duration4.09 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct105
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size6.7 KiB
2023-12-12T05:04:14.164413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length6
Mean length6.1708185
Min length6

Characters and Unicode

Total characters5202
Distinct characters116
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)2.0%

Sample

1st row고산초등학교
2nd row고산초등학교
3rd row고산초등학교
4th row고산초등학교
5th row고산초등학교
ValueCountFrequency (%)
이도초등학교 52
 
6.2%
외도초등학교 35
 
4.2%
동홍초등학교 29
 
3.4%
인화초등학교 26
 
3.1%
도남초등학교 26
 
3.1%
오라초등학교 24
 
2.8%
신광초등학교 23
 
2.7%
남광초등학교 22
 
2.6%
삼양초등학교 22
 
2.6%
삼화초등학교 19
 
2.3%
Other values (95) 565
67.0%
2023-12-12T05:04:14.599207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
843
16.2%
843
16.2%
842
16.2%
831
16.0%
165
 
3.2%
83
 
1.6%
78
 
1.5%
78
 
1.5%
75
 
1.4%
71
 
1.4%
Other values (106) 1293
24.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5202
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
843
16.2%
843
16.2%
842
16.2%
831
16.0%
165
 
3.2%
83
 
1.6%
78
 
1.5%
78
 
1.5%
75
 
1.4%
71
 
1.4%
Other values (106) 1293
24.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5202
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
843
16.2%
843
16.2%
842
16.2%
831
16.0%
165
 
3.2%
83
 
1.6%
78
 
1.5%
78
 
1.5%
75
 
1.4%
71
 
1.4%
Other values (106) 1293
24.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5202
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
843
16.2%
843
16.2%
842
16.2%
831
16.0%
165
 
3.2%
83
 
1.6%
78
 
1.5%
78
 
1.5%
75
 
1.4%
71
 
1.4%
Other values (106) 1293
24.9%

위도
Real number (ℝ)

Distinct833
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.444766
Minimum33.222862
Maximum33.555681
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.5 KiB
2023-12-12T05:04:14.770488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33.222862
5-th percentile33.251394
Q133.448938
median33.488683
Q333.504465
95-th percentile33.52182
Maximum33.555681
Range0.33281824
Interquartile range (IQR)0.05552694

Descriptive statistics

Standard deviation0.097252072
Coefficient of variation (CV)0.0029078413
Kurtosis0.00063197345
Mean33.444766
Median Absolute Deviation (MAD)0.02045362
Skewness-1.3001855
Sum28193.938
Variance0.0094579656
MonotonicityNot monotonic
2023-12-12T05:04:14.915053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33.25330102 4
 
0.5%
33.26141106 4
 
0.5%
33.53317837 2
 
0.2%
33.50917533 2
 
0.2%
33.49296271 2
 
0.2%
33.48340532 2
 
0.2%
33.5028233 1
 
0.1%
33.51105318 1
 
0.1%
33.50979062 1
 
0.1%
33.51214399 1
 
0.1%
Other values (823) 823
97.6%
ValueCountFrequency (%)
33.22286229 1
0.1%
33.2230448 1
0.1%
33.2232273 1
0.1%
33.22351479 1
0.1%
33.22354715 1
0.1%
33.22355146 1
0.1%
33.22360519 1
0.1%
33.22377267 1
0.1%
33.22394874 1
0.1%
33.22395088 1
0.1%
ValueCountFrequency (%)
33.55568053 1
0.1%
33.55478779 1
0.1%
33.5546035 1
0.1%
33.55460089 1
0.1%
33.55424275 1
0.1%
33.55422182 1
0.1%
33.55345687 1
0.1%
33.55166509 1
0.1%
33.54885705 1
0.1%
33.54143114 1
0.1%

경도
Real number (ℝ)

Distinct834
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.52054
Minimum126.17697
Maximum126.93146
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.5 KiB
2023-12-12T05:04:15.062669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.17697
5-th percentile126.26921
Q1126.47486
median126.52894
Q3126.56804
95-th percentile126.76431
Maximum126.93146
Range0.7544903
Interquartile range (IQR)0.0931761

Descriptive statistics

Standard deviation0.12731477
Coefficient of variation (CV)0.0010062775
Kurtosis2.1889543
Mean126.52054
Median Absolute Deviation (MAD)0.0468022
Skewness0.3415811
Sum106656.81
Variance0.01620905
MonotonicityNot monotonic
2023-12-12T05:04:15.229128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.3050547 4
 
0.5%
126.3043001 4
 
0.5%
126.6191298 2
 
0.2%
126.5213976 2
 
0.2%
126.4159927 2
 
0.2%
126.516523 1
 
0.1%
126.5095144 1
 
0.1%
126.5080257 1
 
0.1%
126.5080402 1
 
0.1%
126.50909 1
 
0.1%
Other values (824) 824
97.7%
ValueCountFrequency (%)
126.1769706 1
0.1%
126.1775959 1
0.1%
126.1778406 1
0.1%
126.1780526 1
0.1%
126.1782402 1
0.1%
126.1782511 1
0.1%
126.1784523 1
0.1%
126.1793168 1
0.1%
126.1795642 1
0.1%
126.1819322 1
0.1%
ValueCountFrequency (%)
126.9314609 1
0.1%
126.9221871 1
0.1%
126.9156468 1
0.1%
126.9150013 1
0.1%
126.9147861 1
0.1%
126.9145709 1
0.1%
126.9143448 1
0.1%
126.913709 1
0.1%
126.9121954 1
0.1%
126.9096218 1
0.1%
Distinct7
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1601423
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.5 KiB
2023-12-12T05:04:15.366542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile2
Maximum10
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.60025804
Coefficient of variation (CV)0.51740034
Kurtosis81.110631
Mean1.1601423
Median Absolute Deviation (MAD)0
Skewness7.3526222
Sum978
Variance0.36030972
MonotonicityNot monotonic
2023-12-12T05:04:15.499743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 750
89.0%
2 72
 
8.5%
3 11
 
1.3%
4 7
 
0.8%
10 1
 
0.1%
5 1
 
0.1%
8 1
 
0.1%
ValueCountFrequency (%)
1 750
89.0%
2 72
 
8.5%
3 11
 
1.3%
4 7
 
0.8%
5 1
 
0.1%
8 1
 
0.1%
10 1
 
0.1%
ValueCountFrequency (%)
10 1
 
0.1%
8 1
 
0.1%
5 1
 
0.1%
4 7
 
0.8%
3 11
 
1.3%
2 72
 
8.5%
1 750
89.0%

위험지역선택건수_초등학교 전체
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct12
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.72479241
Minimum0
Maximum22
Zeros560
Zeros (%)66.4%
Negative0
Negative (%)0.0%
Memory size7.5 KiB
2023-12-12T05:04:15.640609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile4
Maximum22
Range22
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.5458748
Coefficient of variation (CV)2.1328519
Kurtosis48.897542
Mean0.72479241
Median Absolute Deviation (MAD)0
Skewness5.0970218
Sum611
Variance2.389729
MonotonicityNot monotonic
2023-12-12T05:04:15.751277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
0 560
66.4%
1 150
 
17.8%
2 52
 
6.2%
3 32
 
3.8%
4 29
 
3.4%
5 9
 
1.1%
7 3
 
0.4%
8 3
 
0.4%
6 2
 
0.2%
9 1
 
0.1%
Other values (2) 2
 
0.2%
ValueCountFrequency (%)
0 560
66.4%
1 150
 
17.8%
2 52
 
6.2%
3 32
 
3.8%
4 29
 
3.4%
5 9
 
1.1%
6 2
 
0.2%
7 3
 
0.4%
8 3
 
0.4%
9 1
 
0.1%
ValueCountFrequency (%)
22 1
 
0.1%
12 1
 
0.1%
9 1
 
0.1%
8 3
 
0.4%
7 3
 
0.4%
6 2
 
0.2%
5 9
 
1.1%
4 29
3.4%
3 32
3.8%
2 52
6.2%

위험지역선택건수_전연령
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct12
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.72360617
Minimum0
Maximum22
Zeros560
Zeros (%)66.4%
Negative0
Negative (%)0.0%
Memory size7.5 KiB
2023-12-12T05:04:15.934667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile4
Maximum22
Range22
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.5452787
Coefficient of variation (CV)2.1355245
Kurtosis48.992881
Mean0.72360617
Median Absolute Deviation (MAD)0
Skewness5.1045692
Sum610
Variance2.3878863
MonotonicityNot monotonic
2023-12-12T05:04:16.088170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
0 560
66.4%
1 151
 
17.9%
2 51
 
6.0%
3 32
 
3.8%
4 29
 
3.4%
5 9
 
1.1%
7 3
 
0.4%
8 3
 
0.4%
6 2
 
0.2%
9 1
 
0.1%
Other values (2) 2
 
0.2%
ValueCountFrequency (%)
0 560
66.4%
1 151
 
17.9%
2 51
 
6.0%
3 32
 
3.8%
4 29
 
3.4%
5 9
 
1.1%
6 2
 
0.2%
7 3
 
0.4%
8 3
 
0.4%
9 1
 
0.1%
ValueCountFrequency (%)
22 1
 
0.1%
12 1
 
0.1%
9 1
 
0.1%
8 3
 
0.4%
7 3
 
0.4%
6 2
 
0.2%
5 9
 
1.1%
4 29
3.4%
3 32
3.8%
2 51
6.0%

사각지대분석지정점수
Real number (ℝ)

HIGH CORRELATION 

Distinct53
Distinct (%)6.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.7113049
Minimum5
Maximum53
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.5 KiB
2023-12-12T05:04:16.267274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile5
Q15
median5
Q38
95-th percentile12.67
Maximum53
Range48
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.5084273
Coefficient of variation (CV)0.52276382
Kurtosis51.724424
Mean6.7113049
Median Absolute Deviation (MAD)0
Skewness5.4863442
Sum5657.63
Variance12.309062
MonotonicityNot monotonic
2023-12-12T05:04:16.708845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.0 519
61.6%
8.0 85
 
10.1%
7.0 45
 
5.3%
10.0 40
 
4.7%
9.0 22
 
2.6%
7.5 21
 
2.5%
13.0 14
 
1.7%
6.0 13
 
1.5%
7.67 6
 
0.7%
8.5 5
 
0.6%
Other values (43) 73
 
8.7%
ValueCountFrequency (%)
5.0 519
61.6%
6.0 13
 
1.5%
6.33 3
 
0.4%
6.5 5
 
0.6%
6.67 1
 
0.1%
7.0 45
 
5.3%
7.14 1
 
0.1%
7.2 1
 
0.1%
7.25 2
 
0.2%
7.33 3
 
0.4%
ValueCountFrequency (%)
53.0 1
0.1%
42.2 1
0.1%
25.0 1
0.1%
23.0 2
0.2%
22.57 1
0.1%
22.43 1
0.1%
22.08 1
0.1%
22.0 1
0.1%
20.0 1
0.1%
19.0 1
0.1%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.7 KiB
2021-01-25
843 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-01-25
2nd row2021-01-25
3rd row2021-01-25
4th row2021-01-25
5th row2021-01-25

Common Values

ValueCountFrequency (%)
2021-01-25 843
100.0%

Length

2023-12-12T05:04:16.875941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:04:17.009992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-01-25 843
100.0%

Interactions

2023-12-12T05:04:13.173293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T05:04:10.405816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T05:04:11.052697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T05:04:11.579316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/<