Overview

Dataset statistics

Number of variables6
Number of observations1098
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory55.9 KiB
Average record size in memory52.1 B

Variable types

Categorical2
Text1
Numeric3

Dataset

Description김해시에서 통계기반 도시현황 파악을 위해 개발한 통계지수 중 하나로서, 통계연도, 시도명, 시군구명, 유아천명당 어린이집의 수(개), 보육시설수(개), 0에서5세아동수(명)로 구성되어 있습니다. 김해시 중심의 통계지수로서, 데이터 수집, 가공 등의 어려움으로 김해시 외 지역의 정보는 누락될 수 있습니다.
Author경상남도 김해시
URLhttps://www.data.go.kr/data/15110149/fileData.do

Alerts

유아천명당 어린이집의 수(개) is highly overall correlated with 보육시설수(개)High correlation
보육시설수(개) is highly overall correlated with 유아천명당 어린이집의 수(개) and 1 other fieldsHigh correlation
0에서5세아동수(명) is highly overall correlated with 보육시설수(개)High correlation

Reproduction

Analysis started2023-12-12 05:54:07.458698
Analysis finished2023-12-12 05:54:09.051007
Duration1.59 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

통계연도
Categorical

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
2014
227 
2015
227 
2016
227 
2012
226 
2013
191 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2012
2nd row2012
3rd row2012
4th row2012
5th row2012

Common Values

ValueCountFrequency (%)
2014 227
20.7%
2015 227
20.7%
2016 227
20.7%
2012 226
20.6%
2013 191
17.4%

Length

2023-12-12T14:54:09.136765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:54:09.313378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2014 227
20.7%
2015 227
20.7%
2016 227
20.7%
2012 226
20.6%
2013 191
17.4%

시도명
Categorical

Distinct16
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
경기도
154 
경상북도
115 
전라남도
110 
서울특별시
100 
강원도
90 
Other values (11)
529 

Length

Max length7
Median length5
Mean length4.1147541
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
경기도 154
14.0%
경상북도 115
10.5%
전라남도 110
10.0%
서울특별시 100
9.1%
강원도 90
8.2%
경상남도 90
8.2%
부산광역시 80
7.3%
충청남도 75
6.8%
전라북도 70
6.4%
인천광역시 45
 
4.1%
Other values (6) 169
15.4%

Length

2023-12-12T14:54:09.450162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 154
14.0%
경상북도 115
10.5%
전라남도 110
10.0%
서울특별시 100
9.1%
강원도 90
8.2%
경상남도 90
8.2%
부산광역시 80
7.3%
충청남도 75
6.8%
전라북도 70
6.4%
인천광역시 45
 
4.1%
Other values (6) 169
15.4%
Distinct205
Distinct (%)18.7%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
2023-12-12T14:54:09.823138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9253188
Min length2

Characters and Unicode

Total characters3212
Distinct characters130
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종로구
2nd row중구
3rd row용산구
4th row성동구
5th row광진구
ValueCountFrequency (%)
동구 30
 
2.7%
중구 29
 
2.6%
서구 25
 
2.3%
남구 20
 
1.8%
북구 20
 
1.8%
고성군 10
 
0.9%
강서구 9
 
0.8%
동해시 5
 
0.5%
군위군 5
 
0.5%
계룡시 5
 
0.5%
Other values (195) 940
85.6%
2023-12-12T14:54:10.422954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
417
 
13.0%
386
 
12.0%
339
 
10.6%
105
 
3.3%
97
 
3.0%
88
 
2.7%
87
 
2.7%
80
 
2.5%
78
 
2.4%
62
 
1.9%
Other values (120) 1473
45.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3212
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
417
 
13.0%
386
 
12.0%
339
 
10.6%
105
 
3.3%
97
 
3.0%
88
 
2.7%
87
 
2.7%
80
 
2.5%
78
 
2.4%
62
 
1.9%
Other values (120) 1473
45.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3212
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
417
 
13.0%
386
 
12.0%
339
 
10.6%
105
 
3.3%
97
 
3.0%
88
 
2.7%
87
 
2.7%
80
 
2.5%
78
 
2.4%
62
 
1.9%
Other values (120) 1473
45.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3212
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
417
 
13.0%
386
 
12.0%
339
 
10.6%
105
 
3.3%
97
 
3.0%
88
 
2.7%
87
 
2.7%
80
 
2.5%
78
 
2.4%
62
 
1.9%
Other values (120) 1473
45.9%

유아천명당 어린이집의 수(개)
Real number (ℝ)

HIGH CORRELATION 

Distinct755
Distinct (%)68.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.052286
Minimum2.31
Maximum28.25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.8 KiB
2023-12-12T14:54:10.613608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.31
5-th percentile8.2285
Q111.4525
median13.565
Q316.845
95-th percentile20.5545
Maximum28.25
Range25.94
Interquartile range (IQR)5.3925

Descriptive statistics

Standard deviation3.9160243
Coefficient of variation (CV)0.27867525
Kurtosis0.33690403
Mean14.052286
Median Absolute Deviation (MAD)2.535
Skewness0.34056284
Sum15429.41
Variance15.335246
MonotonicityNot monotonic
2023-12-12T14:54:10.802134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14.41 6
 
0.5%
13.15 6
 
0.5%
17.12 5
 
0.5%
12.76 5
 
0.5%
13.38 4
 
0.4%
11.0 4
 
0.4%
14.86 4
 
0.4%
17.33 4
 
0.4%
13.26 4
 
0.4%
19.25 4
 
0.4%
Other values (745) 1052
95.8%
ValueCountFrequency (%)
2.31 1
0.1%
2.32 1
0.1%
2.6 1
0.1%
2.79 1
0.1%
3.3 1
0.1%
4.14 1
0.1%
4.65 1
0.1%
5.51 1
0.1%
5.77 2
0.2%
5.85 1
0.1%
ValueCountFrequency (%)
28.25 1
0.1%
27.62 1
0.1%
27.3 1
0.1%
26.02 1
0.1%
25.96 1
0.1%
25.95 1
0.1%
25.79 1
0.1%
25.78 1
0.1%
25.37 1
0.1%
25.22 1
0.1%

보육시설수(개)
Real number (ℝ)

HIGH CORRELATION 

Distinct427
Distinct (%)38.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean184.63843
Minimum1
Maximum1311
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.8 KiB
2023-12-12T14:54:10.984718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11
Q125.25
median104
Q3250.75
95-th percentile677.45
Maximum1311
Range1310
Interquartile range (IQR)225.5

Descriptive statistics

Standard deviation223.09692
Coefficient of variation (CV)1.2082908
Kurtosis5.2785685
Mean184.63843
Median Absolute Deviation (MAD)89
Skewness2.1089641
Sum202733
Variance49772.237
MonotonicityNot monotonic
2023-12-12T14:54:11.172631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14 32
 
2.9%
13 27
 
2.5%
11 24
 
2.2%
17 21
 
1.9%
12 21
 
1.9%
16 16
 
1.5%
15 14
 
1.3%
26 12
 
1.1%
25 12
 
1.1%
39 12
 
1.1%
Other values (417) 907
82.6%
ValueCountFrequency (%)
1 1
 
0.1%
2 4
0.4%
3 2
 
0.2%
4 3
 
0.3%
5 6
0.5%
6 6
0.5%
7 4
0.4%
8 8
0.7%
9 6
0.5%
10 9
0.8%
ValueCountFrequency (%)
1311 1
0.1%
1284 1
0.1%
1266 1
0.1%
1254 1
0.1%
1187 1
0.1%
1182 1
0.1%
1166 1
0.1%
1161 1
0.1%
1134 1
0.1%
1114 1
0.1%

0에서5세아동수(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct1056
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11827.185
Minimum291
Maximum71849
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.8 KiB
2023-12-12T14:54:11.347106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum291
5-th percentile1013.6
Q12227.75
median6863.5
Q317745.75
95-th percentile35789.65
Maximum71849
Range71558
Interquartile range (IQR)15518

Descriptive statistics

Standard deviation12860.77
Coefficient of variation (CV)1.0873906
Kurtosis3.9464356
Mean11827.185
Median Absolute Deviation (MAD)5528
Skewness1.8215064
Sum12986249
Variance1.653994 × 108
MonotonicityNot monotonic
2023-12-12T14:54:11.518728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1429 3
 
0.3%
10356 2
 
0.2%
14542 2
 
0.2%
1117 2
 
0.2%
3499 2
 
0.2%
3732 2
 
0.2%
1034 2
 
0.2%
746 2
 
0.2%
2632 2
 
0.2%
2325 2
 
0.2%
Other values (1046) 1077
98.1%
ValueCountFrequency (%)
291 1
0.1%
303 1
0.1%
316 1
0.1%
323 1
0.1%
342 1
0.1%
559 1
0.1%
563 1
0.1%
569 1
0.1%
574 1
0.1%
578 1
0.1%
ValueCountFrequency (%)
71849 1
0.1%
71795 1
0.1%
70161 1
0.1%
70147 1
0.1%
69027 1
0.1%
67766 1
0.1%
67435 1
0.1%
66464 1
0.1%
65496 1
0.1%
64446 1
0.1%

Interactions

2023-12-12T14:54:08.476815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:07.780066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:08.115044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:08.580589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:07.888025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:08.231807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:08.694592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/