Overview

Dataset statistics

Number of variables6
Number of observations25
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory55.3 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description2020년 12월말 기준 경남도내 공동생활가정 현황(시설명, 소재지, 시설장, 시설규모(정원))등의 항목을 제공합니다
Author경상남도
URLhttps://www.data.go.kr/data/15053059/fileData.do

Alerts

연번 is highly overall correlated with 시군High correlation
시군 is highly overall correlated with 연번High correlation
시설규모(정원) is highly imbalanced (75.8%)Imbalance
연번 has unique valuesUnique
시설명 has unique valuesUnique
소재지 has unique valuesUnique
시설장 has unique valuesUnique

Reproduction

Analysis started2024-04-22 00:49:43.421299
Analysis finished2024-04-22 00:49:43.931467
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13
Minimum1
Maximum25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2024-04-22T09:49:43.991910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.2
Q17
median13
Q319
95-th percentile23.8
Maximum25
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.3598007
Coefficient of variation (CV)0.56613852
Kurtosis-1.2
Mean13
Median Absolute Deviation (MAD)6
Skewness0
Sum325
Variance54.166667
MonotonicityStrictly increasing
2024-04-22T09:49:44.112834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
1 1
 
4.0%
2 1
 
4.0%
25 1
 
4.0%
24 1
 
4.0%
23 1
 
4.0%
22 1
 
4.0%
21 1
 
4.0%
20 1
 
4.0%
19 1
 
4.0%
18 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
1 1
4.0%
2 1
4.0%
3 1
4.0%
4 1
4.0%
5 1
4.0%
6 1
4.0%
7 1
4.0%
8 1
4.0%
9 1
4.0%
10 1
4.0%
ValueCountFrequency (%)
25 1
4.0%
24 1
4.0%
23 1
4.0%
22 1
4.0%
21 1
4.0%
20 1
4.0%
19 1
4.0%
18 1
4.0%
17 1
4.0%
16 1
4.0%

시군
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)36.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
창원시
진주시
통영시
사천시
김해시
Other values (4)

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique2 ?
Unique (%)8.0%

Sample

1st row창원시
2nd row창원시
3rd row창원시
4th row창원시
5th row창원시

Common Values

ValueCountFrequency (%)
창원시 5
20.0%
진주시 5
20.0%
통영시 4
16.0%
사천시 3
12.0%
김해시 2
 
8.0%
거제시 2
 
8.0%
양산시 2
 
8.0%
함안군 1
 
4.0%
거창군 1
 
4.0%

Length

2024-04-22T09:49:44.237939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:49:44.333913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
창원시 5
20.0%
진주시 5
20.0%
통영시 4
16.0%
사천시 3
12.0%
김해시 2
 
8.0%
거제시 2
 
8.0%
양산시 2
 
8.0%
함안군 1
 
4.0%
거창군 1
 
4.0%

시설명
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2024-04-22T09:49:44.524142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length4
Mean length4.52
Min length2

Characters and Unicode

Total characters113
Distinct characters66
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row빛누리집
2nd row마산한울타리
3rd row꿈놀이터
4th row행복울타리
5th row은혜의집
ValueCountFrequency (%)
빛누리집 1
 
4.0%
파란나라 1
 
4.0%
푸른꿈그룹홈 1
 
4.0%
연지 1
 
4.0%
푸른나래 1
 
4.0%
좋은씨앗 1
 
4.0%
콩이네집 1
 
4.0%
자작나무 1
 
4.0%
두리골해맑은아이들의집 1
 
4.0%
모모네 1
 
4.0%
Other values (15) 15
60.0%
2024-04-22T09:49:44.835839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7
 
6.2%
6
 
5.3%
5
 
4.4%
4
 
3.5%
4
 
3.5%
4
 
3.5%
3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
Other values (56) 71
62.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 113
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
6.2%
6
 
5.3%
5
 
4.4%
4
 
3.5%
4
 
3.5%
4
 
3.5%
3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
Other values (56) 71
62.8%

Most occurring scripts

ValueCountFrequency (%)
Hangul 113
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
6.2%
6
 
5.3%
5
 
4.4%
4
 
3.5%
4
 
3.5%
4
 
3.5%
3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
Other values (56) 71
62.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 113
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7
 
6.2%
6
 
5.3%
5
 
4.4%
4
 
3.5%
4
 
3.5%
4
 
3.5%
3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
Other values (56) 71
62.8%

소재지
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2024-04-22T09:49:45.079977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length32
Mean length27.96
Min length17

Characters and Unicode

Total characters699
Distinct characters89
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row경상남도 창원시 마산합포구 구산면 옥계로 11
2nd row경상남도 창원시 마산합포구 월영남로 59,106동 1603호(월영동 동아1차)
3rd row경상남도 창원시 진해구 인사로 32번길 12(제황산동 28-428)
4th row경상남도 창원시 진해구 석동로 75번길 20-8(석동 637-1)
5th row경상남도 창원시 의창구 읍성로 102-1, 3층 (소답동)
ValueCountFrequency (%)
경상남도 25
 
17.7%
창원시 5
 
3.5%
진주시 5
 
3.5%
통영시 4
 
2.8%
사천시 3
 
2.1%
양산시 2
 
1.4%
정동면 2
 
1.4%
김해시 2
 
1.4%
삼안로 2
 
1.4%
거제시 2
 
1.4%
Other values (85) 89
63.1%
2024-04-22T09:49:45.470940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
116
 
16.6%
1 34
 
4.9%
29
 
4.1%
2 29
 
4.1%
28
 
4.0%
27
 
3.9%
25
 
3.6%
23
 
3.3%
23
 
3.3%
0 20
 
2.9%
Other values (79) 345
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 387
55.4%
Decimal Number 145
 
20.7%
Space Separator 116
 
16.6%
Close Punctuation 15
 
2.1%
Open Punctuation 15
 
2.1%
Dash Punctuation 13
 
1.9%
Other Punctuation 8
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
7.5%
28
 
7.2%
27
 
7.0%
25
 
6.5%
23
 
5.9%
23
 
5.9%
17
 
4.4%
16
 
4.1%
9
 
2.3%
8
 
2.1%
Other values (64) 182
47.0%
Decimal Number
ValueCountFrequency (%)
1 34
23.4%
2 29
20.0%
0 20
13.8%
5 15
10.3%
6 10
 
6.9%
8 9
 
6.2%
3 8
 
5.5%
9 8
 
5.5%
4 6
 
4.1%
7 6
 
4.1%
Space Separator
ValueCountFrequency (%)
116
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Other Punctuation
ValueCountFrequency (%)
, 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 387
55.4%
Common 312
44.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
7.5%
28
 
7.2%
27
 
7.0%
25
 
6.5%
23
 
5.9%
23
 
5.9%
17
 
4.4%
16
 
4.1%
9
 
2.3%
8
 
2.1%
Other values (64) 182
47.0%
Common
ValueCountFrequency (%)
116
37.2%
1 34
 
10.9%
2 29
 
9.3%
0 20
 
6.4%
) 15
 
4.8%
( 15
 
4.8%
5 15
 
4.8%
- 13
 
4.2%
6 10
 
3.2%
8 9
 
2.9%
Other values (5) 36
 
11.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 387
55.4%
ASCII 312
44.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
116
37.2%
1 34
 
10.9%
2 29
 
9.3%
0 20
 
6.4%
) 15
 
4.8%
( 15
 
4.8%
5 15
 
4.8%
- 13
 
4.2%
6 10
 
3.2%
8 9
 
2.9%
Other values (5) 36
 
11.5%
Hangul
ValueCountFrequency (%)
29
 
7.5%
28
 
7.2%
27
 
7.0%
25
 
6.5%
23
 
5.9%
23
 
5.9%
17
 
4.4%
16
 
4.1%
9
 
2.3%
8
 
2.1%
Other values (64) 182
47.0%

시설장
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2024-04-22T09:49:45.657828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.96
Min length2

Characters and Unicode

Total characters74
Distinct characters46
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row이경순
2nd row나혜령
3rd row이광원
4th row김수경
5th row전정민
ValueCountFrequency (%)
이경순 1
 
4.0%
강명석 1
 
4.0%
최재열 1
 
4.0%
한미나 1
 
4.0%
이성우 1
 
4.0%
제은혜 1
 
4.0%
이미진 1
 
4.0%
박현태 1
 
4.0%
박명조 1
 
4.0%
한경희 1
 
4.0%
Other values (15) 15
60.0%
2024-04-22T09:49:45.960641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
 
6.8%
4
 
5.4%
4
 
5.4%
4
 
5.4%
3
 
4.1%
3
 
4.1%
3
 
4.1%
3
 
4.1%
2
 
2.7%
2
 
2.7%
Other values (36) 41
55.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 74
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
6.8%
4
 
5.4%
4
 
5.4%
4
 
5.4%
3
 
4.1%
3
 
4.1%
3
 
4.1%
3
 
4.1%
2
 
2.7%
2
 
2.7%
Other values (36) 41
55.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 74
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
6.8%
4
 
5.4%
4
 
5.4%
4
 
5.4%
3
 
4.1%
3
 
4.1%
3
 
4.1%
3
 
4.1%
2
 
2.7%
2
 
2.7%
Other values (36) 41
55.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 74
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5
 
6.8%
4
 
5.4%
4
 
5.4%
4
 
5.4%
3
 
4.1%
3
 
4.1%
3
 
4.1%
3
 
4.1%
2
 
2.7%
2
 
2.7%
Other values (36) 41
55.4%

시설규모(정원)
Categorical

IMBALANCE 

Distinct2
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
7
24 
5
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)4.0%

Sample

1st row7
2nd row7
3rd row7
4th row7
5th row7

Common Values

ValueCountFrequency (%)
7 24
96.0%
5 1
 
4.0%

Length

2024-04-22T09:49:46.080660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:49:46.170646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
7 24
96.0%
5 1
 
4.0%

Interactions

2024-04-22T09:49:43.699188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-22T09:49:46.233362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군시설명소재지시설장시설규모(정원)
연번1.0000.8521.0001.0001.0000.559
시군0.8521.0001.0001.0001.0000.000
시설명1.0001.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.0001.000
시설장1.0001.0001.0001.0001.0001.000
시설규모(정원)0.5590.0001.0001.0001.0001.000
2024-04-22T09:49:46.329121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군시설규모(정원)
시군1.0000.000
시설규모(정원)0.0001.000
2024-04-22T09:49:46.409553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/