Overview

Dataset statistics

Number of variables18
Number of observations482
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory67.9 KiB
Average record size in memory144.3 B

Variable types

Categorical14
Text3
DateTime1

Dataset

Description성남시내 국가예방접종지원사업 참여병원 현황 데이터로, 구별,동별, 의료기관명, 예방접종별(어린이국가예방접종, 건강여성첫걸음클리닉, B형주산기, 인플루엔자, 폐렴구균, A형간염, 기타, 코로나) 접종가능여부, 소재지주소, 전화번호 등의 항목으로 구성되어 있습니다
URLhttps://www.data.go.kr/data/15000821/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
폐렴구균(어르신) is highly overall correlated with 구별 and 12 other fieldsHigh correlation
인플루엔자(어린이 13-18세) is highly overall correlated with 구별 and 12 other fieldsHigh correlation
HPV국가예방접종사업 is highly overall correlated with 구별 and 12 other fieldsHigh correlation
B형주산기 is highly overall correlated with 구별 and 12 other fieldsHigh correlation
인플루엔자(어린이 13세이하) is highly overall correlated with 구별 and 12 other fieldsHigh correlation
기타사업 is highly overall correlated with 구별 and 12 other fieldsHigh correlation
동별 is highly overall correlated with 구별 and 12 other fieldsHigh correlation
인플루엔자(어르신) is highly overall correlated with 구별 and 12 other fieldsHigh correlation
어린이국가예방접종 is highly overall correlated with 구별 and 12 other fieldsHigh correlation
코로나 is highly overall correlated with 구별 and 12 other fieldsHigh correlation
A형간염(고위험군) is highly overall correlated with 구별 and 12 other fieldsHigh correlation
인플루엔자(수급권자) is highly overall correlated with 구별 and 12 other fieldsHigh correlation
구별 is highly overall correlated with 동별 and 12 other fieldsHigh correlation
인플루엔자(임신부) is highly overall correlated with 구별 and 12 other fieldsHigh correlation
인플루엔자(어르신) is highly imbalanced (53.2%)Imbalance
기타사업 is highly imbalanced (51.3%)Imbalance

Reproduction

Analysis started2023-12-12 06:23:28.081350
Analysis finished2023-12-12 06:23:29.839013
Duration1.76 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구별
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
분당구
228 
수정구
141 
중원구
113 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수정구
2nd row수정구
3rd row수정구
4th row수정구
5th row수정구

Common Values

ValueCountFrequency (%)
분당구 228
47.3%
수정구 141
29.3%
중원구 113
23.4%

Length

2023-12-12T15:23:30.246360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:30.363527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
분당구 228
47.3%
수정구 141
29.3%
중원구 113
23.4%

동별
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
신흥동
41 
정자동
36 
야탑동
35 
태평동
35 
창곡동
33 
Other values (26)
302 

Length

Max length4
Median length3
Mean length3.0497925
Min length2

Unique

Unique2 ?
Unique (%)0.4%

Sample

1st row신흥동
2nd row창곡동
3rd row창곡동
4th row태평동
5th row양지동

Common Values

ValueCountFrequency (%)
신흥동 41
 
8.5%
정자동 36
 
7.5%
야탑동 35
 
7.3%
태평동 35
 
7.3%
창곡동 33
 
6.8%
서현동 32
 
6.6%
구미동 27
 
5.6%
금광동 25
 
5.2%
금곡동 25
 
5.2%
성남동 22
 
4.6%
Other values (21) 171
35.5%

Length

2023-12-12T15:23:30.532012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
신흥동 41
 
8.5%
정자동 36
 
7.5%
야탑동 35
 
7.3%
태평동 35
 
7.3%
창곡동 33
 
6.8%
서현동 32
 
6.6%
구미동 27
 
5.6%
금광동 25
 
5.2%
금곡동 25
 
5.2%
성남동 22
 
4.6%
Other values (21) 171
35.5%
Distinct472
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2023-12-12T15:23:30.789737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length19
Mean length8.5020747
Min length3

Characters and Unicode

Total characters4098
Distinct characters291
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique463 ?
Unique (%)96.1%

Sample

1st row(의) 열린의료재단 인하의원
2nd row365위례연세내과의원
3rd row365프렌즈내과의원
4th row계정이비인후과의원
5th row고려정형외과의원
ValueCountFrequency (%)
두리이비인후과의원 3
 
0.6%
다나이비인후과의원 2
 
0.4%
산부인과의원 2
 
0.4%
소아청소년과의원 2
 
0.4%
우리의원 2
 
0.4%
열린이비인후과의원 2
 
0.4%
메디플러스의원 2
 
0.4%
연세내과의원 2
 
0.4%
상쾌한이비인후과의원 2
 
0.4%
행복한내과의원 2
 
0.4%
Other values (482) 483
95.8%
2023-12-12T15:23:31.297521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
492
 
12.0%
490
 
12.0%
350
 
8.5%
126
 
3.1%
119
 
2.9%
105
 
2.6%
101
 
2.5%
85
 
2.1%
84
 
2.0%
75
 
1.8%
Other values (281) 2071
50.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4029
98.3%
Decimal Number 26
 
0.6%
Space Separator 23
 
0.6%
Close Punctuation 6
 
0.1%
Open Punctuation 6
 
0.1%
Uppercase Letter 6
 
0.1%
Lowercase Letter 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
492
 
12.2%
490
 
12.2%
350
 
8.7%
126
 
3.1%
119
 
3.0%
105
 
2.6%
101
 
2.5%
85
 
2.1%
84
 
2.1%
75
 
1.9%
Other values (265) 2002
49.7%
Decimal Number
ValueCountFrequency (%)
3 6
23.1%
5 5
19.2%
6 5
19.2%
2 5
19.2%
1 4
15.4%
4 1
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
S 2
33.3%
W 1
16.7%
Y 1
16.7%
H 1
16.7%
L 1
16.7%
Space Separator
ValueCountFrequency (%)
23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4029
98.3%
Common 62
 
1.5%
Latin 7
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
492
 
12.2%
490
 
12.2%
350
 
8.7%
126
 
3.1%
119
 
3.0%
105
 
2.6%
101
 
2.5%
85
 
2.1%
84
 
2.1%
75
 
1.9%
Other values (265) 2002
49.7%
Common
ValueCountFrequency (%)
23
37.1%
) 6
 
9.7%
3 6
 
9.7%
( 6
 
9.7%
5 5
 
8.1%
6 5
 
8.1%
2 5
 
8.1%
1 4
 
6.5%
4 1
 
1.6%
- 1
 
1.6%
Latin
ValueCountFrequency (%)
S 2
28.6%
W 1
14.3%
e 1
14.3%
Y 1
14.3%
H 1
14.3%
L 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4029
98.3%
ASCII 69
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
492
 
12.2%
490
 
12.2%
350
 
8.7%
126
 
3.1%
119
 
3.0%
105
 
2.6%
101
 
2.5%
85
 
2.1%
84
 
2.1%
75
 
1.9%
Other values (265) 2002
49.7%
ASCII
ValueCountFrequency (%)
23
33.3%
) 6
 
8.7%
3 6
 
8.7%
( 6
 
8.7%
5 5
 
7.2%
6 5
 
7.2%
2 5
 
7.2%
1 4
 
5.8%
S 2
 
2.9%
4 1
 
1.4%
Other values (6) 6
 
8.7%

어린이국가예방접종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
276 
<NA>
206 

Length

Max length4
Median length1
Mean length2.2821577
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row
3rd row
4th row<NA>
5th row

Common Values

ValueCountFrequency (%)
276
57.3%
<NA> 206
42.7%

Length

2023-12-12T15:23:31.468114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:31.603978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
276
57.3%
na 206
42.7%

HPV국가예방접종사업
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
<NA>
303 
179 

Length

Max length4
Median length4
Mean length2.8858921
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 303
62.9%
179
37.1%

Length

2023-12-12T15:23:31.736239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:31.869355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 303
62.9%
179
37.1%

B형주산기
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
<NA>
414 
68 

Length

Max length4
Median length4
Mean length3.5767635
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 414
85.9%
68
 
14.1%

Length

2023-12-12T15:23:31.994959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:32.128021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 414
85.9%
68
 
14.1%

인플루엔자(어린이 13세이하)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
<NA>
259 
223 

Length

Max length4
Median length4
Mean length2.6120332
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 259
53.7%
223
46.3%

Length

2023-12-12T15:23:32.271035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:32.408865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 259
53.7%
223
46.3%

인플루엔자(어린이 13-18세)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
<NA>
398 
84 

Length

Max length4
Median length4
Mean length3.4771784
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row
3rd row
4th row<NA>
5th row

Common Values

ValueCountFrequency (%)
<NA> 398
82.6%
84
 
17.4%

Length

2023-12-12T15:23:32.562965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:32.705735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 398
82.6%
84
 
17.4%

인플루엔자(어르신)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
434 
<NA>
48 

Length

Max length4
Median length1
Mean length1.2987552
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
434
90.0%
<NA> 48
 
10.0%

Length

2023-12-12T15:23:32.823751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:32.955436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
434
90.0%
na 48
 
10.0%

인플루엔자(임신부)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
<NA>
307 
175 

Length

Max length4
Median length4
Mean length2.9107884
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 307
63.7%
175
36.3%

Length

2023-12-12T15:23:33.141556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:33.269406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 307
63.7%
175
36.3%

인플루엔자(수급권자)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
<NA>
427 
55 

Length

Max length4
Median length4
Mean length3.6576763
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row

Common Values

ValueCountFrequency (%)
<NA> 427
88.6%
55
 
11.4%

Length

2023-12-12T15:23:33.405385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:33.559253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 427
88.6%
55
 
11.4%

폐렴구균(어르신)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
314 
<NA>
168 

Length

Max length4
Median length1
Mean length2.0456432
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row
3rd row
4th row
5th row<NA>

Common Values

ValueCountFrequency (%)
314
65.1%
<NA> 168
34.9%

Length

2023-12-12T15:23:33.702605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:33.867453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
314
65.1%
na 168
34.9%

A형간염(고위험군)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
<NA>
413 
69 

Length

Max length4
Median length4
Mean length3.5705394
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 413
85.7%
69
 
14.3%

Length

2023-12-12T15:23:34.012056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:34.147676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 413
85.7%
69
 
14.3%

기타사업
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
431 
<NA>
51 

Length

Max length4
Median length1
Mean length1.3174274
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
431
89.4%
<NA> 51
 
10.6%

Length

2023-12-12T15:23:34.315059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:34.475341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
431
89.4%
na 51
 
10.6%

코로나
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
335 
<NA>
147 

Length

Max length4
Median length1
Mean length1.9149378
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row
3rd row
4th row
5th row<NA>

Common Values

ValueCountFrequency (%)
335
69.5%
<NA> 147
30.5%

Length

2023-12-12T15:23:34.601036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:23:34.747111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
335
69.5%
na 147
30.5%
Distinct478
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2023-12-12T15:23:35.097010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length55
Mean length38.819502
Min length25

Characters and Unicode

Total characters18711
Distinct characters307
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique474 ?
Unique (%)98.3%

Sample

1st row경기도 성남시 수정구 수정로 210, (신흥동, 제중빌딩)
2nd row경기도 성남시 수정구 위례동로 153, (창곡동, 에이플타워) 4층 404호
3rd row경기도 성남시 수정구 위례동로 147, (창곡동, 위례건아타워) 5층 505호
4th row경기도 성남시 수정구 수정로 121, (태평동) 계정이비인후과
5th row경기도 성남시 수정구 산성대로 517, (양지동) 2층
ValueCountFrequency (%)
경기도 482
 
12.5%
성남시 482
 
12.5%
분당구 228
 
5.9%
수정구 141
 
3.6%
중원구 113
 
2.9%
2층 92
 
2.4%
3층 49
 
1.3%
산성대로 48
 
1.2%
수정로 47
 
1.2%
신흥동 42
 
1.1%
Other values (865) 2141
55.4%
2023-12-12T15:23:35.743925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/