Overview

Dataset statistics

Number of variables16
Number of observations313
Missing cells278
Missing cells (%)5.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory39.3 KiB
Average record size in memory128.4 B

Variable types

DateTime3
Categorical7
Text6

Dataset

Description충청남도 공주시 진단용방사선발생장치 현황에 대한 데이터로 (의료기관명, 장비명, 장비수 ) 등의 항목을 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=436&beforeMenuCd=DOM_000000201001001000&publicdatapk=15030706

Alerts

장비용도 is highly overall correlated with 장비형태High correlation
장비형태 is highly overall correlated with 장비용도High correlation
제조국명 is highly imbalanced (52.8%)Imbalance
의료기관영업구분 is highly imbalanced (60.7%)Imbalance
판매회사 has 263 (84.0%) missing valuesMissing
제조사 has 15 (4.8%) missing valuesMissing

Reproduction

Analysis started2024-01-09 21:17:57.357360
Analysis finished2024-01-09 21:17:58.401901
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct240
Distinct (%)76.7%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
Minimum1994-05-23 00:00:00
Maximum2018-08-24 00:00:00
2024-01-10T06:17:58.457409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:17:58.568455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct5
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
사용중
159 
양도양수
70 
폐기
41 
사용중지
34 
이전
 
9

Length

Max length4
Median length3
Mean length3.172524
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사용중
2nd row사용중
3rd row사용중
4th row사용중
5th row사용중

Common Values

ValueCountFrequency (%)
사용중 159
50.8%
양도양수 70
22.4%
폐기 41
 
13.1%
사용중지 34
 
10.9%
이전 9
 
2.9%

Length

2024-01-10T06:17:58.690877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:17:58.803642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사용중 159
50.8%
양도양수 70
22.4%
폐기 41
 
13.1%
사용중지 34
 
10.9%
이전 9
 
2.9%

장비용도
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
치과일반
71 
일반
48 
촬영 및 투시
48 
골밀도
35 
치과용 파노라마
33 
Other values (10)
78 

Length

Max length12
Median length8
Mean length4.7380192
Min length2

Unique

Unique3 ?
Unique (%)1.0%

Sample

1st row골밀도
2nd row일반
3rd row유방촬영
4th row골밀도
5th row일반

Common Values

ValueCountFrequency (%)
치과일반 71
22.7%
일반 48
15.3%
촬영 및 투시 48
15.3%
골밀도 35
11.2%
치과용 파노라마 33
10.5%
전신 16
 
5.1%
치과용CT 및 파노라마 14
 
4.5%
C-arm 14
 
4.5%
이동용 14
 
4.5%
유방촬영 11
 
3.5%
Other values (5) 9
 
2.9%

Length

2024-01-10T06:17:58.922943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
치과일반 71
15.0%
62
13.1%
촬영 49
10.4%
투시 49
10.4%
일반 48
10.2%
파노라마 47
10.0%
골밀도 35
7.4%
치과용 33
7.0%
치과용ct 18
 
3.8%
전신 16
 
3.4%
Other values (7) 44
9.3%

장비형태
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
거치형
273 
이동형
40 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row거치형
2nd row거치형
3rd row거치형
4th row거치형
5th row거치형

Common Values

ValueCountFrequency (%)
거치형 273
87.2%
이동형 40
 
12.8%

Length

2024-01-10T06:17:59.022033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:17:59.098689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
거치형 273
87.2%
이동형 40
 
12.8%

장비상태
Categorical

Distinct3
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
신제품
216 
중고제품
86 
기타
 
11

Length

Max length4
Median length3
Mean length3.2396166
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신제품
2nd row중고제품
3rd row중고제품
4th row중고제품
5th row중고제품

Common Values

ValueCountFrequency (%)
신제품 216
69.0%
중고제품 86
 
27.5%
기타 11
 
3.5%

Length

2024-01-10T06:17:59.185645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:17:59.283870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신제품 216
69.0%
중고제품 86
 
27.5%
기타 11
 
3.5%

판매회사
Text

MISSING 

Distinct38
Distinct (%)76.0%
Missing263
Missing (%)84.0%
Memory size2.6 KiB
2024-01-10T06:17:59.450857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length6.1
Min length2

Characters and Unicode

Total characters305
Distinct characters94
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)62.0%

Sample

1st row(주)한결메디칼
2nd row바텍(주)
3rd row(주)바텍
4th row한결메디칼
5th row영한엑스레이(주)
ValueCountFrequency (%)
한결메디칼 5
 
10.0%
리스템 3
 
6.0%
바텍 3
 
6.0%
리스템대전충남북대리점 2
 
4.0%
제노레이 2
 
4.0%
바텍코리아 2
 
4.0%
포인트닉스 2
 
4.0%
주)아시아방사선 1
 
2.0%
도시바메디칼시스템즈코리아 1
 
2.0%
주)코메드메디칼 1
 
2.0%
Other values (28) 28
56.0%
2024-01-10T06:17:59.741238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
5.6%
16
 
5.2%
16
 
5.2%
14
 
4.6%
) 14
 
4.6%
( 14
 
4.6%
13
 
4.3%
12
 
3.9%
10
 
3.3%
9
 
3.0%
Other values (84) 170
55.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 275
90.2%
Close Punctuation 14
 
4.6%
Open Punctuation 14
 
4.6%
Uppercase Letter 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
6.2%
16
 
5.8%
16
 
5.8%
14
 
5.1%
13
 
4.7%
12
 
4.4%
10
 
3.6%
9
 
3.3%
9
 
3.3%
9
 
3.3%
Other values (80) 150
54.5%
Uppercase Letter
ValueCountFrequency (%)
T 1
50.0%
I 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 275
90.2%
Common 28
 
9.2%
Latin 2
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
6.2%
16
 
5.8%
16
 
5.8%
14
 
5.1%
13
 
4.7%
12
 
4.4%
10
 
3.6%
9
 
3.3%
9
 
3.3%
9
 
3.3%
Other values (80) 150
54.5%
Common
ValueCountFrequency (%)
) 14
50.0%
( 14
50.0%
Latin
ValueCountFrequency (%)
T 1
50.0%
I 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 275
90.2%
ASCII 30
 
9.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
 
6.2%
16
 
5.8%
16
 
5.8%
14
 
5.1%
13
 
4.7%
12
 
4.4%
10
 
3.6%
9
 
3.3%
9
 
3.3%
9
 
3.3%
Other values (80) 150
54.5%
ASCII
ValueCountFrequency (%)
) 14
46.7%
( 14
46.7%
T 1
 
3.3%
I 1
 
3.3%

장치명칭
Categorical

Distinct16
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
치과진단용 엑스선 발생장치
99 
진단용 엑스선 장치
94 
진단용 엑스선 발생기
71 
치과용 전산화 단층 촬영장치
20 
전산화 단층 촬영장치
 
9
Other values (11)
20 

Length

Max length15
Median length14
Mean length11.776358
Min length7

Unique

Unique7 ?
Unique (%)2.2%

Sample

1st row진단용 엑스선 발생기
2nd row진단용 엑스선 장치
3rd row유방촬영용장치
4th row진단용 엑스선 발생기
5th row진단용 엑스선 장치

Common Values

ValueCountFrequency (%)
치과진단용 엑스선 발생장치 99
31.6%
진단용 엑스선 장치 94
30.0%
진단용 엑스선 발생기 71
22.7%
치과용 전산화 단층 촬영장치 20
 
6.4%
전산화 단층 촬영장치 9
 
2.9%
유방촬영용장치 5
 
1.6%
유방촬영용 장치 3
 
1.0%
유방촬영용 장치 등 3
 
1.0%
진단용엑스선촬영장치 2
 
0.6%
전신용엑스선골밀도측정기 1
 
0.3%
Other values (6) 6
 
1.9%

Length

2024-01-10T06:17:59.859649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
엑스선 264
28.4%
진단용 165
17.8%
장치 100
 
10.8%
치과진단용 99
 
10.7%
발생장치 99
 
10.7%
발생기 71
 
7.7%
전산화 29
 
3.1%
단층 29
 
3.1%
촬영장치 29
 
3.1%
치과용 20
 
2.2%
Other values (11) 23
 
2.5%
Distinct207
Distinct (%)66.1%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2024-01-10T06:18:00.111237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length42
Mean length9.5463259
Min length3

Characters and Unicode

Total characters2988
Distinct characters73
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique159 ?
Unique (%)50.8%

Sample

1st rowDEXXUM T
2nd rowMXHF-1500R
3rd rowAffinity Mammography System & Accessories
4th rowOSTEOPRIMA
5th rowDKⅡ-525RF
ValueCountFrequency (%)
dexxum 18
 
4.0%
max-gls 18
 
4.0%
t 18
 
4.0%
zeus 8
 
1.8%
esx 6
 
1.3%
max-gl 6
 
1.3%
point 5
 
1.1%
pht-30lfo 5
 
1.1%
dxg-5125 4
 
0.9%
제우스 4
 
0.9%
Other values (267) 353
79.3%
2024-01-10T06:18:00.481486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 227
 
7.6%
- 222
 
7.4%
X 176
 
5.9%
R 132
 
4.4%
132
 
4.4%
S 128
 
4.3%
A 120
 
4.0%
D 119
 
4.0%
E 104
 
3.5%
T 98
 
3.3%
Other values (63) 1530
51.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1583
53.0%
Decimal Number 563
 
18.8%
Lowercase Letter 462
 
15.5%
Dash Punctuation 222
 
7.4%
Space Separator 132
 
4.4%
Other Letter 12
 
0.4%
Other Punctuation 6
 
0.2%
Letter Number 6
 
0.2%
Math Symbol 2
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
X 176
 
11.1%
R 132
 
8.3%
S 128
 
8.1%
A 120
 
7.6%
D 119
 
7.5%
E 104
 
6.6%
T 98
 
6.2%
M 90
 
5.7%
O 79
 
5.0%
H 74
 
4.7%
Other values (16) 463
29.2%
Lowercase Letter
ValueCountFrequency (%)
a 55
11.9%
o 49
10.6%
i 46
 
10.0%
r 36
 
7.8%
e 33
 
7.1%
t 31
 
6.7%
n 26
 
5.6%
s 23
 
5.0%
y 22
 
4.8%
m 21
 
4.5%
Other values (16) 120
26.0%
Decimal Number
ValueCountFrequency (%)
0 227
40.3%
5 88
 
15.6%
3 65
 
11.5%
2 64
 
11.4%
1 62
 
11.0%
6 23
 
4.1%
7 12
 
2.1%
4 10
 
1.8%
8 10
 
1.8%
9 2
 
0.4%
Other Letter
ValueCountFrequency (%)
4
33.3%
4
33.3%
4
33.3%
Letter Number
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Other Punctuation
ValueCountFrequency (%)
& 3
50.0%
/ 3
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 222
100.0%
Space Separator
ValueCountFrequency (%)
132
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2050
68.6%
Common 925
31.0%
Hangul 12
 
0.4%
Greek 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
X 176
 
8.6%
R 132
 
6.4%
S 128
 
6.2%
A 120
 
5.9%
D 119
 
5.8%
E 104
 
5.1%
T 98
 
4.8%
M 90
 
4.4%
O 79
 
3.9%
H 74
 
3.6%
Other values (44) 930
45.4%
Common
ValueCountFrequency (%)
0 227
24.5%
- 222
24.0%
132
14.3%
5 88
 
9.5%
3 65
 
7.0%
2 64
 
6.9%
1 62
 
6.7%
6 23
 
2.5%
7 12
 
1.3%
4 10
 
1.1%
Other values (5) 20
 
2.2%
Hangul
ValueCountFrequency (%)
4
33.3%
4
33.3%
4
33.3%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2969
99.4%
Hangul 12
 
0.4%
Number Forms 6
 
0.2%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 227
 
7.6%
- 222
 
7.5%
X 176
 
5.9%
R 132
 
4.4%
132
 
4.4%
S 128
 
4.3%
A 120
 
4.0%
D 119
 
4.0%
E 104
 
3.5%
T 98
 
3.3%
Other values (56) 1511
50.9%
Hangul
ValueCountFrequency (%)
4
33.3%
4
33.3%
4
33.3%
Number Forms
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
None
ValueCountFrequency (%)
α 1
100.0%
Distinct100
Distinct (%)31.9%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2024-01-10T06:18:00.655996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/