Overview

Dataset statistics

Number of variables4
Number of observations79
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory33.7 B

Variable types

Categorical2
Text2

Dataset

Description당진시 모범 숙박업소 현황입니다.(평가구분, 업종, 업소명, 소새지(도로명))
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=450&beforeMenuCd=DOM_000000201001001000&publicdatapk=15052875

Alerts

평가구분 has constant value ""Constant
업소명 has unique valuesUnique
소재지(도로명) has unique valuesUnique

Reproduction

Analysis started2024-01-09 20:11:56.735624
Analysis finished2024-01-09 20:11:57.204023
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

평가구분
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size764.0 B
녹색(최우수)
79 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row녹색(최우수)
2nd row녹색(최우수)
3rd row녹색(최우수)
4th row녹색(최우수)
5th row녹색(최우수)

Common Values

ValueCountFrequency (%)
녹색(최우수) 79
100.0%

Length

2024-01-10T05:11:57.294231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:11:57.408931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
녹색(최우수 79
100.0%

업종
Categorical

Distinct2
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size764.0 B
숙박업(일반)
70 
숙박업(생활)

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
숙박업(일반) 70
88.6%
숙박업(생활) 9
 
11.4%

Length

2024-01-10T05:11:57.535549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:11:57.666040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업(일반 70
88.6%
숙박업(생활 9
 
11.4%

업소명
Text

UNIQUE 

Distinct79
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size764.0 B
2024-01-10T05:11:57.934245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5
Min length2

Characters and Unicode

Total characters395
Distinct characters136
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)100.0%

Sample

1st row석산장여관
2nd row칠복장여관
3rd row유락모텔
4th row당진모텔
5th row동진장모텔
ValueCountFrequency (%)
호텔 2
 
2.4%
석산장여관 1
 
1.2%
무인텔힐링 1
 
1.2%
블루호텔 1
 
1.2%
당진모텔s 1
 
1.2%
블랑모텔 1
 
1.2%
인피니티호텔 1
 
1.2%
짬호텔 1
 
1.2%
모텔케이 1
 
1.2%
모텔비치타운 1
 
1.2%
Other values (72) 72
86.7%
2024-01-10T05:11:58.441293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
 
13.2%
32
 
8.1%
17
 
4.3%
15
 
3.8%
15
 
3.8%
10
 
2.5%
10
 
2.5%
9
 
2.3%
9
 
2.3%
8
 
2.0%
Other values (126) 218
55.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 369
93.4%
Uppercase Letter 13
 
3.3%
Space Separator 4
 
1.0%
Open Punctuation 3
 
0.8%
Close Punctuation 3
 
0.8%
Decimal Number 2
 
0.5%
Lowercase Letter 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
14.1%
32
 
8.7%
17
 
4.6%
15
 
4.1%
15
 
4.1%
10
 
2.7%
10
 
2.7%
9
 
2.4%
9
 
2.4%
8
 
2.2%
Other values (111) 192
52.0%
Uppercase Letter
ValueCountFrequency (%)
X 2
15.4%
O 2
15.4%
S 2
15.4%
E 2
15.4%
D 1
7.7%
K 1
7.7%
Q 1
7.7%
L 1
7.7%
M 1
7.7%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 369
93.4%
Latin 14
 
3.5%
Common 12
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
14.1%
32
 
8.7%
17
 
4.6%
15
 
4.1%
15
 
4.1%
10
 
2.7%
10
 
2.7%
9
 
2.4%
9
 
2.4%
8
 
2.2%
Other values (111) 192
52.0%
Latin
ValueCountFrequency (%)
X 2
14.3%
O 2
14.3%
S 2
14.3%
E 2
14.3%
D 1
7.1%
K 1
7.1%
Q 1
7.1%
a 1
7.1%
L 1
7.1%
M 1
7.1%
Common
ValueCountFrequency (%)
4
33.3%
( 3
25.0%
) 3
25.0%
2 1
 
8.3%
1 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 369
93.4%
ASCII 26
 
6.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
52
 
14.1%
32
 
8.7%
17
 
4.6%
15
 
4.1%
15
 
4.1%
10
 
2.7%
10
 
2.7%
9
 
2.4%
9
 
2.4%
8
 
2.2%
Other values (111) 192
52.0%
ASCII
ValueCountFrequency (%)
4
15.4%
( 3
11.5%
) 3
11.5%
X 2
 
7.7%
O 2
 
7.7%
S 2
 
7.7%
E 2
 
7.7%
D 1
 
3.8%
2 1
 
3.8%
1 1
 
3.8%
Other values (5) 5
19.2%
Distinct79
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size764.0 B
2024-01-10T05:11:58.814909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length27
Mean length22.962025
Min length18

Characters and Unicode

Total characters1814
Distinct characters104
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)100.0%

Sample

1st row충청남도 당진시 당진중앙3로 51 (읍내동)
2nd row충청남도 당진시 합덕읍 중동1길 32-10
3rd row충청남도 당진시 당진중앙2로 33-12 (읍내동)
4th row충청남도 당진시 당진시장북길 37-7 (읍내동)
5th row충청남도 당진시 송악읍 송악로 17
ValueCountFrequency (%)
충청남도 79
19.7%
당진시 79
19.7%
송악읍 28
 
7.0%
읍내동 12
 
3.0%
석문면 12
 
3.0%
신평면 9
 
2.2%
당진중앙2로 8
 
2.0%
반촌로 8
 
2.0%
왜목길 6
 
1.5%
한진포구길 5
 
1.2%
Other values (125) 155
38.7%
2024-01-10T05:11:59.389623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
322
17.8%
97
 
5.3%
94
 
5.2%
85
 
4.7%
84
 
4.6%
79
 
4.4%
79
 
4.4%
79
 
4.4%
1 61
 
3.4%
2 44
 
2.4%
Other values (94) 790
43.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1129
62.2%
Space Separator 322
 
17.8%
Decimal Number 276
 
15.2%
Dash Punctuation 39
 
2.1%
Close Punctuation 21
 
1.2%
Open Punctuation 21
 
1.2%
Other Punctuation 6
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
97
 
8.6%
94
 
8.3%
85
 
7.5%
84
 
7.4%
79
 
7.0%
79
 
7.0%
79
 
7.0%
44
 
3.9%
42
 
3.7%
35
 
3.1%
Other values (79) 411
36.4%
Decimal Number
ValueCountFrequency (%)
1 61
22.1%
2 44
15.9%
3 42
15.2%
6 25
9.1%
5 24
 
8.7%
7 23
 
8.3%
8 19
 
6.9%
4 17
 
6.2%
9 11
 
4.0%
0 10
 
3.6%
Space Separator
ValueCountFrequency (%)
322
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1129
62.2%
Common 685
37.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
97
 
8.6%
94
 
8.3%
85
 
7.5%
84
 
7.4%
79
 
7.0%
79
 
7.0%
79
 
7.0%
44
 
3.9%
42
 
3.7%
35
 
3.1%
Other values (79) 411
36.4%
Common
ValueCountFrequency (%)
322
47.0%
1 61
 
8.9%
2 44
 
6.4%
3 42
 
6.1%
- 39
 
5.7%
6 25
 
3.6%
5 24
 
3.5%
7 23
 
3.4%
) 21
 
3.1%
( 21
 
3.1%
Other values (5) 63
 
9.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1129
62.2%
ASCII 685
37.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
322
47.0%
1 61
 
8.9%
2 44
 
6.4%
3 42
 
6.1%
- 39
 
5.7%
6 25
 
3.6%
5 24
 
3.5%
7 23
 
3.4%
) 21
 
3.1%
( 21
 
3.1%
Other values (5) 63
 
9.2%
Hangul
ValueCountFrequency (%)
97
 
8.6%
94
 
8.3%
85
 
7.5%
84
 
7.4%
79
 
7.0%
79
 
7.0%
79
 
7.0%
44
 
3.9%
42
 
3.7%
35
 
3.1%
Other values (79) 411
36.4%

Correlations

2024-01-10T05:11:59.521281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종업소명소재지(도로명)
업종1.0001.0001.000
업소명1.0001.0001.000
소재지(도로명)1.0001.0001.000

Missing values

2024-01-10T05:11:57.040697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:11:57.158433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

평가구분업종업소명소재지(도로명)
0녹색(최우수)숙박업(일반)석산장여관충청남도 당진시 당진중앙3로 51 (읍내동)
1녹색(최우수)숙박업(일반)칠복장여관충청남도 당진시 합덕읍 중동1길 32-10
2녹색(최우수)숙박업(일반)유락모텔충청남도 당진시 당진중앙2로 33-12 (읍내동)
3녹색(최우수)숙박업(일반)당진모텔충청남도 당진시 당진시장북길 37-7 (읍내동)
4녹색(최우수)숙박업(일반)동진장모텔충청남도 당진시 송악읍 송악로 17
5녹색(최우수)숙박업(일반)삽교호비치파크여관충청남도 당진시 신평면 삽교천2길 35
6녹색(최우수)숙박업(일반)석문모텔충청남도 당진시 석문면 통정3길 36-6
7녹색(최우수)숙박업(일반)가빈장 여관충청남도 당진시 우강면 덕평로 608-4 (8)
8녹색(최우수)숙박업(일반)해돋이모텔충청남도 당진시 석문면 왜목길 37
9녹색(최우수)숙박업(일반)동인파크모텔충청남도 당진시 석문면 왜목길 3
평가구분업종업소명소재지(도로명)
69녹색(최우수)숙박업(일반)(주)호텔로씨오충청남도 당진시 송악읍 한진포구길 30-38
70녹색(최우수)숙박업(생활)제이블리스모텔충청남도 당진시 석문면 왜목길 15-12, 지하1,지상123층
71녹색(최우수)숙박업(생활)해와달 펜션충청남도 당진시 석문면 새골길 74-75
72녹색(최우수)숙박업(생활)펀다이어트캠프충청남도 당진시 대호지면 빈정들길 80-59
73녹색(최우수)숙박업(생활)XO1호텔충청남도 당진시 송악읍 구래1길 26
74녹색(최우수)숙박업(생활)XO2호텔충청남도 당진시 송악읍 구래1길 24
75녹색(최우수)숙박업(생활)라메르펜션충청남도 당진시 석문면 석문해안로 133, 라메르펜션
76녹색(최우수)숙박업(생활)힐링인더삽교충청남도 당진시 신평면 삽교천3길 49-1, 2층
77녹색(최우수)숙박업(생활)하늘빛바다충청남도 당진시 석문면 석문해안로 19-26 (하늘빛바다펜션)
78녹색(최우수)숙박업(생활)왜목펜션충청남도 당진시 석문면 석문해안로 9, 왜목팬션