Overview

Dataset statistics

Number of variables: 3
Number of observations: 74
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.9 KiB
Average record size in memory: 25.8 B

Variable types

Text: 3

Dataset

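Description: Status data for high-pressure gas storage facilities in Dangjin-si, Chungcheongnam-do. The columns are business name (상호), address (주소), and scale (규모).
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=438&beforeMenuCd=DOM_000000201001001000&publicdatapk=15029694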

Reproduction

Analysis started: 2024-01-09 21:34:28.201729
Analysis finished: 2024-01-09 21:34:28.614432
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
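
The reported duration follows from the two timestamps above. A minimal sketch of that arithmetic using only the Python standard library (the variable names are illustrative, not taken from the report):

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-01-09 21:34:28.201729")
finished = datetime.fromisoformat("2024-01-09 21:34:28.614432")

# Elapsed wall-clock time between the two timestamps, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```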

Variables

상호 (business name)

Distinct: 62
Distinct (%): 83.8%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Length

Max length: 26
Median length: 15.5
Mean length: 9.8243243
Min length: 4
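
As a rough illustration of how such length figures can be derived, the sketch below measures the five sample rows listed for this variable and then recomputes the mean length from the total character count (727) and the number of observations (74) reported elsewhere in this report. It assumes length is counted in Unicode code points, which is what Python's len() does; the report does not state its own counting rule.

```python
from statistics import mean

# The five sample rows shown for this variable (not the full 74-row column).
sample_rows = ["(주)수석", "(주)하이테크이엔브이", "엔씨케이(주)", "당진종합병원", "(주)신한씨에스"]

# len() counts code points, so each Hangul syllable counts as one character.
lengths = [len(value) for value in sample_rows]
print(min(lengths), max(lengths), mean(lengths))

# Mean length over the full column, from the totals reported for it.
total_characters = 727
observations = 74
mean_length = round(total_characters / observations, 7)
print(mean_length)
```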

Characters and Unicode

Total characters: 727
Distinct characters: 144
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
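
A minimal sketch of such a per-character analysis, using Python's standard unicodedata module on the five sample rows of this variable. The mapping from two-letter general-category codes to the long names used in this report is written out by hand here, and the helper name category_counts is hypothetical; this is not necessarily how the report itself computes its tables.

```python
import unicodedata
from collections import Counter

# The five sample rows shown for this variable (not the full column).
sample_rows = ["(주)수석", "(주)하이테크이엔브이", "엔씨케이(주)", "당진종합병원", "(주)신한씨에스"]

# Unicode general-category codes mapped to the long names used in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "Sm": "Math Symbol",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not listed above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

counts = category_counts(sample_rows)
for name, count in counts.most_common():
    print(name, count)
```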

Unique

Unique: 58
Unique (%): 78.4%
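
The sketch below shows one reading of the two counts, assuming "Distinct" is the number of different values and "Unique" is the number of values that occur exactly once. The toy list is hypothetical; the percentages are recomputed from the counts reported for this variable (62 distinct, 58 unique, 74 observations).

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Hypothetical toy column: "A" repeats, "B" and "C" occur once.
toy = ["A", "A", "B", "C"]
print(distinct_and_unique(toy))

# Percentages for this variable, relative to the 74 observations.
observations = 74
distinct_pct = f"{62 / observations:.1%}"
unique_pct = f"{58 / observations:.1%}"
print(distinct_pct, unique_pct)
```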

Sample

1st row: (주)수석
2nd row: (주)하이테크이엔브이
3rd row: 엔씨케이(주)
4th row: 당진종합병원
5th row: (주)신한씨에스

Value                         Count  Frequency (%)
현대제철(주                    10     10.3%
케이지스틸(주                  4      4.1%
한국동서발전(주                4      4.1%
당진화력본부                   4      4.1%
한국동서발전(주)당진화력본부   4      4.1%
일관                           2      2.1%
현대하이스코(주                2      2.1%
삼원중공업(주                  1      1.0%
고로1기                        1      1.0%
제강사무실                     1      1.0%
Other values (64)              64     66.0%

Most occurring characters

ValueCountFrequency (%)
66
 
9.1%
( 64
 
8.8%
) 64
 
8.8%
23
 
3.2%
22
 
3.0%
21
 
2.9%
20
 
2.8%
20
 
2.8%
19
 
2.6%
19
 
2.6%
Other values (134) 389
53.5%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       556    76.5%
Open Punctuation   64     8.8%
Close Punctuation  64     8.8%
Space Separator    23     3.2%
Math Symbol        6      0.8%
Uppercase Letter   6      0.8%
Decimal Number     5      0.7%
Other Punctuation  3      0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
66
 
11.9%
22
 
4.0%
21
 
3.8%
20
 
3.6%
20
 
3.6%
19
 
3.4%
19
 
3.4%
17
 
3.1%
16
 
2.9%
12
 
2.2%
Other values (119) 324
58.3%
Uppercase Letter
Value  Count  Frequency (%)
C      1      16.7%
A      1      16.7%
P      1      16.7%
M      1      16.7%
I      1      16.7%
S      1      16.7%
Decimal Number
Value  Count  Frequency (%)
1      2      40.0%
2      2      40.0%
3      1      20.0%
Math Symbol
Value  Count  Frequency (%)
<      3      50.0%
>      3      50.0%
Open Punctuation
Value  Count  Frequency (%)
(      64     100.0%
Close Punctuation
Value  Count  Frequency (%)
)      64     100.0%
Space Separator
Value    Count  Frequency (%)
(space)  23     100.0%
Other Punctuation
Value  Count  Frequency (%)
,      3      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  556    76.5%
Common  165    22.7%
Latin   6      0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
66
 
11.9%
22
 
4.0%
21
 
3.8%
20
 
3.6%
20
 
3.6%
19
 
3.4%
19
 
3.4%
17
 
3.1%
16
 
2.9%
12
 
2.2%
Other values (119) 324
58.3%
Common
Value    Count  Frequency (%)
(        64     38.8%
)        64     38.8%
(space)  23     13.9%
,        3      1.8%
<        3      1.8%
>        3      1.8%
1        2      1.2%
2        2      1.2%
3        1      0.6%
Latin
Value  Count  Frequency (%)
C      1      16.7%
A      1      16.7%
P      1      16.7%
M      1      16.7%
I      1      16.7%
S      1      16.7%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  556    76.5%
ASCII   171    23.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
66
 
11.9%
22
 
4.0%
21
 
3.8%
20
 
3.6%
20
 
3.6%
19
 
3.4%
19
 
3.4%
17
 
3.1%
16
 
2.9%
12
 
2.2%
Other values (119) 324
58.3%
ASCII
Value             Count  Frequency (%)
(                 64     37.4%
)                 64     37.4%
(space)           23     13.5%
,                 3      1.8%
<                 3      1.8%
>                 3      1.8%
1                 2      1.2%
2                 2      1.2%
3                 1      0.6%
C                 1      0.6%
Other values (5)  5      2.9%

주소 (address)

Distinct: 56
Distinct (%): 75.7%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Length

Max length: 40
Median length: 34
Mean length: 23.972973
Min length: 17

Characters and Unicode

Total characters: 1774
Distinct characters: 115
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 49
Unique (%): 66.2%

Sample

1st row: 충청남도 당진시 합덕읍 인더스파크로 21
2nd row: 충청남도 당진시 정미면 4.4만세로 574
3rd row: 충청남도 당진시 송산면 동곡리 381-5
4th row: 충청남도 당진시 반촌로 5-15, 당진종합병원 (시곡동)
5th row: 충청남도 당진시 합덕읍 면천로 1339

Value               Count  Frequency (%)
충청남도            74     18.8%
당진시              74     18.8%
송악읍              27     6.9%
북부산업로          15     3.8%
석문면              13     3.3%
1480                10     2.5%
송산면              10     2.5%
교로길              8      2.0%
30                  8      2.0%
합덕읍              7      1.8%
Other values (110)  147    37.4%
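
The word counts above look like frequencies of whitespace-separated tokens. A minimal sketch under that assumption, run on the five sample rows only (so the counts differ from the full-column figures); how the report treats punctuation inside tokens is not stated, and this sketch leaves punctuation attached.

```python
from collections import Counter

# The five sample rows shown for the address variable.
sample_rows = [
    "충청남도 당진시 합덕읍 인더스파크로 21",
    "충청남도 당진시 정미면 4.4만세로 574",
    "충청남도 당진시 송산면 동곡리 381-5",
    "충청남도 당진시 반촌로 5-15, 당진종합병원 (시곡동)",
    "충청남도 당진시 합덕읍 면천로 1339",
]

# Split each row on whitespace and count every resulting token.
word_counts = Counter(word for row in sample_rows for word in row.split())
total = sum(word_counts.values())

# Print the three most common tokens with their share of all tokens.
for word, count in word_counts.most_common(3):
    print(word, count, f"{count / total:.1%}")
```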

Most occurring characters

ValueCountFrequency (%)
336
18.9%
78
 
4.4%
77
 
4.3%
76
 
4.3%
74
 
4.2%
74
 
4.2%
74
 
4.2%
74
 
4.2%
1 53
 
3.0%
47
 
2.6%
Other values (105) 811
45.7%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       1102   62.1%
Space Separator    336    18.9%
Decimal Number     279    15.7%
Dash Punctuation   22     1.2%
Open Punctuation   11     0.6%
Close Punctuation  11     0.6%
Other Punctuation  10     0.6%
Uppercase Letter   3      0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
78
 
7.1%
77
 
7.0%
76
 
6.9%
74
 
6.7%
74
 
6.7%
74
 
6.7%
74
 
6.7%
47
 
4.3%
47
 
4.3%
41
 
3.7%
Other values (87) 440
39.9%
Decimal Number
Value  Count  Frequency (%)
1      53     19.0%
4      38     13.6%
3      35     12.5%
0      32     11.5%
2      27     9.7%
8      26     9.3%
6      24     8.6%
7      17     6.1%
5      17     6.1%
9      10     3.6%
Other Punctuation
Value  Count  Frequency (%)
,      9      90.0%
.      1      10.0%
Uppercase Letter
Value  Count  Frequency (%)
A      2      66.7%
C      1      33.3%
Space Separator
Value    Count  Frequency (%)
(space)  336    100.0%
Dash Punctuation
Value  Count  Frequency (%)
-      22     100.0%
Open Punctuation
Value  Count  Frequency (%)
(      11     100.0%
Close Punctuation
Value  Count  Frequency (%)
)      11     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  1102   62.1%
Common  669    37.7%
Latin   3      0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
78
 
7.1%
77
 
7.0%
76
 
6.9%
74
 
6.7%
74
 
6.7%
74
 
6.7%
74
 
6.7%
47
 
4.3%
47
 
4.3%
41
 
3.7%
Other values (87) 440
39.9%
Common
Value             Count  Frequency (%)
(space)           336    50.2%
1                 53     7.9%
4                 38     5.7%
3                 35     5.2%
0                 32     4.8%
2                 27     4.0%
8                 26     3.9%
6                 24     3.6%
-                 22     3.3%
7                 17     2.5%
Other values (6)  59     8.8%
Latin
Value  Count  Frequency (%)
A      2      66.7%
C      1      33.3%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  1102   62.1%
ASCII   672    37.9%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
(space)           336    50.0%
1                 53     7.9%
4                 38     5.7%
3                 35     5.2%
0                 32     4.8%
2                 27     4.0%
8                 26     3.9%
6                 24     3.6%
-                 22     3.3%
7                 17     2.5%
Other values (8)  62     9.2%
Hangul
ValueCountFrequency (%)
78
 
7.1%
77
 
7.0%
76
 
6.9%
74
 
6.7%
74
 
6.7%
74
 
6.7%
74
 
6.7%
47
 
4.3%
47
 
4.3%
41
 
3.7%
Other values (87) 440
39.9%

규모 (scale)

Distinct: 61
Distinct (%): 82.4%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Length

Max length: 232
Median length: 35
Mean length: 20.864865
Min length: 6

Characters and Unicode

Total characters: 1544
Distinct characters: 43
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 50
Unique (%): 67.6%

Sample

1st row: 산소 (60.141톤)
2nd row: 산소 (10.2Kg), 질소 (30Kg/㎠)
3rd row: 질소 (14.54톤)
4th row: 산소 (12.093톤)
5th row: 산소 (15.416톤), 탄산가스 (14.96톤)
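
The sample rows suggest that each value is a comma-separated list of entries of the form "gas name (amount unit)". A minimal sketch of splitting such a value into structured entries, assuming that pattern holds; it was checked only against the sample rows, and the regular expression and the helper name parse_scale are illustrative.

```python
import re

# One entry: a gas name, then "(", a decimal amount, a unit, and ")".
ENTRY = re.compile(r"([^\s,()]+)\s*\((\d+(?:\.\d+)?)([^)]*)\)")

def parse_scale(value):
    """Split a 규모 (scale) value into (gas, amount, unit) tuples."""
    return [
        (gas, float(amount), unit.strip())
        for gas, amount, unit in ENTRY.findall(value)
    ]

# 산소 = oxygen, 질소 = nitrogen, 톤 = ton.
print(parse_scale(" 산소 (60.141톤)"))
print(parse_scale(" 산소 (10.2Kg), 질소 (30Kg/㎠)"))
```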
ValueCountFrequency (%)
0kg/㎠ 43
15.8%
산소 34
12.5%
기타 31
11.4%
25
 
9.2%
탄산가스 21
 
7.7%
질소 19
 
7.0%
아르곤 11
 
4.0%
수소 11
 
4.0%
액화암모니아 7
 
2.6%
0 6
 
2.2%
Other values (54) 64
23.5%

Most occurring characters

ValueCountFrequency (%)
272
17.6%
( 136
 
8.8%
) 136
 
8.8%
0 80
 
5.2%
K 68
 
4.4%
g 68
 
4.4%
/ 66
 
4.3%
66
 
4.3%
64
 
4.1%
, 62
 
4.0%
Other values (33) 526
34.1%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       381    24.7%
Space Separator    272    17.6%
Decimal Number     249    16.1%
Other Punctuation  155    10.0%
Open Punctuation   136    8.8%
Close Punctuation  136    8.8%
Other Symbol       79     5.1%
Uppercase Letter   68     4.4%
Lowercase Letter   68     4.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
64
16.8%
55
14.4%
31
8.1%
31
8.1%
24
 
6.3%
22
 
5.8%
22
 
5.8%
21
 
5.5%
19
 
5.0%
19
 
5.0%
Other values (13) 73
19.2%
Decimal Number
Value  Count  Frequency (%)
0      80     32.1%
1      35     14.1%
9      24     9.6%
4      22     8.8%
2      19     7.6%
6      17     6.8%
7      15     6.0%
8      14     5.6%
3      12     4.8%
5      11     4.4%
Other Punctuation
Value  Count  Frequency (%)
/      66     42.6%
,      62     40.0%
.      27     17.4%
Other Symbol
ValueCountFrequency (%)
66
83.5%
13
 
16.5%
Space Separator
Value    Count  Frequency (%)
(space)  272    100.0%
Open Punctuation
Value  Count  Frequency (%)
(      136    100.0%
Close Punctuation
Value  Count  Frequency (%)
)      136    100.0%
Uppercase Letter
Value  Count  Frequency (%)
K      68     100.0%
Lowercase Letter
Value  Count  Frequency (%)
g      68     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  1027   66.5%
Hangul  381    24.7%
Latin   136    8.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
64
16.8%
55
14.4%
31
8.1%
31
8.1%
24
 
6.3%
22
 
5.8%
22
 
5.8%
21
 
5.5%
19
 
5.0%
19
 
5.0%
Other values (13) 73
19.2%
Common
ValueCountFrequency (%)
272
26.5%
( 136
13.2%
) 136
13.2%
0 80
 
7.8%
/ 66
 
6.4%
66
 
6.4%
, 62
 
6.0%
1 35
 
3.4%
. 27
 
2.6%
9 24
 
2.3%
Other values (8) 123
12.0%
Latin
Value  Count  Frequency (%)
K      68     50.0%
g      68     50.0%

Most occurring blocks

Value       Count  Frequency (%)
ASCII       1084   70.2%
Hangul      381    24.7%
CJK Compat  79     5.1%
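
A Unicode block is a contiguous range of code points, so a block breakdown like the one above can be approximated with simple range checks. The sketch below assumes the report's labels correspond to the Basic Latin (ASCII), Hangul Syllables, and CJK Compatibility blocks; the helper name block_of and the "Other" fallback are hypothetical.

```python
from collections import Counter

# (label, first code point, last code point); labels follow this report.
BLOCKS = [
    ("ASCII", 0x0000, 0x007F),       # Basic Latin
    ("Hangul", 0xAC00, 0xD7AF),      # Hangul Syllables
    ("CJK Compat", 0x3300, 0x33FF),  # CJK Compatibility (includes ㎠)
]

def block_of(ch):
    """Return the block label for one character, or "Other" if unlisted."""
    code_point = ord(ch)
    for label, first, last in BLOCKS:
        if first <= code_point <= last:
            return label
    return "Other"

# Second sample row of this variable, including its leading space.
sample = " 산소 (10.2Kg), 질소 (30Kg/㎠)"
block_counts = Counter(block_of(ch) for ch in sample)
print(block_counts)
```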

Most frequent character per block

ASCII
Value             Count  Frequency (%)
(space)           272    25.1%
(                 136    12.5%
)                 136    12.5%
0                 80     7.4%
K                 68     6.3%
g                 68     6.3%
/                 66     6.1%
,                 62     5.7%
1                 35     3.2%
.                 27     2.5%
Other values (8)  134    12.4%
CJK Compat
ValueCountFrequency (%)
66
83.5%
13
 
16.5%
Hangul
ValueCountFrequency (%)
64
16.8%
55
14.4%
31
8.1%
31
8.1%
24
 
6.3%
22
 
5.8%
22
 
5.8%
21
 
5.5%
19
 
5.0%
19
 
5.0%
Other values (13) 73
19.2%

Correlations
