Overview

Dataset statistics

Number of variables6
Number of observations71
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.6 KiB
Average record size in memory51.9 B

Variable types

Numeric1
Text2
Categorical3

Dataset

Description예산군 관내 옥외 현수막 지정게시대 목록으로 현수막 지정게시대 명칭, 위치(지번 주소), 사용자재, 규격 등의 항목을 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=97&beforeMenuCd=DOM_000000201001001000&publicdatapk=15104182

Alerts

규격 has constant value ""Constant
게시대 형태 is highly imbalanced (74.7%)Imbalance
게시면 is highly imbalanced (68.7%)Imbalance
현수막 지정 게시대명 has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:49:53.322469
Analysis finished2024-01-09 22:49:53.776861
Duration0.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

Distinct70
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.915493
Minimum1
Maximum71
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size771.0 B
2024-01-10T07:49:53.832711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.5
Q118.5
median36
Q353.5
95-th percentile67.5
Maximum71
Range70
Interquartile range (IQR)35

Descriptive statistics

Standard deviation20.734889
Coefficient of variation (CV)0.57732435
Kurtosis-1.2103848
Mean35.915493
Median Absolute Deviation (MAD)18
Skewness-0.0032090341
Sum2550
Variance429.93561
MonotonicityNot monotonic
2024-01-10T07:49:53.962584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 2
 
2.8%
55 1
 
1.4%
53 1
 
1.4%
52 1
 
1.4%
51 1
 
1.4%
50 1
 
1.4%
49 1
 
1.4%
48 1
 
1.4%
47 1
 
1.4%
1 1
 
1.4%
Other values (60) 60
84.5%
ValueCountFrequency (%)
1 1
1.4%
2 1
1.4%
3 1
1.4%
4 1
1.4%
5 1
1.4%
6 1
1.4%
7 1
1.4%
8 1
1.4%
9 1
1.4%
10 2
2.8%
ValueCountFrequency (%)
71 1
1.4%
70 1
1.4%
69 1
1.4%
68 1
1.4%
67 1
1.4%
66 1
1.4%
65 1
1.4%
64 1
1.4%
63 1
1.4%
62 1
1.4%
Distinct71
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size700.0 B
2024-01-10T07:49:54.196653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length17
Mean length12.647887
Min length5

Characters and Unicode

Total characters898
Distinct characters186
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)100.0%

Sample

1st row복합문화센터 삼거리 - 벚꽃로
2nd row신성 APT 보건소
3rd row발연-주공아파트 발연주공아파트 삼거리
4th row한신아파트 신 터미널 육교
5th row한국유통 사거리
ValueCountFrequency (%)
삼거리 20
 
9.1%
사거리 14
 
6.4%
7
 
3.2%
교차로 6
 
2.7%
초입 6
 
2.7%
벚꽃로 6
 
2.7%
면사무소 4
 
1.8%
apt 4
 
1.8%
삽교 4
 
1.8%
로타리 3
 
1.4%
Other values (125) 145
66.2%
2024-01-10T07:49:54.565646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
151
 
16.8%
53
 
5.9%
38
 
4.2%
22
 
2.4%
) 21
 
2.3%
21
 
2.3%
( 20
 
2.2%
20
 
2.2%
20
 
2.2%
17
 
1.9%
Other values (176) 515
57.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 671
74.7%
Space Separator 151
 
16.8%
Close Punctuation 21
 
2.3%
Open Punctuation 20
 
2.2%
Uppercase Letter 17
 
1.9%
Dash Punctuation 10
 
1.1%
Decimal Number 8
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
7.9%
38
 
5.7%
22
 
3.3%
21
 
3.1%
20
 
3.0%
20
 
3.0%
17
 
2.5%
14
 
2.1%
13
 
1.9%
12
 
1.8%
Other values (163) 441
65.7%
Uppercase Letter
ValueCountFrequency (%)
A 5
29.4%
P 4
23.5%
T 4
23.5%
I 2
 
11.8%
C 2
 
11.8%
Decimal Number
ValueCountFrequency (%)
3 3
37.5%
0 2
25.0%
9 2
25.0%
1 1
 
12.5%
Space Separator
ValueCountFrequency (%)
151
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 670
74.6%
Common 210
 
23.4%
Latin 17
 
1.9%
Han 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
7.9%
38
 
5.7%
22
 
3.3%
21
 
3.1%
20
 
3.0%
20
 
3.0%
17
 
2.5%
14
 
2.1%
13
 
1.9%
12
 
1.8%
Other values (162) 440
65.7%
Common
ValueCountFrequency (%)
151
71.9%
) 21
 
10.0%
( 20
 
9.5%
- 10
 
4.8%
3 3
 
1.4%
0 2
 
1.0%
9 2
 
1.0%
1 1
 
0.5%
Latin
ValueCountFrequency (%)
A 5
29.4%
P 4
23.5%
T 4
23.5%
I 2
 
11.8%
C 2
 
11.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 670
74.6%
ASCII 227
 
25.3%
CJK 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
151
66.5%
) 21
 
9.3%
( 20
 
8.8%
- 10
 
4.4%
A 5
 
2.2%
P 4
 
1.8%
T 4
 
1.8%
3 3
 
1.3%
I 2
 
0.9%
C 2
 
0.9%
Other values (3) 5
 
2.2%
Hangul
ValueCountFrequency (%)
53
 
7.9%
38
 
5.7%
22
 
3.3%
21
 
3.1%
20
 
3.0%
20
 
3.0%
17
 
2.5%
14
 
2.1%
13
 
1.9%
12
 
1.8%
Other values (162) 440
65.7%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct69
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size700.0 B
2024-01-10T07:49:54.818071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length21.704225
Min length19

Characters and Unicode

Total characters1541
Distinct characters86
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)94.4%

Sample

1st row충청남도 예산군 예산읍 석양리 229-1
2nd row충청남도 예산군 예산읍 예산리 18-2
3rd row충청남도 예산군 예산읍 발연리 115-2
4th row충청남도 예산군 예산읍 산성리 882
5th row충청남도 예산군 예산읍 산성리 882
ValueCountFrequency (%)
충청남도 71
20.1%
예산군 70
19.8%
예산읍 25
 
7.1%
삽교읍 10
 
2.8%
산성리 9
 
2.5%
고덕면 8
 
2.3%
덕산면 7
 
2.0%
오가면 4
 
1.1%
광시면 4
 
1.1%
대술면 3
 
0.8%
Other values (118) 143
40.4%
2024-01-10T07:49:55.164963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
290
18.8%
121
 
7.9%
99
 
6.4%
72
 
4.7%
71
 
4.6%
71
 
4.6%
71
 
4.6%
71
 
4.6%
71
 
4.6%
- 56
 
3.6%
Other values (76) 548
35.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 923
59.9%
Space Separator 290
 
18.8%
Decimal Number 272
 
17.7%
Dash Punctuation 56
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
121
13.1%
99
10.7%
72
 
7.8%
71
 
7.7%
71
 
7.7%
71
 
7.7%
71
 
7.7%
71
 
7.7%
37
 
4.0%
36
 
3.9%
Other values (64) 203
22.0%
Decimal Number
ValueCountFrequency (%)
1 49
18.0%
2 36
13.2%
3 32
11.8%
6 28
10.3%
7 24
8.8%
8 24
8.8%
9 22
8.1%
4 21
7.7%
5 18
 
6.6%
0 18
 
6.6%
Space Separator
ValueCountFrequency (%)
290
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 56
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 923
59.9%
Common 618
40.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
121
13.1%
99
10.7%
72
 
7.8%
71
 
7.7%
71
 
7.7%
71
 
7.7%
71
 
7.7%
71
 
7.7%
37
 
4.0%
36
 
3.9%
Other values (64) 203
22.0%
Common
ValueCountFrequency (%)
290
46.9%
- 56
 
9.1%
1 49
 
7.9%
2 36
 
5.8%
3 32
 
5.2%
6 28
 
4.5%
7 24
 
3.9%
8 24
 
3.9%
9 22
 
3.6%
4 21
 
3.4%
Other values (2) 36
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 923
59.9%
ASCII 618
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
290
46.9%
- 56
 
9.1%
1 49
 
7.9%
2 36
 
5.8%
3 32
 
5.2%
6 28
 
4.5%
7 24
 
3.9%
8 24
 
3.9%
9 22
 
3.6%
4 21
 
3.4%
Other values (2) 36
 
5.8%
Hangul
ValueCountFrequency (%)
121
13.1%
99
10.7%
72
 
7.8%
71
 
7.7%
71
 
7.7%
71
 
7.7%
71
 
7.7%
71
 
7.7%
37
 
4.0%
36
 
3.9%
Other values (64) 203
22.0%

게시대 형태
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size700.0 B
스텐레스(사다리형 게시대)
68 
스텐레스(사다리형 태양광 게시대)
 
3

Length

Max length18
Median length14
Mean length14.169014
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row스텐레스(사다리형 게시대)
2nd row스텐레스(사다리형 게시대)
3rd row스텐레스(사다리형 게시대)
4th row스텐레스(사다리형 게시대)
5th row스텐레스(사다리형 게시대)

Common Values

ValueCountFrequency (%)
스텐레스(사다리형 게시대) 68
95.8%
스텐레스(사다리형 태양광 게시대) 3
 
4.2%

Length

2024-01-10T07:49:55.301560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:49:55.403846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
스텐레스(사다리형 71
49.0%
게시대 71
49.0%
태양광 3
 
2.1%

규격
Categorical

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size700.0 B
6×0.9
71 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row6×0.9
2nd row6×0.9
3rd row6×0.9
4th row6×0.9
5th row6×0.9

Common Values

ValueCountFrequency (%)
6×0.9 71
100.0%

Length

2024-01-10T07:49:55.493278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:49:55.569145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6×0.9 71
100.0%

게시면
Categorical

IMBALANCE 

Distinct3
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size700.0 B
5
65 
2
 
4
4
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5
2nd row5
3rd row5
4th row5
5th row5

Common Values

ValueCountFrequency (%)
5 65
91.5%
2 4
 
5.6%
4 2
 
2.8%

Length

2024-01-10T07:49:55.668324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:49:55.764955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 65
91.5%
2 4
 
5.6%
4 2
 
2.8%

Interactions

2024-01-10T07:49:53.554203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:49:55.833295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호현수막 지정 게시대명지번주소게시대 형태게시면
번호1.0001.0001.0000.0000.550
현수막 지정 게시대명1.0001.0001.0001.0001.000
지번주소1.0001.0001.0001.0001.000
게시대 형태0.0001.0001.0001.0000.000
게시면0.5501.0001.0000.0001.000
2024-01-10T07:49:55.914906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
게시대 형태게시면
게시대 형태1.0000.000
게시면0.0001.000
2024-01-10T07:49:55.984994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호게시대 형태게시면
번호1.0000.0000.425
게시대 형태0.0001.0000.000
게시면0.4250.0001.000

Missing values

2024-01-10T07:49:53.651743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:49:53.743118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/