Overview

Dataset statistics

Number of variables15
Number of observations205
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.4 KiB
Average record size in memory121.6 B

Variable types

Numeric1
Text5
Categorical7
DateTime2

Dataset

Description「폐기물관리법」 시행규칙 제18조제1항에 따라 ① 「대기환경보전법」, 「물환경보전법」 또는 「소음·진동관리법」에 따른 배출시설을 설치·운영하는 자로서 폐기물을 1일 평균 100킬로그램 이상 배출하는자, ② 「폐기물관리법」 시행령 제2조제1호부터 제5호까지의 시설을 설치·운영하는 자로서 폐기물을 1일 평균 100킬로그램 이상 배출하는자, ③ 폐기물을 1일 평균 300킬로그램 이상 배출하는 자가 사업장폐기물배출자 신고에 대한 현황 자료임.
Author충청남도 청양군
URLhttps://www.data.go.kr/data/15060144/fileData.do

Alerts

업무구분 has constant value ""Constant
데이터기준일자 has constant value ""Constant
전 화 번 호 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
사업자 등록번호 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
사업장지번주소 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
처리방법 is highly overall correlated with 폐기물자가처리방법High correlation
폐기물자가처리방법 is highly overall correlated with 처리방법High correlation
순번 is highly overall correlated with 사업자 등록번호 and 3 other fieldsHigh correlation
폐기물구분 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:54:17.476610
Analysis finished2023-12-12 21:54:19.481989
Duration2.01 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct205
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean103
Minimum1
Maximum205
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-13T06:54:19.552993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11.2
Q152
median103
Q3154
95-th percentile194.8
Maximum205
Range204
Interquartile range (IQR)102

Descriptive statistics

Standard deviation59.322565
Coefficient of variation (CV)0.57594723
Kurtosis-1.2
Mean103
Median Absolute Deviation (MAD)51
Skewness0
Sum21115
Variance3519.1667
MonotonicityStrictly increasing
2023-12-13T06:54:19.701176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
142 1
 
0.5%
132 1
 
0.5%
133 1
 
0.5%
134 1
 
0.5%
135 1
 
0.5%
136 1
 
0.5%
137 1
 
0.5%
138 1
 
0.5%
139 1
 
0.5%
Other values (195) 195
95.1%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
205 1
0.5%
204 1
0.5%
203 1
0.5%
202 1
0.5%
201 1
0.5%
200 1
0.5%
199 1
0.5%
198 1
0.5%
197 1
0.5%
196 1
0.5%
Distinct54
Distinct (%)26.3%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T06:54:19.937658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length18
Mean length9.3902439
Min length2

Characters and Unicode

Total characters1925
Distinct characters127
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)10.7%

Sample

1st row청흥버섯영농조합법인
2nd row뉴콘
3rd row뉴콘
4th row청양군청
5th row(사)운곡2산업단지운영협의회
ValueCountFrequency (%)
청양군청 20
 
8.0%
주)으뜸농산 16
 
6.4%
애경케미칼 14
 
5.6%
주식회사 14
 
5.6%
청양1공장 14
 
5.6%
애경케미칼(주)청양2공장 14
 
5.6%
환경시설관리주식회사 13
 
5.2%
매일유업(주)청양공장 12
 
4.8%
애경산업(주 9
 
3.6%
주)보민환경 8
 
3.2%
Other values (53) 115
46.2%
2023-12-13T06:54:20.395083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
151
 
7.8%
( 132
 
6.9%
) 132
 
6.9%
109
 
5.7%
94
 
4.9%
63
 
3.3%
61
 
3.2%
54
 
2.8%
53
 
2.8%
51
 
2.6%
Other values (117) 1025
53.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1585
82.3%
Open Punctuation 132
 
6.9%
Close Punctuation 132
 
6.9%
Space Separator 44
 
2.3%
Decimal Number 29
 
1.5%
Other Symbol 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
151
 
9.5%
109
 
6.9%
94
 
5.9%
63
 
4.0%
61
 
3.8%
54
 
3.4%
53
 
3.3%
51
 
3.2%
48
 
3.0%
40
 
2.5%
Other values (111) 861
54.3%
Decimal Number
ValueCountFrequency (%)
2 15
51.7%
1 14
48.3%
Open Punctuation
ValueCountFrequency (%)
( 132
100.0%
Close Punctuation
ValueCountFrequency (%)
) 132
100.0%
Space Separator
ValueCountFrequency (%)
44
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1588
82.5%
Common 337
 
17.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
151
 
9.5%
109
 
6.9%
94
 
5.9%
63
 
4.0%
61
 
3.8%
54
 
3.4%
53
 
3.3%
51
 
3.2%
48
 
3.0%
40
 
2.5%
Other values (112) 864
54.4%
Common
ValueCountFrequency (%)
( 132
39.2%
) 132
39.2%
44
 
13.1%
2 15
 
4.5%
1 14
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1585
82.3%
ASCII 337
 
17.5%
None 3
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
151
 
9.5%
109
 
6.9%
94
 
5.9%
63
 
4.0%
61
 
3.8%
54
 
3.4%
53
 
3.3%
51
 
3.2%
48
 
3.0%
40
 
2.5%
Other values (111) 861
54.3%
ASCII
ValueCountFrequency (%)
( 132
39.2%
) 132
39.2%
44
 
13.1%
2 15
 
4.5%
1 14
 
4.2%
None
ValueCountFrequency (%)
3
100.0%

사업자 등록번호
Categorical

HIGH CORRELATION 

Distinct47
Distinct (%)22.9%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
113-86-63305
28 
307-83-01257
24 
307-81-17793
16 
339-81-01041
13 
307-85-02513
 
9
Other values (42)
115 

Length

Max length12
Median length12
Mean length11.726829
Min length4

Unique

Unique17 ?
Unique (%)8.3%

Sample

1st row307-81-08106
2nd row438-81-00636
3rd row438-81-00636
4th row307-83-01257
5th row310-82-06756

Common Values

ValueCountFrequency (%)
113-86-63305 28
 
13.7%
307-83-01257 24
 
11.7%
307-81-17793 16
 
7.8%
339-81-01041 13
 
6.3%
307-85-02513 9
 
4.4%
844-81-00466 9
 
4.4%
310-81-18099 8
 
3.9%
310-85-12375 8
 
3.9%
307-81-05152 7
 
3.4%
<NA> 7
 
3.4%
Other values (37) 76
37.1%

Length

2023-12-13T06:54:20.565555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
113-86-63305 28
 
13.7%
307-83-01257 24
 
11.7%
307-81-17793 16
 
7.8%
339-81-01041 13
 
6.3%
307-85-02513 9
 
4.4%
844-81-00466 9
 
4.4%
310-81-18099 8
 
3.9%
310-85-12375 8
 
3.9%
na 7
 
3.4%
307-81-05152 7
 
3.4%
Other values (37) 76
37.1%

전 화 번 호
Categorical

HIGH CORRELATION 

Distinct39
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
<NA>
88 
041-943-9381
13 
041-940-3472
041-940-5210
041-942-2911
 
8
Other values (34)
78 

Length

Max length13
Median length12
Mean length8.5658537
Min length4

Unique

Unique18 ?
Unique (%)8.8%

Sample

1st row041-943-4973
2nd row<NA>
3rd row<NA>
4th row041-940-4826
5th row041-942-7474

Common Values

ValueCountFrequency (%)
<NA> 88
42.9%
041-943-9381 13
 
6.3%
041-940-3472 9
 
4.4%
041-940-5210 9
 
4.4%
041-942-2911 8
 
3.9%
041-943-7436 8
 
3.9%
041-943-6681 7
 
3.4%
041-943-6670 5
 
2.4%
041-943-4482 5
 
2.4%
041-942-0707 5
 
2.4%
Other values (29) 48
23.4%

Length

2023-12-13T06:54:20.716113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 88
42.5%
041-943-9381 13
 
6.3%
041-940-3472 9
 
4.3%
041-940-5210 9
 
4.3%
041-942-2911 8
 
3.9%
041-943-7436 8
 
3.9%
041-943-6681 7
 
3.4%
041-943-4482 5
 
2.4%
041-942-0707 5
 
2.4%
041-943-6670 5
 
2.4%
Other values (31) 50
24.2%
Distinct60
Distinct (%)29.3%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
Minimum1995-02-05 00:00:00
Maximum2022-04-14 00:00:00
2023-12-13T06:54:20.875851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:54:21.020250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

폐기물구분
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
배출시설계
153 
24 
생활계
 
14
비배출시설계
 
12
<NA>
 
2

Length

Max length6
Median length5
Mean length4.4439024
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row배출시설계
2nd row생활계
3rd row배출시설계
4th row배출시설계
5th row배출시설계

Common Values

ValueCountFrequency (%)
배출시설계 153
74.6%
24
 
11.7%
생활계 14
 
6.8%
비배출시설계 12
 
5.9%
<NA> 2
 
1.0%

Length

2023-12-13T06:54:21.158839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:54:21.280750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
배출시설계 153
84.5%
생활계 14
 
7.7%
비배출시설계 12
 
6.6%
na 2
 
1.1%
Distinct55
Distinct (%)26.8%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T06:54:21.562395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length38
Mean length10.941463
Min length1

Characters and Unicode

Total characters2243
Distinct characters134
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)14.6%

Sample

1st row그 밖의 식물성잔재물
2nd row폐콘크리트
3rd row그 밖의 공정오니
4th row그 밖의 폐목재류
5th row그 밖의 폐수처리오니
ValueCountFrequency (%)
61
14.6%
밖의 61
14.6%
폐합성수지류(폐염화비닐수지류는 36
 
8.6%
제외한다 36
 
8.6%
폐수처리오니 28
 
6.7%
폐합성수지류 20
 
4.8%
식물성잔재물 12
 
2.9%
폐기물 11
 
2.6%
공정오니 9
 
2.2%
폐콘크리트 9
 
2.2%
Other values (81) 135
32.3%
2023-12-13T06:54:21.953946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
219
 
9.8%
205
 
9.1%
133
 
5.9%
117
 
5.2%
98
 
4.4%
88
 
3.9%
71
 
3.2%
62
 
2.8%
61
 
2.7%
61
 
2.7%
Other values (124) 1128
50.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1936
86.3%
Space Separator 219
 
9.8%
Open Punctuation 43
 
1.9%
Close Punctuation 43
 
1.9%
Connector Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
205
 
10.6%
133
 
6.9%
117
 
6.0%
98
 
5.1%
88
 
4.5%
71
 
3.7%
62
 
3.2%
61
 
3.2%
61
 
3.2%
59
 
3.0%
Other values (120) 981
50.7%
Space Separator
ValueCountFrequency (%)
219
100.0%
Open Punctuation
ValueCountFrequency (%)
( 43
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1936
86.3%
Common 307
 
13.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
205
 
10.6%
133
 
6.9%
117
 
6.0%
98
 
5.1%
88
 
4.5%
71
 
3.7%
62
 
3.2%
61
 
3.2%
61
 
3.2%
59
 
3.0%
Other values (120) 981
50.7%
Common
ValueCountFrequency (%)
219
71.3%
( 43
 
14.0%
) 43
 
14.0%
_ 2
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1931
86.1%
ASCII 307
 
13.7%
Compat Jamo 5
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
219
71.3%
( 43
 
14.0%
) 43
 
14.0%
_ 2
 
0.7%
Hangul
ValueCountFrequency (%)
205
 
10.6%
133
 
6.9%
117
 
6.1%
98
 
5.1%
88
 
4.6%
71
 
3.7%
62
 
3.2%
61
 
3.2%
61
 
3.2%
59
 
3.1%
Other values (119) 976
50.5%
Compat Jamo
ValueCountFrequency (%)
5
100.0%
Distinct110
Distinct (%)53.7%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T06:54:22.150600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length15
Mean length7.0926829
Min length1

Characters and Unicode

Total characters1454
Distinct characters170
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique69 ?
Unique (%)33.7%

Sample

1st row(주)전국로지스
2nd row더블유아이케이환경(주)
3rd row더블유아이케이환경(주)
4th row지씨테크(주)공주공장
5th row(주)건영종합환경
ValueCountFrequency (%)
주)태건환경건설 12
 
5.6%
주)보은산업 9
 
4.2%
주)보민환경 7
 
3.3%
상록수환경(주 6
 
2.8%
신한환경(주 6
 
2.8%
유영산업 6
 
2.8%
진도산업 5
 
2.3%
하얀환경개발(주 5
 
2.3%
지씨테크(주)공주공장 4
 
1.9%
유림환경(합 4
 
1.9%
Other values (106) 150
70.1%
2023-12-13T06:54:22.794218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 149
 
10.2%
) 149
 
10.2%
141
 
9.7%
96
 
6.6%
90
 
6.2%
44
 
3.0%
43
 
3.0%
28
 
1.9%
25
 
1.7%
21
 
1.4%
Other values (160) 668
45.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1126
77.4%
Open Punctuation 149
 
10.2%
Close Punctuation 149
 
10.2%
Space Separator 19
 
1.3%
Connector Punctuation 3
 
0.2%
Uppercase Letter 3
 
0.2%
Dash Punctuation 2
 
0.1%
Lowercase Letter 2
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
141
 
12.5%
96
 
8.5%
90
 
8.0%
44
 
3.9%
43
 
3.8%
28
 
2.5%
25
 
2.2%
21
 
1.9%
21
 
1.9%
17
 
1.5%
Other values (149) 600
53.3%
Uppercase Letter
ValueCountFrequency (%)
G 1
33.3%
N 1
33.3%
E 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
p 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 149
100.0%
Close Punctuation
ValueCountFrequency (%)
) 149
100.0%
Space Separator
ValueCountFrequency (%)
19
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1126
77.4%
Common 323
 
22.2%
Latin 5
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
141
 
12.5%
96
 
8.5%
90
 
8.0%
44
 
3.9%
43
 
3.8%
28
 
2.5%
25
 
2.2%
21
 
1.9%
21
 
1.9%
17
 
1.5%
Other values (149) 600
53.3%
Common
ValueCountFrequency (%)
( 149
46.1%
) 149
46.1%
19
 
5.9%
_ 3
 
0.9%
- 2
 
0.6%
& 1
 
0.3%
Latin
ValueCountFrequency (%)
G 1
20.0%
N 1
20.0%
E 1
20.0%