Overview

Dataset statistics

Number of variables11
Number of observations814
Missing cells51
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory71.7 KiB
Average record size in memory90.2 B

Variable types

Numeric2
Categorical2
Text6
DateTime1

Dataset

Description충청남도 금산군 사업장폐기물배출자 신고현황(사업장 상호, 소재지, 도로명주소, 전화번호, 폐기물종류 등) 안내입니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=382&beforeMenuCd=DOM_000000201001001000&publicdatapk=15060379

Alerts

폐기물구분(사업장일반폐기물지정폐기물) has constant value ""Constant
데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 신고기준년도High correlation
신고기준년도 is highly overall correlated with 연번High correlation
사업자등록번호 has 51 (6.3%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-09 20:42:00.954641
Analysis finished2024-01-09 20:42:02.267162
Duration1.31 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct814
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean407.5
Minimum1
Maximum814
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.3 KiB
2024-01-10T05:42:02.332929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile41.65
Q1204.25
median407.5
Q3610.75
95-th percentile773.35
Maximum814
Range813
Interquartile range (IQR)406.5

Descriptive statistics

Standard deviation235.12585
Coefficient of variation (CV)0.57699596
Kurtosis-1.2
Mean407.5
Median Absolute Deviation (MAD)203.5
Skewness0
Sum331705
Variance55284.167
MonotonicityStrictly increasing
2024-01-10T05:42:02.474785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
548 1
 
0.1%
538 1
 
0.1%
539 1
 
0.1%
540 1
 
0.1%
541 1
 
0.1%
542 1
 
0.1%
543 1
 
0.1%
544 1
 
0.1%
545 1
 
0.1%
Other values (804) 804
98.8%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
814 1
0.1%
813 1
0.1%
812 1
0.1%
811 1
0.1%
810 1
0.1%
809 1
0.1%
808 1
0.1%
807 1
0.1%
806 1
0.1%
805 1
0.1%
Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.5 KiB
사업장일반폐기물
814 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장일반폐기물
2nd row사업장일반폐기물
3rd row사업장일반폐기물
4th row사업장일반폐기물
5th row사업장일반폐기물

Common Values

ValueCountFrequency (%)
사업장일반폐기물 814
100.0%

Length

2024-01-10T05:42:02.605289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:42:02.708627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장일반폐기물 814
100.0%
Distinct328
Distinct (%)40.3%
Missing0
Missing (%)0.0%
Memory size6.5 KiB
2024-01-10T05:42:02.957793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length8.54914
Min length1

Characters and Unicode

Total characters6959
Distinct characters283
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)22.5%

Sample

1st row금산환경재생산업(주)
2nd row대광무역 방치폐기물
3rd row대광무역 방치폐기물
4th row대광무역 방치폐기물
5th row승원
ValueCountFrequency (%)
주식회사 35
 
3.8%
금산공장 35
 
3.8%
한국타이어앤테크놀로지(주 26
 
2.8%
주)모던이앤알 23
 
2.5%
인선지에스(주 20
 
2.2%
인선기업(주 19
 
2.1%
주)시공아트 18
 
2.0%
두리화장품(주 16
 
1.7%
한국타이어(주)금산공장 14
 
1.5%
대신산업 14
 
1.5%
Other values (332) 698
76.0%
2024-01-10T05:42:03.388035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
622
 
8.9%
( 594
 
8.5%
) 594
 
8.5%
276
 
4.0%
203
 
2.9%
163
 
2.3%
158
 
2.3%
156
 
2.2%
132
 
1.9%
118
 
1.7%
Other values (273) 3943
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5603
80.5%
Open Punctuation 594
 
8.5%
Close Punctuation 594
 
8.5%
Space Separator 118
 
1.7%
Decimal Number 32
 
0.5%
Uppercase Letter 14
 
0.2%
Other Punctuation 3
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
622
 
11.1%
276
 
4.9%
203
 
3.6%
163
 
2.9%
158
 
2.8%
156
 
2.8%
132
 
2.4%
117
 
2.1%
107
 
1.9%
104
 
1.9%
Other values (256) 3565
63.6%
Decimal Number
ValueCountFrequency (%)
2 16
50.0%
0 4
 
12.5%
1 4
 
12.5%
3 4
 
12.5%
4 3
 
9.4%
7 1
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
G 3
21.4%
E 3
21.4%
S 3
21.4%
P 3
21.4%
C 1
 
7.1%
J 1
 
7.1%
Open Punctuation
ValueCountFrequency (%)
( 594
100.0%
Close Punctuation
ValueCountFrequency (%)
) 594
100.0%
Space Separator
ValueCountFrequency (%)
118
100.0%
Other Punctuation
ValueCountFrequency (%)
& 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5603
80.5%
Common 1342
 
19.3%
Latin 14
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
622
 
11.1%
276
 
4.9%
203
 
3.6%
163
 
2.9%
158
 
2.8%
156
 
2.8%
132
 
2.4%
117
 
2.1%
107
 
1.9%
104
 
1.9%
Other values (256) 3565
63.6%
Common
ValueCountFrequency (%)
( 594
44.3%
) 594
44.3%
118
 
8.8%
2 16
 
1.2%
0 4
 
0.3%
1 4
 
0.3%
3 4
 
0.3%
4 3
 
0.2%
& 3
 
0.2%
7 1
 
0.1%
Latin
ValueCountFrequency (%)
G 3
21.4%
E 3
21.4%
S 3
21.4%
P 3
21.4%
C 1
 
7.1%
J 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5603
80.5%
ASCII 1356
 
19.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
622
 
11.1%
276
 
4.9%
203
 
3.6%
163
 
2.9%
158
 
2.8%
156
 
2.8%
132
 
2.4%
117
 
2.1%
107
 
1.9%
104
 
1.9%
Other values (256) 3565
63.6%
ASCII
ValueCountFrequency (%)
( 594
43.8%
) 594
43.8%
118
 
8.7%
2 16
 
1.2%
0 4
 
0.3%
1 4
 
0.3%
3 4
 
0.3%
4 3
 
0.2%
G 3
 
0.2%
E 3
 
0.2%
Other values (7) 13
 
1.0%
Distinct105
Distinct (%)12.9%
Missing0
Missing (%)0.0%
Memory size6.5 KiB
2024-01-10T05:42:03.641008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length64
Mean length10.124079
Min length1

Characters and Unicode

Total characters8241
Distinct characters192
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)4.4%

Sample

1st row
2nd row폐유리
3rd row폐유리
4th row폐유리
5th row그 밖의 분진
ValueCountFrequency (%)
제외한다 137
 
9.8%
폐합성수지류(폐염화비닐수지류는 122
 
8.7%
100
 
7.1%
밖의 100
 
7.1%
폐합성수지류 69
 
4.9%
사업장폐기물 53
 
3.8%
폐수처리오니 52
 
3.7%
식물성잔재물 47
 
3.4%
폐합성고무류 28
 
2.0%
폐합성수지 24
 
1.7%
Other values (160) 668
47.7%
2024-01-10T05:42:03.999341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
761
 
9.2%
714
 
8.7%
429
 
5.2%
418
 
5.1%
363
 
4.4%
323
 
3.9%
267
 
3.2%
245
 
3.0%
181
 
2.2%
169
 
2.1%
Other values (182) 4371
53.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7127
86.5%
Space Separator 714
 
8.7%
Open Punctuation 161
 
2.0%
Close Punctuation 161
 
2.0%
Connector Punctuation 63
 
0.8%
Decimal Number 15
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
761
 
10.7%
429
 
6.0%
418
 
5.9%
363
 
5.1%
323
 
4.5%
267
 
3.7%
245
 
3.4%
181
 
2.5%
169
 
2.4%
157
 
2.2%
Other values (173) 3814
53.5%
Decimal Number
ValueCountFrequency (%)
1 11
73.3%
2 3
 
20.0%
3 1
 
6.7%
Open Punctuation
ValueCountFrequency (%)
( 160
99.4%
1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 160
99.4%
1
 
0.6%
Space Separator
ValueCountFrequency (%)
714
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7127
86.5%
Common 1114
 
13.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
761
 
10.7%
429
 
6.0%
418
 
5.9%
363
 
5.1%
323
 
4.5%
267
 
3.7%
245
 
3.4%
181
 
2.5%
169
 
2.4%
157
 
2.2%
Other values (173) 3814
53.5%
Common
ValueCountFrequency (%)
714
64.1%
( 160
 
14.4%
) 160
 
14.4%
_ 63
 
5.7%
1 11
 
1.0%
2 3
 
0.3%
3 1
 
0.1%
1
 
0.1%
1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7105
86.2%
ASCII 1112
 
13.5%
Compat Jamo 22
 
0.3%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
761
 
10.7%
429
 
6.0%
418
 
5.9%
363
 
5.1%
323
 
4.5%
267
 
3.8%
245
 
3.4%
181
 
2.5%
169
 
2.4%
157
 
2.2%
Other values (172) 3792
53.4%
ASCII
ValueCountFrequency (%)
714
64.2%
( 160
 
14.4%
) 160
 
14.4%
_ 63
 
5.7%
1 11
 
1.0%
2 3
 
0.3%
3 1
 
0.1%
Compat Jamo
ValueCountFrequency (%)
22
100.0%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%

사업자등록번호
Text

MISSING 

Distinct282
Distinct (%)37.0%
Missing51
Missing (%)6.3%
Memory size6.5 KiB
2024-01-10T05:42:04.224382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters9156
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique150 ?
Unique (%)19.7%

Sample

1st row305-26-84981
2nd row597-81-01972
3rd row305-81-82022
4th row305-81-82022
5th row305-81-82022
ValueCountFrequency (%)
305-81-27257 39
 
5.1%
305-85-39523 39
 
5.1%
305-81-65417 23
 
3.0%
305-81-43859 20
 
2.6%
314-81-65181 14
 
1.8%
763-81-01152 13
 
1.7%
305-85-05251 13
 
1.7%
305-81-66639 13
 
1.7%
305-81-50383 11
 
1.4%
305-83-01284 10
 
1.3%
Other values (272) 568
74.4%
2024-01-10T05:42:04.545603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/