Overview

Dataset statistics

Number of variables: 14
Number of observations: 676
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 75.4 KiB
Average record size in memory: 114.2 B
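The average record size follows from the two figures above it: total memory divided by the number of observations. A minimal sketch of that arithmetic, assuming the tool uses 1 KiB = 1024 B (the function name is illustrative, not part of the profiling tool):

```python
# Figures copied from the dataset statistics above.
total_kib = 75.4
n_rows = 676


def average_record_bytes(total_kib: float, n_rows: int) -> float:
    """Convert KiB to bytes (1 KiB = 1024 B) and divide by the row count."""
    return total_kib * 1024 / n_rows


print(f"{average_record_bytes(total_kib, n_rows):.1f} B")  # 114.2 B
```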

Variable types

Categorical: 7
Text: 5
Numeric: 2

Dataset

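Description: Data on environmental pollutant-emitting facilities in Icheon-si, Gyeonggi-do, including business name, representative name, competent authority name, industry name, wastewater management grade, air management grade, wastewater class, and air class.
Author: 경기도 이천시 (Icheon-si, Gyeonggi-do)
URL: https://www.data.go.kr/data/15038211/fileData.do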

Alerts

시군명 has constant value "이천시" (Constant)
관할기관명 has constant value "이천시" (Constant)
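A constant alert means a column holds a single distinct value in every row; here both flagged columns contain only 이천시 across all 676 observations. The check can be sketched in plain Python as follows. The `constant_columns` helper and the stand-in table are illustrative assumptions, not the profiling tool's own code.

```python
from collections.abc import Iterable, Mapping


def constant_columns(table: Mapping[str, Iterable[str]]) -> dict[str, str]:
    """Return {column name: value} for columns that hold exactly one distinct value."""
    constants = {}
    for name, values in table.items():
        distinct = set(values)
        if len(distinct) == 1:
            constants[name] = distinct.pop()
    return constants


# Stand-in table: the counts follow the report, the layout is illustrative.
industry_counts = {"기타": 435, "식료품제조": 120, "비금속광물": 63, "가공금속제품": 38,
                   "기타화학제품제조": 10, "섬유제품제조": 8, "종이제조": 2}
table = {
    "시군명": ["이천시"] * 676,
    "관할기관명": ["이천시"] * 676,
    "업종명": [v for v, c in industry_counts.items() for _ in range(c)],
}

print(constant_columns(table))  # {'시군명': '이천시', '관할기관명': '이천시'}
```

Constant columns carry no information for modelling and are usually safe to drop before further analysis.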

Reproduction

Analysis started: 2023-12-12 00:48:21.780731
Analysis finished: 2023-12-12 00:48:23.747510
Duration: 1.97 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
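The reported duration is simply the difference between the two timestamps. A short check using only the standard library (the format string is an assumption that matches the timestamps as printed):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed layout of the timestamps above

started = datetime.strptime("2023-12-12 00:48:21.780731", FMT)
finished = datetime.strptime("2023-12-12 00:48:23.747510", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 1.97 seconds
```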

Variables

시군명 (city/county name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 5.4 KiB
이천시: 676

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%
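"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts values that occur exactly once. For this constant column that gives one distinct value (1/676, about 0.1%) and no unique values. A minimal sketch, with `distinct_and_unique` as an illustrative helper name:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of distinct values, number of values occurring exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


column = ["이천시"] * 676  # stand-in for the constant column described above
distinct, unique = distinct_and_unique(column)

print(distinct, f"{distinct / len(column):.1%}")  # 1 0.1%
print(unique, f"{unique / len(column):.1%}")      # 0 0.0%
```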

Sample

1st row: 이천시
2nd row: 이천시
3rd row: 이천시
4th row: 이천시
5th row: 이천시

Common Values

Value  Count  Frequency (%)
이천시  676  100.0%

Length

Histogram of lengths of the category

Distinct: 672
Distinct (%): 99.4%
Missing: 0
Missing (%): 0.0%
Memory size: 5.4 KiB

Length

Max length: 25
Median length: 21
Mean length: 8.045858
Min length: 2

Characters and Unicode

Total characters: 5439
Distinct characters: 453
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
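As a rough illustration of such an analysis, Python's standard `unicodedata` module exposes the general category of each character (`Lo` is Other Letter, `Ps` is Open Punctuation, `Pe` is Close Punctuation). The sketch below counts categories over the five sample rows of this variable. It is not the profiling tool's implementation, and the standard library does not expose script or block lookups, so those two breakdowns are left out.

```python
import unicodedata
from collections import Counter


def category_counts(strings):
    """Count Unicode general categories (e.g. 'Lo', 'Ps') over all characters."""
    return Counter(unicodedata.category(ch) for s in strings for ch in s)


# The five sample rows shown for this variable.
sample = ["(주)태명실업", "차사랑모터스", "(주)넥스필", "용담엔지니어링", "(주)정석특장"]
counts = category_counts(sample)

print(counts["Lo"], counts["Ps"], counts["Pe"])  # 27 3 3
```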

Unique

Unique: 668
Unique (%): 98.8%

Sample

1st row: (주)태명실업
2nd row: 차사랑모터스
3rd row: (주)넥스필
4th row: 용담엔지니어링
5th row: (주)정석특장
Value  Count  Frequency (%)
육군  20  2.5%
농업회사법인  16  2.0%
이천지점  6  0.8%
이천공장  6  0.8%
이천점  3  0.4%
푸디스트㈜  2  0.3%
모터스  2  0.3%
이천지사  2  0.3%
셀프세차장  2  0.3%
제7135부대(707  2  0.3%
Other values (724)  732  92.3%
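The counts in this table sum to 793, more than the 676 rows, which is consistent with counting whitespace-separated words rather than whole values. The truncated token `제7135부대(707` further suggests that punctuation is stripped only from the ends of each word. The sketch below works under those two assumptions; the `word_counts` helper and the input strings are hypothetical, and the tool's actual tokenizer may differ.

```python
import string
from collections import Counter


def word_counts(values):
    """Split on whitespace, strip surrounding punctuation, and count the tokens."""
    strip_chars = string.punctuation + string.whitespace
    words = (w.strip(strip_chars) for v in values for w in v.split())
    return Counter(w for w in words if w)


# Hypothetical inputs, not rows from the dataset.
names = ["예시식품 이천공장", "예시물산 이천공장", "예시부대(12)"]
counts = word_counts(names)

print(counts["이천공장"])     # 2
print(counts["예시부대(12"])  # 1: the trailing ")" is stripped, the inner "(" stays
```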

Most occurring characters

ValueCountFrequency (%)
374
 
6.9%
( 351
 
6.5%
) 351
 
6.5%
176
 
3.2%
119
 
2.2%
113
 
2.1%
110
 
2.0%
90
 
1.7%
85
 
1.6%
82
 
1.5%
Other values (443) 3588
66.0%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  4295  79.0%
Open Punctuation  351  6.5%
Close Punctuation  351  6.5%
Decimal Number  151  2.8%
Space Separator  119  2.2%
Other Symbol  90  1.7%
Uppercase Letter  67  1.2%
Dash Punctuation  6  0.1%
Lowercase Letter  6  0.1%
Other Punctuation  3  0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
374
 
8.7%
176
 
4.1%
113
 
2.6%
110
 
2.6%
85
 
2.0%
82
 
1.9%
80
 
1.9%
74
 
1.7%
67
 
1.6%
65
 
1.5%
Other values (403) 3069
71.5%
Uppercase Letter
Value  Count  Frequency (%)
S  11  16.4%
C  10  14.9%
K  7  10.4%
P  5  7.5%
G  4  6.0%
D  4  6.0%
N  4  6.0%
L  4  6.0%
F  3  4.5%
M  2  3.0%
Other values (9)  13  19.4%
Decimal Number
Value  Count  Frequency (%)
1  41  27.2%
2  28  18.5%
7  23  15.2%
3  17  11.3%
0  13  8.6%
9  11  7.3%
5  7  4.6%
8  7  4.6%
4  3  2.0%
6  1  0.7%
Lowercase Letter
Value  Count  Frequency (%)
a  2  33.3%
s  2  33.3%
r  1  16.7%
l  1  16.7%
Other Punctuation
Value  Count  Frequency (%)
&  2  66.7%
'  1  33.3%
Open Punctuation
Value  Count  Frequency (%)
(  351  100.0%
Close Punctuation
Value  Count  Frequency (%)
)  351  100.0%
Space Separator
ValueCountFrequency (%)
119
100.0%
Other Symbol
ValueCountFrequency (%)
90
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  4385  80.6%
Common  981  18.0%
Latin  73  1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
374
 
8.5%
176
 
4.0%
113
 
2.6%
110
 
2.5%
90
 
2.1%
85
 
1.9%
82
 
1.9%
80
 
1.8%
74
 
1.7%
67
 
1.5%
Other values (404) 3134
71.5%
Latin
ValueCountFrequency (%)
S 11
15.1%
C 10
13.7%
K 7
 
9.6%
P 5
 
6.8%
G 4
 
5.5%
D 4
 
5.5%
N 4
 
5.5%
L 4
 
5.5%
F 3
 
4.1%
a 2
 
2.7%
Other values (13) 19
26.0%
Common
ValueCountFrequency (%)
( 351
35.8%
) 351
35.8%
119
 
12.1%
1 41
 
4.2%
2 28
 
2.9%
7 23
 
2.3%
3 17
 
1.7%
0 13
 
1.3%
9 11
 
1.1%
5 7
 
0.7%
Other values (6) 20
 
2.0%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  4295  79.0%
ASCII  1054  19.4%
None  90  1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
374
 
8.7%
176
 
4.1%
113
 
2.6%
110
 
2.6%
85
 
2.0%
82
 
1.9%
80
 
1.9%
74
 
1.7%
67
 
1.6%
65
 
1.5%
Other values (403) 3069
71.5%
ASCII
ValueCountFrequency (%)
( 351
33.3%
) 351
33.3%
119
 
11.3%
1 41
 
3.9%
2 28
 
2.7%
7 23
 
2.2%
3 17
 
1.6%
0 13
 
1.2%
9 11
 
1.0%
S 11
 
1.0%
Other values (29) 89
 
8.4%
None
ValueCountFrequency (%)
90
100.0%

업종명 (industry name)
Categorical

Distinct: 7
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 5.4 KiB
기타: 435
식료품제조: 120
비금속광물: 63
가공금속제품: 38
기타화학제품제조: 10
Other values (2): 10

Length

Max length: 8
Median length: 2
Mean length: 3.1789941
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 비금속광물
2nd row: 기타
3rd row: 기타
4th row: 기타
5th row: 기타

Common Values

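Value  Count  Frequency (%)
기타 (other)  435  64.3%
식료품제조 (food manufacturing)  120  17.8%
비금속광물 (non-metallic minerals)  63  9.3%
가공금속제품 (fabricated metal products)  38  5.6%
기타화학제품제조 (other chemical products manufacturing)  10  1.5%
섬유제품제조 (textile products manufacturing)  8  1.2%
종이제조 (paper manufacturing)  2  0.3%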
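The length statistics reported for this variable can be recomputed from the value counts alone, because every row holds one of the seven values. A small sketch using the counts from the Common Values table, where each length is the number of characters in the Korean string:

```python
from statistics import median

# Value counts copied from the Common Values table above.
counts = {
    "기타": 435,
    "식료품제조": 120,
    "비금속광물": 63,
    "가공금속제품": 38,
    "기타화학제품제조": 10,
    "섬유제품제조": 8,
    "종이제조": 2,
}

n = sum(counts.values())
# One length entry per row, repeated according to the value's count.
lengths = [len(value) for value, count in counts.items() for _ in range(count)]

print(n)                                  # 676
print(round(sum(lengths) / n, 7))         # 3.1789941
print(min(lengths), max(lengths))         # 2 8
print(f"{counts['기타'] / n:.1%}")         # 64.3%
```

The median length is 2 because 기타 alone accounts for more than half of the rows.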

Length

Histogram of lengths of the category

Distinct: 301
Distinct (%): 44.5%
Missing: 0
Missing (%): 0.0%
Memory size: 5.4 KiB

Length

Max length: 9
Median length: 8.5
Mean length: 3.6079882
Min length: 2

Characters and Unicode

Total characters: 2439
Distinct characters: 163
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 292
Unique (%): 43.2%

Sample

1st row: 대표이사
2nd row: 김명섭
3rd row: 이신순
4th row: 엄재훈
5th row: 박정숙
ValueCountFrequency (%)
대표이사 338
49.1%
부대장 24
 
3.5%
조합장 10
 
1.5%
조합장(김동일 2
 
0.3%
유혁상 2
 
0.3%
김형준 2
 
0.3%
2
 
0.3%
김준철 2
 
0.3%
한정수 2
 
0.3%
이정숙 2
 
0.3%
Other values (301) 302
43.9%

Most occurring characters

ValueCountFrequency (%)
382
15.7%
368
15.1%
343
14.1%
341
14.0%
72
 
3.0%
45
 
1.8%
32
 
1.3%
24
 
1.0%
24
 
1.0%
21
 
0.9%
Other values (153) 787
32.3%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  2407  98.7%
Space Separator  12  0.5%
Other Punctuation  8  0.3%
Open Punctuation  5  0.2%
Close Punctuation  5  0.2%
Decimal Number  2  0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
382
15.9%
368
15.3%
343
14.3%
341
14.2%
72
 
3.0%
45
 
1.9%
32
 
1.3%
24
 
1.0%
24
 
1.0%
21
 
0.9%
Other values (148) 755
31.4%
Space Separator
ValueCountFrequency (%)
12
100.0%
Other Punctuation
ValueCountFrequency (%)
, 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  2407  98.7%
Common  32  1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
382
15.9%
368
15.3%
343
14.3%
341
14.2%
72
 
3.0%
45
 
1.9%
32
 
1.3%
24
 
1.0%
24
 
1.0%
21
 
0.9%
Other values (148) 755
31.4%
Common
ValueCountFrequency (%)
12
37.5%
, 8
25.0%
( 5
15.6%
) 5
15.6%
1 2
 
6.2%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  2407  98.7%
ASCII  32  1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
382
15.9%
368
15.3%
343
14.3%
341
14.2%
72
 
3.0%
45
 
1.9%
32
 
1.3%
24
 
1.0%
24
 
1.0%
21
 
0.9%
Other values (148) 755
31.4%
ASCII
ValueCountFrequency (%)
12
37.5%
, 8
25.0%
( 5
15.6%
) 5
15.6%
1 2
 
6.2%

관할기관명 (competent authority name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 5.4 KiB
이천시: 676

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 이천시
2nd row: 이천시
3rd row: 이천시
4th row: 이천시
5th row: 이천시

Common Values

Value  Count  Frequency (%)
이천시  676  100.0%

Length

Histogram of lengths of the category

Distinct: 4
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 5.4 KiB
<NA>: 311
일반: 204
우수: 153
중점: 8

Length

Max length: 4
Median length: 2
Mean length: 2.9201183
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 우수
2nd row: <NA>
3rd row: <NA>
4th row: 일반
5th row: <NA>

Common Values

Value  Count  Frequency (%)
<NA>  311  46.0%
일반 (general)  204  30.2%
우수 (excellent)  153  22.6%
중점 (priority)  8  1.2%
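The report lists 0 missing cells for this variable, yet `<NA>` is its most common value. The reported mean length of 2.9201183 only works out if `<NA>` is counted as a four-character string, which suggests the placeholder is stored as text rather than as a true missing value. The sketch below checks that arithmetic and shows the share of rows that would count as missing if the placeholder were treated as such. The `PLACEHOLDERS` set is an assumption about how the data should be interpreted, not something the report states.

```python
# Value counts copied from the Common Values table above.
counts = {"<NA>": 311, "일반": 204, "우수": 153, "중점": 8}

# Assumption: the literal string "<NA>" stands for a missing grade.
PLACEHOLDERS = {"<NA>"}

n = sum(counts.values())
mean_length = sum(len(value) * count for value, count in counts.items()) / n
missing = sum(count for value, count in counts.items() if value in PLACEHOLDERS)

print(round(mean_length, 7))   # 2.9201183
print(f"{missing / n:.1%}")    # 46.0%
```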

Length
