Overview

Dataset statistics

Number of variables7
Number of observations209
Missing cells47
Missing cells (%)3.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.0 KiB
Average record size in memory58.6 B

Variable types

Categorical2
Numeric1
Text4

Dataset

Description시도별 온실유형별(시설유형별, 규격시설별, 피복자재별 등)재배면적 현황
Author농림축산식품부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220217000000002032

Alerts

191.897 has 4 (1.9%) missing valuesMissing
Unnamed: 5 has 39 (18.7%) missing valuesMissing
6,938,320 has 4 (1.9%) missing valuesMissing

Reproduction

Analysis started2024-07-06 10:07:03.125111
Analysis finished2024-07-06 10:07:03.894742
Duration0.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

2011
Categorical

Distinct3
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2012
84 
2011
83 
2013
42 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2011
2nd row2011
3rd row2011
4th row2011
5th row2011

Common Values

ValueCountFrequency (%)
2012 84
40.2%
2011 83
39.7%
2013 42
20.1%

Length

2024-07-06T19:07:03.947546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-06T19:07:04.032963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2012 84
40.2%
2011 83
39.7%
2013 42
20.1%

노지
Categorical

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
시설
126 
노지
83 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row노지
2nd row노지
3rd row노지
4th row노지
5th row노지

Common Values

ValueCountFrequency (%)
시설 126
60.3%
노지 83
39.7%

Length

2024-07-06T19:07:04.123412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-06T19:07:04.204690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
시설 126
60.3%
노지 83
39.7%

1
Real number (ℝ)

Distinct42
Distinct (%)20.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.598086
Minimum1
Maximum42
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2024-07-06T19:07:04.305176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q111
median22
Q332
95-th percentile40
Maximum42
Range41
Interquartile range (IQR)21

Descriptive statistics

Standard deviation12.095422
Coefficient of variation (CV)0.56002285
Kurtosis-1.2000767
Mean21.598086
Median Absolute Deviation (MAD)10
Skewness-0.00093109348
Sum4514
Variance146.29923
MonotonicityNot monotonic
2024-07-06T19:07:04.417952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
2 5
 
2.4%
33 5
 
2.4%
25 5
 
2.4%
26 5
 
2.4%
27 5
 
2.4%
28 5
 
2.4%
29 5
 
2.4%
30 5
 
2.4%
31 5
 
2.4%
32 5
 
2.4%
Other values (32) 159
76.1%
ValueCountFrequency (%)
1 4
1.9%
2 5
2.4%
3 5
2.4%
4 5
2.4%
5 5
2.4%
6 5
2.4%
7 5
2.4%
8 5
2.4%
9 5
2.4%
10 5
2.4%
ValueCountFrequency (%)
42 5
2.4%
41 5
2.4%
40 5
2.4%
39 5
2.4%
38 5
2.4%
37 5
2.4%
36 5
2.4%
35 5
2.4%
34 5
2.4%
33 5
2.4%


Text

Distinct80
Distinct (%)38.3%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2024-07-06T19:07:04.619260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length8.4354067
Min length1

Characters and Unicode

Total characters1763
Distinct characters62
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)16.7%

Sample

1st row근 채 류
2nd row
3rd row( 봄 )
4th row( 고 랭 지 )
5th row( 가 을 )
ValueCountFrequency (%)
64
 
12.5%
27
 
5.3%
24
 
4.7%
17
 
3.3%
16
 
3.1%
14
 
2.7%
14
 
2.7%
13
 
2.5%
13
 
2.5%
13
 
2.5%
Other values (67) 295
57.8%
2024-07-06T19:07:04.930489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1201
68.1%
( 40
 
2.3%
) 40
 
2.3%
30
 
1.7%
30
 
1.7%
20
 
1.1%
20
 
1.1%
15
 
0.9%
15
 
0.9%
15
 
0.9%
Other values (52) 337
 
19.1%

Most occurring categories

ValueCountFrequency (%)
Space Separator 1201
68.1%
Other Letter 482
27.3%
Open Punctuation 40
 
2.3%
Close Punctuation 40
 
2.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
6.2%
30
 
6.2%
20
 
4.1%
20
 
4.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
Other values (49) 292
60.6%
Space Separator
ValueCountFrequency (%)
1201
100.0%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%
Close Punctuation
ValueCountFrequency (%)
) 40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1281
72.7%
Hangul 482
 
27.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
6.2%
30
 
6.2%
20
 
4.1%
20
 
4.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
Other values (49) 292
60.6%
Common
ValueCountFrequency (%)
1201
93.8%
( 40
 
3.1%
) 40
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1281
72.7%
Hangul 482
 
27.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1201
93.8%
( 40
 
3.1%
) 40
 
3.1%
Hangul
ValueCountFrequency (%)
30
 
6.2%
30
 
6.2%
20
 
4.1%
20
 
4.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
Other values (49) 292
60.6%

191.897
Text

MISSING 

Distinct153
Distinct (%)74.6%
Missing4
Missing (%)1.9%
Memory size1.8 KiB
2024-07-06T19:07:05.235506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length4.2390244
Min length1

Characters and Unicode

Total characters869
Distinct characters14
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique141 ?
Unique (%)68.8%

Sample

1st row25.171
2nd row21.578
3rd row9.117
4th row2.713
5th row9.748
ValueCountFrequency (%)
44
 
21.5%
1,127 2
 
1.0%
2 2
 
1.0%
1.490 2
 
1.0%
1,955 2
 
1.0%
4.783 2
 
1.0%
2,777 2
 
1.0%
2.870 2
 
1.0%
2.896 2
 
1.0%
1.488 2
 
1.0%
Other values (142) 143
69.8%
2024-07-06T19:07:05.672202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 98
11.3%
1 87
10.0%
2 83
9.6%
76
8.7%
4 72
8.3%
7 64
 
7.4%
3 63
 
7.2%
5 58
 
6.7%
6 53
 
6.1%
8 52
 
6.0%
Other values (4) 163
18.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 628
72.3%
Other Punctuation 121
 
13.9%
Space Separator 76
 
8.7%
Dash Punctuation 44
 
5.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 87
13.9%
2 83
13.2%
4 72
11.5%
7 64
10.2%
3 63
10.0%
5 58
9.2%
6 53
8.4%
8 52
8.3%
9 49
7.8%
0 47
7.5%
Other Punctuation
ValueCountFrequency (%)
. 98
81.0%
, 23
 
19.0%
Space Separator
ValueCountFrequency (%)
76
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 44
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 869
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
. 98
11.3%
1 87
10.0%
2 83
9.6%
76
8.7%
4 72
8.3%
7 64
 
7.4%
3 63
 
7.2%
5 58
 
6.7%
6 53
 
6.1%
8 52
 
6.0%
Other values (4) 163
18.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 869
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 98
11.3%
1 87
10.0%
2 83
9.6%
76
8.7%
4 72
8.3%
7 64
 
7.4%
3 63
 
7.2%
5 58
 
6.7%
6 53
 
6.1%
8 52
 
6.0%
Other values (4) 163
18.8%

Unnamed: 5
Text

MISSING 

Distinct123
Distinct (%)72.4%
Missing39
Missing (%)18.7%
Memory size1.8 KiB
2024-07-06T19:07:05.954990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/