Overview

Dataset statistics

Number of variables12
Number of observations44
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.3 KiB
Average record size in memory99.0 B

Variable types

Categorical2
Text10

Dataset

Description시도별 온실유형별(시설유형별, 규격시설별, 피복자재별 등)재배면적 현황
Author농림축산식품부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220217000000002032

Alerts

채소류별(3) is highly imbalanced (55.4%)Imbalance
2019 has unique valuesUnique
2019.2 has unique valuesUnique

Reproduction

Analysis started2024-07-06 10:07:14.331175
Analysis finished2024-07-06 10:07:14.966681
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

채소류별(1)
Categorical

Distinct8
Distinct (%)18.2%
Missing0
Missing (%)0.0%
Memory size484.0 B
엽채류
12 
과채류
11 
근채류
10 
조미채소류
채소류별(1)
Other values (3)

Length

Max length7
Median length3
Mean length3.4772727
Min length2

Unique

Unique3 ?
Unique (%)6.8%

Sample

1st row채소류별(1)
2nd row채소류별(1)
3rd row합계
4th row근채류
5th row근채류

Common Values

ValueCountFrequency (%)
엽채류 12
27.3%
과채류 11
25.0%
근채류 10
22.7%
조미채소류 6
13.6%
채소류별(1) 2
 
4.5%
합계 1
 
2.3%
양채류 1
 
2.3%
기타채소류 1
 
2.3%

Length

2024-07-06T19:07:15.026008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-06T19:07:15.125862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
엽채류 12
27.3%
과채류 11
25.0%
근채류 10
22.7%
조미채소류 6
13.6%
채소류별(1 2
 
4.5%
합계 1
 
2.3%
양채류 1
 
2.3%
기타채소류 1
 
2.3%
Distinct29
Distinct (%)65.9%
Missing0
Missing (%)0.0%
Memory size484.0 B
2024-07-06T19:07:15.297064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length2
Mean length2.25
Min length1

Characters and Unicode

Total characters99
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)56.8%

Sample

1st row채소류별(2)
2nd row채소류별(2)
3rd row소계
4th row소계
5th row
ValueCountFrequency (%)
소계 7
 
15.9%
5
 
11.4%
배추 5
 
11.4%
채소류별(2 2
 
4.5%
딸기 1
 
2.3%
오이 1
 
2.3%
1
 
2.3%
양파 1
 
2.3%
마늘 1
 
2.3%
고추 1
 
2.3%
Other values (19) 19
43.2%
2024-07-06T19:07:15.601658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
10.1%
9
 
9.1%
7
 
7.1%
6
 
6.1%
5
 
5.1%
3
 
3.0%
3
 
3.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (41) 50
50.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 93
93.9%
Close Punctuation 2
 
2.0%
Decimal Number 2
 
2.0%
Open Punctuation 2
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
10.8%
9
 
9.7%
7
 
7.5%
6
 
6.5%
5
 
5.4%
3
 
3.2%
3
 
3.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (38) 44
47.3%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Decimal Number
ValueCountFrequency (%)
2 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 93
93.9%
Common 6
 
6.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
10.8%
9
 
9.7%
7
 
7.5%
6
 
6.5%
5
 
5.4%
3
 
3.2%
3
 
3.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (38) 44
47.3%
Common
ValueCountFrequency (%)
) 2
33.3%
2 2
33.3%
( 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 93
93.9%
ASCII 6
 
6.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10
 
10.8%
9
 
9.7%
7
 
7.5%
6
 
6.5%
5
 
5.4%
3
 
3.2%
3
 
3.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (38) 44
47.3%
ASCII
ValueCountFrequency (%)
) 2
33.3%
2 2
33.3%
( 2
33.3%

채소류별(3)
Categorical

IMBALANCE 

Distinct10
Distinct (%)22.7%
Missing0
Missing (%)0.0%
Memory size484.0 B
소계
34 
채소류별(3)
 
2
봄무
 
1
고랭지무
 
1
가을무
 
1
Other values (5)

Length

Max length7
Median length2
Mean length2.5
Min length2

Unique

Unique8 ?
Unique (%)18.2%

Sample

1st row채소류별(3)
2nd row채소류별(3)
3rd row소계
4th row소계
5th row소계

Common Values

ValueCountFrequency (%)
소계 34
77.3%
채소류별(3) 2
 
4.5%
봄무 1
 
2.3%
고랭지무 1
 
2.3%
가을무 1
 
2.3%
겨울무 1
 
2.3%
봄배추 1
 
2.3%
고랭지배추 1
 
2.3%
가을배추 1
 
2.3%
겨울배추 1
 
2.3%

Length

2024-07-06T19:07:15.723572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-06T19:07:15.834655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소계 34
77.3%
채소류별(3 2
 
4.5%
봄무 1
 
2.3%
고랭지무 1
 
2.3%
가을무 1
 
2.3%
겨울무 1
 
2.3%
봄배추 1
 
2.3%
고랭지배추 1
 
2.3%
가을배추 1
 
2.3%
겨울배추 1
 
2.3%

2019
Text

UNIQUE 

Distinct44
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size484.0 B
2024-07-06T19:07:16.065473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length4
Mean length4.2045455
Min length1

Characters and Unicode

Total characters185
Distinct characters18
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)100.0%

Sample

1st row
2nd row면적 (ha)
3rd row225872
4th row22523
5th row19503
ValueCountFrequency (%)
1
 
2.2%
1277 1
 
2.2%
1502 1
 
2.2%
1610 1
 
2.2%
49758 1
 
2.2%
11973 1
 
2.2%
3647 1
 
2.2%
4962 1
 
2.2%
9874 1
 
2.2%
6462 1
 
2.2%
Other values (35) 35
77.8%
2024-07-06T19:07:16.403337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 21
11.4%
4 20
10.8%
3 20
10.8%
2 20
10.8%
6 19
10.3%
7 18
9.7%
5 16
8.6%
9 15
8.1%
8 15
8.1%
0 13
7.0%
Other values (8) 8
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 177
95.7%
Other Letter 3
 
1.6%
Lowercase Letter 2
 
1.1%
Close Punctuation 1
 
0.5%
Open Punctuation 1
 
0.5%
Space Separator 1
 
0.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 21
11.9%
4 20
11.3%
3 20
11.3%
2 20
11.3%
6 19
10.7%
7 18
10.2%
5 16
9.0%
9 15
8.5%
8 15
8.5%
0 13
7.3%
Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Lowercase Letter
ValueCountFrequency (%)
a 1
50.0%
h 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 180
97.3%
Hangul 3
 
1.6%
Latin 2
 
1.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 21
11.7%
4 20
11.1%
3 20
11.1%
2 20
11.1%
6 19
10.6%
7 18
10.0%
5 16
8.9%
9 15
8.3%
8 15
8.3%
0 13
7.2%
Other values (3) 3
 
1.7%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Latin
ValueCountFrequency (%)
a 1
50.0%
h 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 182
98.4%
Hangul 3
 
1.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 21
11.5%
4 20
11.0%
3 20
11.0%
2 20
11.0%
6 19
10.4%
7 18
9.9%
5 16
8.8%
9 15
8.2%
8 15
8.2%
0 13
7.1%
Other values (5) 5
 
2.7%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

2019.1
Text

Distinct38
Distinct (%)86.4%
Missing0
Missing (%)0.0%
Memory size484.0 B
2024-07-06T19:07:16.582705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length4
Mean length3.6136364
Min length1

Characters and Unicode

Total characters159
Distinct characters21
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)84.1%

Sample

1st row
2nd row단수 (kg/10a)
3rd row-
4th row-
5th row5696
ValueCountFrequency (%)
7
 
15.6%
3625 1
 
2.2%
2701 1
 
2.2%
2650 1
 
2.2%
3950 1
 
2.2%
3974 1
 
2.2%
4119 1
 
2.2%
7377 1
 
2.2%
3479 1
 
2.2%
5232 1
 
2.2%
Other values (29) 29
64.4%
2024-07-06T19:07:16.852448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 18
11.3%
3 17
10.7%
1 17
10.7%
7 16
10.1%
6 15
9.4%
5 14
8.8%
9 14
8.8%
4 13
8.2%
0 10
6.3%
8 8
5.0%
Other values (11) 17
10.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 142
89.3%
Dash Punctuation 7
 
4.4%
Lowercase Letter 3
 
1.9%
Other Letter 3
 
1.9%
Other Punctuation 1
 
0.6%
Open Punctuation 1
 
0.6%
Space Separator 1
 
0.6%
Close Punctuation 1
 
0.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 18
12.7%
3 17
12.0%
1 17
12.0%
7 16
11.3%
6 15
10.6%
5 14
9.9%
9 14
9.9%
4 13
9.2%
0 10
7.0%
8 8
5.6%
Lowercase Letter
ValueCountFrequency (%)
k 1
33.3%
a 1
33.3%
g 1
33.3%
Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 153
96.2%
Latin 3
 
1.9%
Hangul 3
 
1.9%

Most frequent character per script

Common
ValueCountFrequency (%)
2 18
11.8%
3 17
11.1%
1 17
11.1%
7 16
10.5%
6 15
9.8%
5 14
9.2%
9 14
9.2%
4 13
8.5%
0 10
6.5%
8 8
5.2%
Other values (5) 11
7.2%
Latin
ValueCountFrequency (%)
k 1
33.3%
a 1
33.3%
g 1
33.3%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 156
98.1%
Hangul 3
 
1.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 18
11.5%
3 17
10.9%
1 17
10.9%
7 16
10.3%
6 15
9.6%
5 14
9.0%
9 14
9.0%
4 13
8.3%
0 10
6.4%
8 8
5.1%
Other values (8) 14
9.0%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

2019.2
Text

UNIQUE 

Distinct44
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size484.0 B
2024-07-06T19:07:17.082006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/