Overview

Dataset statistics

Number of variables9
Number of observations586
Missing cells338
Missing cells (%)6.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory43.0 KiB
Average record size in memory75.2 B

Variable types

Categorical5
Numeric2
Text2

Dataset

Description제11대 충청남도 의회 홈페이지에서 제공되는 의안정보(의안번호, 의안명, 제안일자, 제안회기 등)를 공공데이터 목록에 등록합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=414&beforeMenuCd=DOM_000000201001001000&publicdatapk=15040208

Alerts

본회의 처리결과 is highly overall correlated with 대수 and 1 other fieldsHigh correlation
소관 위원회 is highly overall correlated with 대수 and 1 other fieldsHigh correlation
소관 위원회 처리결과 is highly overall correlated with 대수 and 1 other fieldsHigh correlation
대수 is highly overall correlated with 회기 and 5 other fieldsHigh correlation
제안자 구분 is highly overall correlated with 대수 and 1 other fieldsHigh correlation
회기 is highly overall correlated with 의안번호 and 1 other fieldsHigh correlation
의안번호 is highly overall correlated with 회기 and 1 other fieldsHigh correlation
대수 is highly imbalanced (98.2%)Imbalance
소관 위원회 처리결과 is highly imbalanced (50.6%)Imbalance
본회의 처리결과 is highly imbalanced (71.2%)Imbalance
공동 발의의원 has 335 (57.2%) missing valuesMissing

Reproduction

Analysis started2024-01-09 21:01:59.636030
Analysis finished2024-01-09 21:02:01.245261
Duration1.61 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

대수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
12
585 
<NA>
 
1

Length

Max length4
Median length2
Mean length2.003413
Min length2

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row12
2nd row12
3rd row12
4th row12
5th row12

Common Values

ValueCountFrequency (%)
12 585
99.8%
<NA> 1
 
0.2%

Length

2024-01-10T06:02:01.317917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:02:01.412998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
12 585
99.8%
na 1
 
0.2%

회기
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)1.7%
Missing1
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean343.20513
Minimum338
Maximum347
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2024-01-10T06:02:01.488155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum338
5-th percentile340
Q1341
median343
Q3346
95-th percentile347
Maximum347
Range9
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.6923867
Coefficient of variation (CV)0.0078448324
Kurtosis-1.3659583
Mean343.20513
Median Absolute Deviation (MAD)2
Skewness0.12939506
Sum200775
Variance7.2489463
MonotonicityNot monotonic
2024-01-10T06:02:01.598852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
347 118
20.1%
340 97
16.6%
341 85
14.5%
342 71
12.1%
343 56
9.6%
346 50
8.5%
345 43
 
7.3%
344 42
 
7.2%
339 17
 
2.9%
338 6
 
1.0%
(Missing) 1
 
0.2%
ValueCountFrequency (%)
338 6
 
1.0%
339 17
 
2.9%
340 97
16.6%
341 85
14.5%
342 71
12.1%
343 56
9.6%
344 42
 
7.2%
345 43
 
7.3%
346 50
8.5%
347 118
20.1%
ValueCountFrequency (%)
347 118
20.1%
346 50
8.5%
345 43
 
7.3%
344 42
 
7.2%
343 56
9.6%
342 71
12.1%
341 85
14.5%
340 97
16.6%
339 17
 
2.9%
338 6
 
1.0%

의안번호
Real number (ℝ)

HIGH CORRELATION 

Distinct585
Distinct (%)100.0%
Missing1
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean295.33504
Minimum1
Maximum589
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2024-01-10T06:02:01.736275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile30.2
Q1149
median295
Q3442
95-th percentile559.8
Maximum589
Range588
Interquartile range (IQR)293

Descriptive statistics

Standard deviation169.92161
Coefficient of variation (CV)0.57535201
Kurtosis-1.1950031
Mean295.33504
Median Absolute Deviation (MAD)147
Skewness-0.00024175783
Sum172771
Variance28873.353
MonotonicityNot monotonic
2024-01-10T06:02:01.892295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
589 1
 
0.2%
187 1
 
0.2%
201 1
 
0.2%
200 1
 
0.2%
199 1
 
0.2%
198 1
 
0.2%
162 1
 
0.2%
197 1
 
0.2%
196 1
 
0.2%
195 1
 
0.2%
Other values (575) 575
98.1%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
589 1
0.2%
588 1
0.2%
587 1
0.2%
586 1
0.2%
585 1
0.2%
584 1
0.2%
583 1
0.2%
582 1
0.2%
581 1
0.2%
580 1
0.2%
Distinct545
Distinct (%)93.2%
Missing1
Missing (%)0.2%
Memory size4.7 KiB
2024-01-10T06:02:02.219479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length50
Mean length28.926496
Min length5

Characters and Unicode

Total characters16922
Distinct characters417
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique524 ?
Unique (%)89.6%

Sample

1st row2023년도 행정사무감사 계획서 승인의 건
2nd row충청남도 양성평등 기본 조례 일부개정조례안
3rd row휴회의 건
4th row충청남도 학생인권 조례 폐지조례안(주민청구 조례안)
5th row충청남도 인권 기본 조례 폐지조례안(주민청구 조례안)
ValueCountFrequency (%)
충청남도 297
 
7.9%
조례 183
 
4.9%
관한 176
 
4.7%
조례안 136
 
3.6%
일부개정조례안 133
 
3.5%
124
 
3.3%
동의안 81
 
2.2%
66
 
1.8%
지원에 65
 
1.7%
충청남도교육청 57
 
1.5%
Other values (1028) 2438
64.9%
2024-01-10T06:02:02.673217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3171
 
18.7%
556
 
3.3%
549
 
3.2%
541
 
3.2%
519
 
3.1%
516
 
3.0%
442
 
2.6%
441
 
2.6%
405
 
2.4%
333
 
2.0%
Other values (407) 9449
55.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13102
77.4%
Space Separator 3171
 
18.7%
Decimal Number 504
 
3.0%
Other Punctuation 53
 
0.3%
Close Punctuation 36
 
0.2%
Open Punctuation 35
 
0.2%
Uppercase Letter 10
 
0.1%
Lowercase Letter 7
 
< 0.1%
Final Punctuation 1
 
< 0.1%
Initial Punctuation 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
556
 
4.2%
549
 
4.2%
541
 
4.1%
519
 
4.0%
516
 
3.9%
442
 
3.4%
441
 
3.4%
405
 
3.1%
333
 
2.5%
326
 
2.5%
Other values (371) 8474
64.7%
Decimal Number
ValueCountFrequency (%)
2 224
44.4%
0 98
19.4%
3 89
 
17.7%
4 45
 
8.9%
1 23
 
4.6%
5 8
 
1.6%
9 5
 
1.0%
8 4
 
0.8%
7 4
 
0.8%
6 4
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
D 2
20.0%
B 2
20.0%
E 1
10.0%
L 1
10.0%
T 1
10.0%
S 1
10.0%
G 1
10.0%
W 1
10.0%
Lowercase Letter
ValueCountFrequency (%)
l 2
28.6%
g 1
14.3%
n 1
14.3%
i 1
14.3%
y 1
14.3%
e 1
14.3%
Other Punctuation
ValueCountFrequency (%)
· 40
75.5%
? 10
 
18.9%
, 3
 
5.7%
Open Punctuation
ValueCountFrequency (%)
( 29
82.9%
6
 
17.1%
Close Punctuation
ValueCountFrequency (%)
) 29
80.6%
7
 
19.4%
Space Separator
ValueCountFrequency (%)
3171
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13102
77.4%
Common 3803
 
22.5%
Latin 17
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
556
 
4.2%
549
 
4.2%
541
 
4.1%
519
 
4.0%
516
 
3.9%
442
 
3.4%
441
 
3.4%
405
 
3.1%
333
 
2.5%
326
 
2.5%
Other values (371) 8474
64.7%
Common
ValueCountFrequency (%)
3171
83.4%
2 224
 
5.9%
0 98
 
2.6%
3 89
 
2.3%
4 45
 
1.2%
· 40
 
1.1%
( 29
 
0.8%
) 29
 
0.8%
1 23
 
0.6%
? 10
 
0.3%
Other values (12) 45
 
1.2%
Latin
ValueCountFrequency (%)
l 2
11.8%
D 2
11.8%
B 2
11.8%
E 1
 
5.9%
L 1
 
5.9%
T 1
 
5.9%
g 1
 
5.9%
n 1
 
5.9%
i 1
 
5.9%
y 1
 
5.9%
Other values (4) 4
23.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13099
77.4%
ASCII 3765
 
22.2%
None 53
 
0.3%
Compat Jamo 3
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3171
84.2%
2 224
 
5.9%
0 98
 
2.6%
3 89
 
2.4%
4 45
 
1.2%
( 29
 
0.8%
) 29
 
0.8%
1 23
 
0.6%
? 10
 
0.3%
5 8
 
0.2%
Other values (21) 39
 
1.0%
Hangul
ValueCountFrequency (%)
556
 
4.2%
549
 
4.2%
541
 
4.1%
519
 
4.0%
516
 
3.9%
442
 
3.4%
441
 
3.4%
405
 
3.1%
333
 
2.5%
326
 
2.5%
Other values (370) 8471
64.7%
None
ValueCountFrequency (%)
· 40
75.5%
7
 
13.2%
6
 
11.3%
Compat Jamo
ValueCountFrequency (%)
3
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%

제안자 구분
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
의원
251 
도지사
199 
의장
52 
교육감
49 
위원회
34 

Length

Max length4
Median length2
Mean length2.4846416
Min length2

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row위원회
2nd row위원회
3rd row의장
4th row의장
5th row의장

Common Values

ValueCountFrequency (%)
의원 251
42.8%
도지사 199
34.0%
의장 52
 
8.9%
교육감 49
 
8.4%
위원회 34
 
5.8%
<NA> 1
 
0.2%

Length

2024-01-10T06:02:02.808946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:02:02.930828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
의원 251
42.8%
도지사 199
34.0%
의장 52
 
8.9%
교육감 49
 
8.4%
위원회 34
 
5.8%
na 1
 
0.2%

공동 발의의원
Text

MISSING 

Distinct222
Distinct (%)88.4%
Missing335
Missing (%)57.2%
Memory size4.7 KiB
2024-01-10T06:02:03.146176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length238
Median length158
Mean length79.613546
Min length23

Characters and Unicode

Total characters19983
Distinct characters78
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique210 ?
Unique (%)83.7%

Sample

1st row홍성현, 구형서, 정병인, 박정수, 고광철, 편삼범, 김응규, 박정식, 이용국, 윤기형, 이재운, 이철수, 이완식, 김기서, 전익현, 신영호, 김명숙, 이상근, 주진하, 윤희신, 김민수, 박미옥, 신순옥, 이현숙
2nd row홍성현, 구형서, 김도훈, 오인철, 정병인, 김선태, 고광철, 편삼범, 박정식, 조철기, 안장헌, 지민규, 이연희, 윤기형, 오인환, 이재운, 이철수, 이완식, 김석곤, 전익현, 이상근, 이종화, 방한일, 주진하, 윤희신, 이지윤, 김민수, 박미옥, 신순옥, 이현숙
3rd row박기영, 김응규, 조철기, 오인환, 이철수, 김기서, 신영호, 방한일, 주진하, 박미옥
4th row신한철, 구형서, 유성재, 박정식, 이완식, 주진하, 윤희신, 정광섭, 김민수, 신순옥
5th row홍성현, 유성재, 김도훈, 오인철, 정병인, 박정수, 양경모, 고광철, 박기영, 최광희, 김응규, 이철수, 김석곤, 김복만, 이상근, 이종화, 방한일, 주진하, 윤희신, 김민수, 박미옥, 신순옥, 이현숙
ValueCountFrequency (%)
방한일 130
 
3.2%
정병인 122
 
3.0%
구형서 120
 
2.9%
윤희신 116
 
2.8%
이철수 115
 
2.8%
김민수 108
 
2.6%
윤기형 106
 
2.6%
김응규 104
 
2.5%
이상근 103
 
2.5%
지민규 101
 
2.5%
Other values (38) 2972
72.5%
2024-01-10T06:02:03.503802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 3846
19.2%
3846
19.2%
794
 
4.0%
736
 
3.7%
442
 
2.2%
373
 
1.9%
361
 
1.8%
352
 
1.8%
344
 
1.7%
331
 
1.7%
Other values (68) 8558
42.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12291
61.5%
Other Punctuation 3846
 
19.2%
Space Separator 3846
 
19.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
794
 
6.5%
736
 
6.0%
442
 
3.6%
373
 
3.0%
361
 
2.9%
352
 
2.9%
344
 
2.8%
331
 
2.7%
312
 
2.5%
289
 
2.4%
Other values (66) 7957
64.7%
Other Punctuation
ValueCountFrequency (%)
, 3846
100.0%
Space Separator
ValueCountFrequency (%)
3846
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12291
61.5%
Common 7692
38.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
794
 
6.5%
736
 
6.0%
442
 
3.6%
373
 
3.0%
361
 
2.9%
352
 
2.9%
344
 
2.8%
331
 
2.7%
312
 
2.5%
289
 
2.4%
Other values (66) 7957
64.7%
Common
ValueCountFrequency (%)
, 3846
50.0%
3846
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12291
61.5%
ASCII 7692
38.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 3846
50.0%
3846
50.0%
Hangul
ValueCountFrequency (%)
794
 
6.5%
736
 
6.0%
442
 
3.6%
373
 
3.0%
361
 
2.9%
352
 
2.9%
344
 
2.8%
331
 
2.7%
312
 
2.5%
289
 
2.4%
Other values (66) 7957
64.7%

소관 위원회
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
본회의
133 
행정문화위원회
105 
교육위원회
78 
복지환경위원회
76 
기획경제위원회
74 
Other values (5)
120 

Length

Max length9
Median length7
Mean length5.9795222
Min length3

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row본회의
2nd row본회의
3rd row본회의
4th row교육위원회
5th row행정문화위원회

Common Values

ValueCountFrequency (%)
본회의 133
22.7%
행정문화위원회 105
17.9%
교육위원회 78
13.3%
복지환경위원회 76
13.0%
기획경제위원회 74
12.6%
농수산해양위원회 49
 
8.4%
건설소방위원회 35
 
6.0%
예산결산특별위원회 22
 
3.8%
의회운영위원회 13
 
2.2%
<NA> 1
 
0.2%

Length

2024-01-10T06:02:03.628199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:02:03.741691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
본회의 133
22.7%
행정문화위원회 105
17.9%
교육위원회 78
13.3%
복지환경위원회 76
13.0%
기획경제위원회 74
12.6%
농수산해양위원회 49
 
8.4%
건설소방위원회 35
 
6.0%
예산결산특별위원회 22
 
3.8%
의회운영위원회 13
 
2.2%
na 1
 
0.2%

소관 위원회 처리결과
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct8
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
원안가결
377 
<NA>
137 
수정가결
45 
미상정
 
9
본회의에 부의하지 아니하기로 의결
 
9
Other values (3)
 
9