Overview

Dataset statistics

Number of variables: 9
Number of observations: 1597
Missing cells: 723
Missing cells (%): 5.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 117.1 KiB
Average record size in memory: 75.1 B
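
The derived figures in this block can be re-computed from the raw counts. The sketch below is a sanity check on the reported values only; it assumes that 1 KiB means 1024 bytes and that the percentage of missing cells is taken over variables times observations.

```python
# Figures copied from the "Dataset statistics" block of the report.
n_variables = 9
n_observations = 1597
missing_cells = 723
total_size_kib = 117.1

# Assumption: the missing-cell percentage is relative to all cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Both derived values agree with the report after rounding to one decimal place.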

Variable types

Categorical: 5
Numeric: 2
Text: 1
Unsupported: 1

Dataset

Description: Bill information provided on the website of the 11th Chungcheongnam-do Provincial Council (bill number, bill title, proposal date, proposal session, etc.), registered in the public data catalog.
Author: 충청남도
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=414&beforeMenuCd=DOM_000000201001001000&publicdatapk=15040208

Alerts

Constant: 대수 has constant value "11"
High correlation: 본회의 처리결과 is highly overall correlated with 소관 위원회 처리결과
High correlation: 소관 위원회 처리결과 is highly overall correlated with 본회의 처리결과
High correlation: 회기 is highly overall correlated with 의안번호
High correlation: 의안번호 is highly overall correlated with 회기
High correlation: 제안자 구분 is highly overall correlated with 소관 위원회
High correlation: 소관 위원회 is highly overall correlated with 제안자 구분
Imbalance: 소관 위원회 처리결과 is highly imbalanced (51.4%)
Imbalance: 본회의 처리결과 is highly imbalanced (66.2%)
Missing: 공동 발의의원 has 723 (45.3%) missing values
Unique: 의안번호 has unique values
Unsupported: 공동 발의의원 is an unsupported type, check if it needs cleaning or further analysis
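
The two imbalance percentages are consistent with a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the class shares and k is the number of classes. The sketch below assumes that formula (the report itself does not state it) and uses the class counts from the Common Values tables of the two variables, with the `<NA>` category counted as a class. The function name `imbalance_score` is introduced here for illustration.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts (assumed formula)."""
    if len(counts) < 2:
        # Assumption: a single-class column is treated as having score 0.
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(len(counts))

# Counts copied from the report (소관 위원회 처리결과, 본회의 처리결과).
committee_result = [928, 409, 220, 13, 11, 10, 3, 2, 1]
plenary_result = [1296, 218, 41, 26, 10, 5, 1]

print(f"{imbalance_score(committee_result):.1%}")
print(f"{imbalance_score(plenary_result):.1%}")
```

A score of 0 corresponds to perfectly balanced classes and a score near 1 to a single dominant class.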

Reproduction

Analysis started: 2024-01-09 21:01:50.319060
Analysis finished: 2024-01-09 21:01:51.465883
Duration: 1.15 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
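
The report does not record the command that produced it. A minimal sketch of how a comparable report might be generated with ydata-profiling follows; the helper name `build_report`, the CSV path, the title, and the output file name are all hypothetical, and the source file may need an explicit `encoding` argument.

```python
def build_report(csv_path, output_path="report.html"):
    """Profile the CSV at csv_path and write an HTML report (sketch)."""
    # Imported lazily so that defining this helper does not require
    # pandas or ydata-profiling to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the public dataset was exported as a CSV file.
    df = pd.read_csv(csv_path)

    report = ProfileReport(
        df,
        title="Profiling report",  # hypothetical title
        dataset={
            # Metadata shown in the "Dataset" block of the overview.
            "description": "Bill information of the 11th Chungcheongnam-do Provincial Council",
            "author": "충청남도",
        },
    )
    report.to_file(output_path)
    return report
```

A call such as `build_report("bills.csv")` (hypothetical file name) would write `report.html` next to the script.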

Variables

대수
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 11
2nd row: 11
3rd row: 11
4th row: 11
5th row: 11

Common Values

Value | Count | Frequency (%)
11 | 1597 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
11 | 1597 | 100.0%

회기
Real number (ℝ)

HIGH CORRELATION 

Distinct: 33
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 322.12336
Minimum: 305
Maximum: 337
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 14.2 KiB

Quantile statistics

Minimum: 305
5-th percentile: 307
Q1: 314
Median: 324
Q3: 330
95-th percentile: 335
Maximum: 337
Range: 32
Interquartile range (IQR): 16

Descriptive statistics

Standard deviation: 9.4229729
Coefficient of variation (CV): 0.029252684
Kurtosis: -1.2489641
Mean: 322.12336
Median Absolute Deviation (MAD): 8
Skewness: -0.19195231
Sum: 514431
Variance: 88.792418
Monotonicity: Not monotonic
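
Several of the 회기 statistics are functions of the others, which gives a quick consistency check. The sketch below uses only values copied from the report and the standard definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, range = maximum - minimum, sum = mean × count).

```python
# Values copied from the 회기 statistics in the report.
n = 1597
mean, std = 322.12336, 9.4229729
q1, q3 = 314, 330
minimum, maximum = 305, 337

cv = std / mean                  # coefficient of variation
variance = std ** 2
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum
total = mean * n                 # should be close to the reported Sum

print(f"CV={cv:.9f} variance={variance:.6f} IQR={iqr} range={value_range} sum={total:.0f}")
```

Each derived value matches the corresponding reported figure up to rounding.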
Histogram with fixed size bins (bins=33)

Common values
Value | Count | Frequency (%)
333 | 126 | 7.9%
325 | 99 | 6.2%
329 | 88 | 5.5%
331 | 87 | 5.4%
328 | 85 | 5.3%
335 | 66 | 4.1%
308 | 64 | 4.0%
337 | 63 | 3.9%
324 | 63 | 3.9%
330 | 62 | 3.9%
Other values (23) | 794 | 49.7%

Minimum 10 values
Value | Count | Frequency (%)
305 | 24 | 1.5%
306 | 34 | 2.1%
307 | 43 | 2.7%
308 | 64 | 4.0%
309 | 44 | 2.8%
310 | 47 | 2.9%
311 | 54 | 3.4%
312 | 42 | 2.6%
313 | 28 | 1.8%
314 | 46 | 2.9%

Maximum 10 values
Value | Count | Frequency (%)
337 | 63 | 3.9%
336 | 4 | 0.3%
335 | 66 | 4.1%
334 | 32 | 2.0%
333 | 126 | 7.9%
332 | 8 | 0.5%
331 | 87 | 5.4%
330 | 62 | 3.9%
329 | 88 | 5.5%
328 | 85 | 5.3%

의안번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 1597
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 799.90607
Minimum: 1
Maximum: 1598
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 14.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 80.8
Q1: 401
Median: 800
Q3: 1199
95-th percentile: 1518.2
Maximum: 1598
Range: 1597
Interquartile range (IQR): 798

Descriptive statistics

Standard deviation: 461.30602
Coefficient of variation (CV): 0.57670024
Kurtosis: -1.1991133
Mean: 799.90607
Median Absolute Deviation (MAD): 399
Skewness: -0.00090121651
Sum: 1277450
Variance: 212803.25
Monotonicity: Not monotonic
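
The column holds 1597 unique values between 1 and 1598, so if every value is an integer, exactly one bill number in that range is absent, and the reported sum identifies it. The sketch below works that out and cross-checks the result against the reported percentiles. The integer assumption and the linear-interpolation quantile rule are assumptions, not statements from the report.

```python
# Values copied from the 의안번호 statistics in the report.
n_observations = 1597
minimum, maximum = 1, 1598
reported_sum = 1277450

# Sum of every integer from minimum to maximum (arithmetic series).
full_sum = (minimum + maximum) * (maximum - minimum + 1) // 2
n_absent = (maximum - minimum + 1) - n_observations
absent_value = full_sum - reported_sum  # meaningful only when n_absent == 1

def quantile(sorted_values, q):
    """Quantile with linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

# Rebuild the column under the integer assumption and compare percentiles.
values = [v for v in range(minimum, maximum + 1) if v != absent_value]
print(n_absent, absent_value)
print(quantile(values, 0.05), quantile(values, 0.5), quantile(values, 0.95))
```

Under these assumptions the reconstructed percentiles agree with the reported 80.8, 800, and 1518.2.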
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1598 | 1 | 0.1%
525 | 1 | 0.1%
527 | 1 | 0.1%
528 | 1 | 0.1%
529 | 1 | 0.1%
530 | 1 | 0.1%
531 | 1 | 0.1%
532 | 1 | 0.1%
533 | 1 | 0.1%
534 | 1 | 0.1%
Other values (1587) | 1587 | 99.4%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
1598 | 1 | 0.1%
1597 | 1 | 0.1%
1596 | 1 | 0.1%
1595 | 1 | 0.1%
1594 | 1 | 0.1%
1593 | 1 | 0.1%
1592 | 1 | 0.1%
1591 | 1 | 0.1%
1590 | 1 | 0.1%
1589 | 1 | 0.1%

Text

Distinct: 1428
Distinct (%): 89.4%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

Length

Max length: 66
Median length: 46
Mean length: 27.290545
Min length: 5

Characters and Unicode

Total characters: 43583
Distinct characters: 476
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
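
As an illustration of such character properties, the sketch below looks up the general category of each character in the first sample row using Python's standard `unicodedata` module. The two-letter codes `Lo` and `Zs` correspond to the "Other Letter" and "Space Separator" labels used in this report; taking the first word of the character name as a rough script hint is an assumption made for this example only.

```python
import unicodedata
from collections import Counter

# First sample row of the text variable in the report.
sample = "충청남도 공교육 강화를 위한 특별위원회 활동결과 보고"

# General category per character: "Lo" = Other Letter, "Zs" = Space Separator.
categories = Counter(unicodedata.category(ch) for ch in sample)

# Rough script hint: the first word of the official character name.
name_prefixes = Counter(unicodedata.name(ch).split()[0] for ch in sample)

print(dict(categories))
print(dict(name_prefixes))
```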

Unique

Unique: 1337
Unique (%): 83.7%
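
Across the variables in this report, "Distinct" behaves as the number of different values and "Unique" as the number of values that occur exactly once (for example, the constant column has 1 distinct and 0 unique values). The sketch below illustrates the difference on a hypothetical toy column and re-derives the two percentages from the reported counts.

```python
from collections import Counter

# Hypothetical toy column, not taken from the dataset.
titles = ["A", "B", "B", "C", "C", "C", "D"]
counts = Counter(titles)

distinct = len(counts)                              # different values: A, B, C, D
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once: A, D

# Percentages for the text variable, from the reported counts.
n_observations = 1597
distinct_pct = 100 * 1428 / n_observations
unique_pct = 100 * 1337 / n_observations

print(distinct, unique)
print(f"{distinct_pct:.1f}% {unique_pct:.1f}%")
```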

Sample

1st row: 충청남도 공교육 강화를 위한 특별위원회 활동결과 보고
2nd row: 내포문화권 발전을 위한 특별위원회 활동결과 보고
3rd row: 충청남도 인삼산업 발전을 위한 특별위원회 활동결과 보고
4th row: 백제시대 술 발전을 위한 특별위원회 활동결과 보고
5th row: 해양‧환경 특별위원회 활동결과 보고
ValueCountFrequency (%)
충청남도 870
 
9.0%
관한 564
 
5.8%
조례안 500
 
5.2%
조례 392
 
4.1%
377
 
3.9%
일부개정조례안 325
 
3.4%
지원에 215
 
2.2%
198
 
2.1%
충청남도교육청 129
 
1.3%
충청남도의회 126
 
1.3%
Other values (2106) 5962
61.7%

Most occurring characters

ValueCountFrequency (%)
8061
 
18.5%
1568
 
3.6%
1477
 
3.4%
1438
 
3.3%
1412
 
3.2%
1358
 
3.1%
1283
 
2.9%
1271
 
2.9%
921
 
2.1%
835
 
1.9%
Other values (466) 23959
55.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 33968 | 77.9%
Space Separator | 8061 | 18.5%
Decimal Number | 1308 | 3.0%
Other Punctuation | 90 | 0.2%
Close Punctuation | 53 | 0.1%
Open Punctuation | 52 | 0.1%
Uppercase Letter | 36 | 0.1%
Math Symbol | 8 | < 0.1%
Dash Punctuation | 3 | < 0.1%
Final Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1568
 
4.6%
1477
 
4.3%
1438
 
4.2%
1412
 
4.2%
1358
 
4.0%
1283
 
3.8%
1271
 
3.7%
921
 
2.7%
835
 
2.5%
821
 
2.4%
Other values (422) 21584
63.5%
Uppercase Letter
ValueCountFrequency (%)
T 4
11.1%
S 4
11.1%
M 4
11.1%
B 3
8.3%
G 3
8.3%
O 3
8.3%
P 3
8.3%
K 2
 
5.6%
C 2
 
5.6%
E 2
 
5.6%
Other values (5) 6
16.7%
Decimal Number
ValueCountFrequency (%)
2 501
38.3%
0 323
24.7%
1 203
15.5%
3 122
 
9.3%
9 63
 
4.8%
8 31
 
2.4%
4 26
 
2.0%
5 18
 
1.4%
7 12
 
0.9%
6 9
 
0.7%
Other Punctuation
ValueCountFrequency (%)
· 61
67.8%
10
 
11.1%
, 4
 
4.4%
4
 
4.4%
4
 
4.4%
3
 
3.3%
. 2
 
2.2%
% 1
 
1.1%
: 1
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 42
79.2%
11
 
20.8%
Open Punctuation
ValueCountFrequency (%)
( 41
78.8%
11
 
21.2%
Math Symbol
ValueCountFrequency (%)
~ 7
87.5%
1
 
12.5%
Space Separator
ValueCountFrequency (%)
8061
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 33967 | 77.9%
Common | 9579 | 22.0%
Latin | 36 | 0.1%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1568
 
4.6%
1477
 
4.3%
1438
 
4.2%
1412
 
4.2%
1358
 
4.0%
1283
 
3.8%
1271
 
3.7%
921
 
2.7%
835
 
2.5%
821
 
2.4%
Other values (421) 21583
63.5%
Common
ValueCountFrequency (%)
8061
84.2%
2 501
 
5.2%
0 323
 
3.4%
1 203
 
2.1%
3 122
 
1.3%
9 63
 
0.7%
· 61
 
0.6%
) 42
 
0.4%
( 41
 
0.4%
8 31
 
0.3%
Other values (19) 131
 
1.4%
Latin
ValueCountFrequency (%)
T 4
11.1%
S 4
11.1%
M 4
11.1%
B 3
8.3%
G 3
8.3%
O 3
8.3%
P 3
8.3%
K 2
 
5.6%
C 2
 
5.6%
E 2
 
5.6%
Other values (5) 6
16.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 33930 | 77.9%
ASCII | 9506 | 21.8%
None | 86 | 0.2%
Compat Jamo | 37 | 0.1%
Punctuation | 18 | < 0.1%
Katakana | 4 | < 0.1%
Math Operators | 1 | < 0.1%
CJK | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8061
84.8%
2 501
 
5.3%
0 323
 
3.4%
1 203
 
2.1%
3 122
 
1.3%
9 63
 
0.7%
) 42
 
0.4%
( 41
 
0.4%
8 31
 
0.3%
4 26
 
0.3%
Other values (24) 93
 
1.0%
Hangul
ValueCountFrequency (%)
1568
 
4.6%
1477
 
4.4%
1438
 
4.2%
1412
 
4.2%
1358
 
4.0%
1283
 
3.8%
1271
 
3.7%
921
 
2.7%
835
 
2.5%
821
 
2.4%
Other values (420) 21546
63.5%
None
ValueCountFrequency (%)
· 61
70.9%
11
 
12.8%
11
 
12.8%
3
 
3.5%
Compat Jamo
ValueCountFrequency (%)
37
100.0%
Punctuation
ValueCountFrequency (%)
10
55.6%
4
 
22.2%
2
 
11.1%
2
 
11.1%
Katakana
ValueCountFrequency (%)
4
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

제안자 구분
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

Length

Max length: 3
Median length: 2
Mean length: 2.3976205
Min length: 2
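
The length statistics follow directly from the category counts of 제안자 구분. The sketch below re-derives them, assuming that length is measured in characters (Unicode code points), which is how Python's `len` counts these strings.

```python
from statistics import median

# Category counts copied from the Common Values table of 제안자 구분.
counts = {"의원": 848, "도지사": 358, "교육감": 154, "위원회": 123, "의장": 114}

n = sum(counts.values())
# Count-weighted mean of the character length of each category label.
mean_length = sum(len(value) * count for value, count in counts.items()) / n

# Expand to one length per row to take the median.
lengths = sorted(len(value) for value, count in counts.items() for _ in range(count))
median_length = median(lengths)

print(n, round(mean_length, 7), median_length, min(lengths), max(lengths))
```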

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 위원회
2nd row: 위원회
3rd row: 위원회
4th row: 위원회
5th row: 위원회

Common Values

Value | Count | Frequency (%)
의원 | 848 | 53.1%
도지사 | 358 | 22.4%
교육감 | 154 | 9.6%
위원회 | 123 | 7.7%
의장 | 114 | 7.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
의원 | 848 | 53.1%
도지사 | 358 | 22.4%
교육감 | 154 | 9.6%
위원회 | 123 | 7.7%
의장 | 114 | 7.1%

공동 발의의원
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 723
Missing (%): 45.3%
Memory size: 12.6 KiB

소관 위원회
Categorical

HIGH CORRELATION 

Distinct: 15
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

Length

Max length: 11
Median length: 9
Mean length: 6.3212273
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 본회의
2nd row: 본회의
3rd row: 본회의
4th row: 본회의
5th row: 본회의

Common Values

Value | Count | Frequency (%)
본회의 | 383 | 24.0%
교육위원회 | 209 | 13.1%
행정문화위원회 | 129 | 8.1%
행정자치위원회 | 124 | 7.8%
기획경제위원회 | 116 | 7.3%
농업경제환경위원회 | 113 | 7.1%
문화복지위원회 | 109 | 6.8%
복지환경위원회 | 89 | 5.6%
안전건설해양소방위원회 | 78 | 4.9%
예산결산특별위원회 | 75 | 4.7%
Other values (5) | 172 | 10.8%

Length

Histogram of lengths of the category

소관 위원회 처리결과
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 9
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

Length

Max length: 18
Median length: 4
Mean length: 3.9805886
Min length: 2

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
원안가결 | 928 | 58.1%
<NA> | 409 | 25.6%
수정가결 | 220 | 13.8%
철회 | 13 | 0.8%
미상정 | 11 | 0.7%
부결 | 10 | 0.6%
보류 | 3 | 0.2%
본회의에 부의하지 아니하기로 의결 | 2 | 0.1%
징계하지 아니함 | 1 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
원안가결 | 928 | 57.9%
na | 409 | 25.5%
수정가결 | 220 | 13.7%
철회 | 13 | 0.8%
미상정 | 11 | 0.7%
부결 | 10 | 0.6%
보류 | 3 | 0.2%
본회의에 | 2 | 0.1%
부의하지 | 2 | 0.1%
아니하기로 | 2 | 0.1%
Other values (3) | 4 | 0.2%

본회의 처리결과
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 7
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

Length

Max length: 8
Median length: 4
Mean length: 4.0607389
Min length: 2

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 원안가결
2nd row: 원안가결
3rd row: 원안가결
4th row: 원안가결
5th row: 원안가결

Common Values

Value | Count | Frequency (%)
원안가결 | 1296 | 81.2%
수정가결 | 218 | 13.7%
임기만료 폐기 | 41 | 2.6%
<NA> | 26 | 1.6%
폐기 | 10 | 0.6%
철회 | 5 | 0.3%
징계하지 아니함 | 1 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
원안가결 | 1296 | 79.1%
수정가결 | 218 | 13.3%
폐기 | 51 | 3.1%
임기만료 | 41 | 2.5%
na | 26 | 1.6%
철회 | 5 | 0.3%
징계하지 | 1 | 0.1%
아니함 | 1 | 0.1%
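
The second table for 본회의 처리결과 counts words rather than whole values, which is why 폐기 appears 51 times there (10 as a value of its own plus 41 inside "임기만료 폐기"). The sketch below reproduces those counts from the Common Values table. The tokenization rule (lower-casing, splitting on whitespace, stripping the angle brackets around `<NA>`) is an assumption that happens to match the reported numbers, not a documented description of the tool.

```python
from collections import Counter

# Category counts copied from the Common Values table of 본회의 처리결과.
category_counts = {
    "원안가결": 1296,
    "수정가결": 218,
    "임기만료 폐기": 41,
    "<NA>": 26,
    "폐기": 10,
    "철회": 5,
    "징계하지 아니함": 1,
}

word_counts = Counter()
for value, count in category_counts.items():
    # Assumed tokenization: lower-case, split on whitespace, drop "<" and ">".
    for word in value.lower().split():
        word_counts[word.strip("<>")] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word} | {count} | {100 * count / total_words:.1f}%")
```

Because multi-word values contribute several words, the percentages are taken over 1639 words rather than 1597 rows, which explains why 원안가결 drops from 81.2% to 79.1%.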

Interactions


Correlations

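Variable | 회기 | 의안번호 | 제안자 구분 | 소관 위원회 | 소관 위원회 처리결과 | 본회의 처리결과
회기 | 1.000 | 0.990 | 0.243 | 0.551 | 0.110 | 0.117
의안번호 | 0.990 | 1.000 | 0.265 | 0.543 | 0.118 | 0.090
제안자 구분 | 0.243 | 0.265 | 1.000 | 0.773 | 0.000 | 0.154
소관 위원회 | 0.551 | 0.543 | 0.773 | 1.000 | 0.377 | 0.725
소관 위원회 처리결과 | 0.110 | 0.118 | 0.000 | 0.377 | 1.000 | 0.964
본회의 처리결과 | 0.117 | 0.090 | 0.154 | 0.725 | 0.964 | 1.000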
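
Each variable's strongest off-diagonal partner in this matrix coincides with the variable named in its High correlation alert. The sketch below extracts those partners from the matrix values; it is an observation about this report, not a description of how the tool selects which alerts to show.

```python
names = ["회기", "의안번호", "제안자 구분", "소관 위원회", "소관 위원회 처리결과", "본회의 처리결과"]

# Correlation matrix copied from the report, rows and columns in the order of `names`.
matrix = [
    [1.000, 0.990, 0.243, 0.551, 0.110, 0.117],
    [0.990, 1.000, 0.265, 0.543, 0.118, 0.090],
    [0.243, 0.265, 1.000, 0.773, 0.000, 0.154],
    [0.551, 0.543, 0.773, 1.000, 0.377, 0.725],
    [0.110, 0.118, 0.000, 0.377, 1.000, 0.964],
    [0.117, 0.090, 0.154, 0.725, 0.964, 1.000],
]

strongest = {}
for i, name in enumerate(names):
    # Index of the largest coefficient in row i, ignoring the diagonal.
    j = max((k for k in range(len(names)) if k != i), key=lambda k: matrix[i][k])
    strongest[name] = (names[j], matrix[i][j])

for name, (partner, value) in strongest.items():
    print(f"{name} -> {partner} ({value:.3f})")
```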