Overview

Dataset statistics

Number of variables: 5
Number of observations: 158
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.5 KiB
Average record size in memory: 41.8 B

Variable types

Numeric: 1
Categorical: 2
Text: 2

Dataset

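Description: This dataset merges the "good company" (착한기업) records for the Chungbuk region that are maintained by three institutions relocated to Chungbuk Innovation City: the Korea Employment Information Service (한국고용정보원), the Korea Consumer Agency (한국소비자원), and the Korea Gas Safety Corporation (한국가스안전공사). It provides the Korea Consumer Agency's CCM-certified companies (CCM 인증기업), the Korea Gas Safety Corporation's certified excellent LPG retailers (우수 LPG판매 인증업체), and the Korea Employment Information Service's youth-friendly small-but-strong companies (청년친화강소기업).
URL: https://www.data.go.kr/data/15105719/fileData.do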

Alerts

번호 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 번호 and 1 other field (High correlation)
업종 is highly overall correlated with 구분 (High correlation)
업종 is highly imbalanced (57.6%) (Imbalance)
번호 has unique values (Unique)
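The 57.6% imbalance figure for 업종 can be reproduced from the category counts listed in that variable's Common Values table. The sketch below assumes the score is one minus the normalized Shannon entropy of the class frequencies, which matches the reported value. The helper name `imbalance_score` is hypothetical and is not claimed to be part of the profiling library's public API.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k) for class counts.

    Hypothetical helper: 0 means perfectly balanced classes, and the score
    approaches 1 as a single class dominates. Assumes an entropy-based
    definition of imbalance.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    # Shannon entropy of the empirical class distribution (natural log).
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    # Normalize by the maximum possible entropy for k classes.
    return 1.0 - entropy / math.log(k)


# Category counts for 업종 as listed in its Common Values table.
upjong_counts = [115, 30, 5, 3, 2, 1, 1, 1]
score = imbalance_score(upjong_counts)
print(f"{score:.1%}")
```

Because the ratio of two logarithms is base-independent, using natural logs or base-2 logs gives the same score.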

Reproduction

Analysis started: 2023-12-12 03:05:54.046291
Analysis finished: 2023-12-12 03:05:54.745957
Duration: 0.7 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
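The reported duration follows directly from the two timestamps. A minimal check, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-12 03:05:54.046291")
finished = datetime.fromisoformat("2023-12-12 03:05:54.745957")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.1f} seconds")
```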

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 158
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 79.5
Minimum: 1
Maximum: 158
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 8.85
Q1: 40.25
Median: 79.5
Q3: 118.75
95-th percentile: 150.15
Maximum: 158
Range: 157
Interquartile range (IQR): 78.5

Descriptive statistics

Standard deviation: 45.754781
Coefficient of variation (CV): 0.57553184
Kurtosis: -1.2
Mean: 79.5
Median Absolute Deviation (MAD): 39.5
Skewness: 0
Sum: 12561
Variance: 2093.5
Monotonicity: Strictly increasing
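These figures are what one would expect if 번호 is simply the row index 1 through 158. That is an assumption, but it is consistent with the reported minimum, maximum, distinct count, sum, and strict monotonicity. The sketch below recomputes the statistics with Python's standard library, using the sample (n - 1) variance and linearly interpolated quantiles.

```python
import statistics

# Assumption: 번호 takes exactly the integer values 1..158.
values = list(range(1, 159))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

# method="inclusive" interpolates linearly between order statistics.
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

print(total, mean, variance, round(std, 6), round(cv, 8))
print(p5, q1, median, q3, p95, q3 - q1, mad)
```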
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.6%
110 | 1 | 0.6%
103 | 1 | 0.6%
104 | 1 | 0.6%
105 | 1 | 0.6%
106 | 1 | 0.6%
107 | 1 | 0.6%
108 | 1 | 0.6%
109 | 1 | 0.6%
111 | 1 | 0.6%
Other values (148) | 148 | 93.7%
Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.6%
2 | 1 | 0.6%
3 | 1 | 0.6%
4 | 1 | 0.6%
5 | 1 | 0.6%
6 | 1 | 0.6%
7 | 1 | 0.6%
8 | 1 | 0.6%
9 | 1 | 0.6%
10 | 1 | 0.6%
Maximum 10 values
Value | Count | Frequency (%)
158 | 1 | 0.6%
157 | 1 | 0.6%
156 | 1 | 0.6%
155 | 1 | 0.6%
154 | 1 | 0.6%
153 | 1 | 0.6%
152 | 1 | 0.6%
151 | 1 | 0.6%
150 | 1 | 0.6%
149 | 1 | 0.6%

구분
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
우수 LPG판매 인증업체: 115
청년친화강소기업: 33
CCM 인증기업: 10

Length

Max length: 13
Median length: 13
Mean length: 11.639241
Min length: 8
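The length statistics follow from the three category labels and their counts. A short sketch, assuming length is measured in Unicode code points of the precomposed (NFC) strings:

```python
import statistics

# Category labels and row counts for 구분, as reported above.
category_counts = {
    "우수 LPG판매 인증업체": 115,
    "청년친화강소기업": 33,
    "CCM 인증기업": 10,
}

# One length entry per row of the dataset.
lengths = [
    len(label)
    for label, count in category_counts.items()
    for _ in range(count)
]

length_stats = {
    "max": max(lengths),
    "median": statistics.median(lengths),
    "mean": round(statistics.mean(lengths), 6),
    "min": min(lengths),
}
print(length_stats)
```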

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 청년친화강소기업
2nd row: 청년친화강소기업
3rd row: 청년친화강소기업
4th row: 청년친화강소기업
5th row: 청년친화강소기업

Common Values

Value | Count | Frequency (%)
우수 LPG판매 인증업체 | 115 | 72.8%
청년친화강소기업 | 33 | 20.9%
CCM 인증기업 | 10 | 6.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
우수 | 115 | 28.9%
lpg판매 | 115 | 28.9%
인증업체 | 115 | 28.9%
청년친화강소기업 | 33 | 8.3%
ccm | 10 | 2.5%
인증기업 | 10 | 2.5%
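The word-level table above is consistent with lowercasing each 구분 label and splitting it on whitespace. That tokenization rule is inferred from the lowercase `lpg판매` and `ccm` entries, not documented in the report. A sketch under that assumption:

```python
from collections import Counter

# Category labels and row counts for 구분.
category_counts = {
    "우수 LPG판매 인증업체": 115,
    "청년친화강소기업": 33,
    "CCM 인증기업": 10,
}

# Assumed tokenization: lowercase, then split on whitespace.
word_counts = Counter()
for label, count in category_counts.items():
    for word in label.lower().split():
        word_counts[word] += count

total_words = sum(word_counts.values())
word_table = {
    word: (count, round(100 * count / total_words, 1))
    for word, count in word_counts.items()
}
for word, (count, pct) in word_table.items():
    print(f"{word} | {count} | {pct}%")
```

Percentages here are relative to the total number of words (398), not the number of rows (158), which is why they differ from the Common Values table.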
Distinct: 155
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 15
Median length: 12
Mean length: 6.5886076
Min length: 4

Characters and Unicode

Total characters: 1041
Distinct characters: 203
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 152
Unique (%): 96.2%

Sample

1st row: (주)위즈너
2nd row: 케이엠텍(주)
3rd row: (주)엠플러스
4th row: 주식회사 탑프라
5th row: DCT 머티리얼
Value | Count | Frequency (%)
주식회사 | 9 | 5.3%
현대에너지 | 2 | 1.2%
삼성가스 | 2 | 1.2%
대성가스 | 2 | 1.2%
가스뱅크 | 1 | 0.6%
무궁화가스상사 | 1 | 0.6%
한국가스 | 1 | 0.6%
부산종합가스 | 1 | 0.6%
삼호가스 | 1 | 0.6%
주)위즈너 | 1 | 0.6%
Other values (148) | 148 | 87.6%

Most occurring characters

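Value | Count | Frequency (%)
(missing) | 80 | 7.7%
(missing) | 72 | 6.9%
(missing) | 63 | 6.1%
(missing) | 49 | 4.7%
( | 47 | 4.5%
) | 47 | 4.5%
(missing) | 46 | 4.4%
(missing) | 44 | 4.2%
(missing) | 20 | 1.9%
(missing) | 18 | 1.7%
Other values (193) | 555 | 53.3%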

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 913 | 87.7%
Open Punctuation | 47 | 4.5%
Close Punctuation | 47 | 4.5%
Other Symbol | 14 | 1.3%
Space Separator | 12 | 1.2%
Uppercase Letter | 7 | 0.7%
Other Punctuation | 1 | 0.1%
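The category names in this table correspond to Unicode general categories (for example `Lo` for Other Letter, `Ps` for Open Punctuation). As an illustration only, the sketch below counts categories for the five sample rows shown under Sample, using Python's `unicodedata` module. It does not reproduce the full-column totals, which would require all 158 values.

```python
import unicodedata
from collections import Counter

# The five sample values listed in the Sample block for this variable.
sample_names = [
    "(주)위즈너",
    "케이엠텍(주)",
    "(주)엠플러스",
    "주식회사 탑프라",
    "DCT 머티리얼",
]

# Short general-category codes mapped to the long names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
}

sample_categories = Counter(
    unicodedata.category(ch) for name in sample_names for ch in name
)
for code, count in sample_categories.most_common():
    print(f"{CATEGORY_NAMES.get(code, code)} | {count}")
```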

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(missing) | 80 | 8.8%
(missing) | 72 | 7.9%
(missing) | 63 | 6.9%
(missing) | 49 | 5.4%
(missing) | 46 | 5.0%
(missing) | 44 | 4.8%
(missing) | 20 | 2.2%
(missing) | 18 | 2.0%
(missing) | 17 | 1.9%
(missing) | 17 | 1.9%
Other values (183) | 487 | 53.3%

Uppercase Letter
Value | Count | Frequency (%)
S | 2 | 28.6%
K | 2 | 28.6%
T | 1 | 14.3%
C | 1 | 14.3%
D | 1 | 14.3%

Open Punctuation
Value | Count | Frequency (%)
( | 47 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 47 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(missing) | 14 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 12 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 927 | 89.0%
Common | 107 | 10.3%
Latin | 7 | 0.7%

Most frequent character per script

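Hangul
Value | Count | Frequency (%)
(missing) | 80 | 8.6%
(missing) | 72 | 7.8%
(missing) | 63 | 6.8%
(missing) | 49 | 5.3%
(missing) | 46 | 5.0%
(missing) | 44 | 4.7%
(missing) | 20 | 2.2%
(missing) | 18 | 1.9%
(missing) | 17 | 1.8%
(missing) | 17 | 1.8%
Other values (184) | 501 | 54.0%

Latin
Value | Count | Frequency (%)
S | 2 | 28.6%
K | 2 | 28.6%
T | 1 | 14.3%
C | 1 | 14.3%
D | 1 | 14.3%

Common
Value | Count | Frequency (%)
( | 47 | 43.9%
) | 47 | 43.9%
(space) | 12 | 11.2%
, | 1 | 0.9%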

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 913 | 87.7%
ASCII | 114 | 11.0%
None | 14 | 1.3%

Most frequent character per block

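Hangul
Value | Count | Frequency (%)
(missing) | 80 | 8.8%
(missing) | 72 | 7.9%
(missing) | 63 | 6.9%
(missing) | 49 | 5.4%
(missing) | 46 | 5.0%
(missing) | 44 | 4.8%
(missing) | 20 | 2.2%
(missing) | 18 | 2.0%
(missing) | 17 | 1.9%
(missing) | 17 | 1.9%
Other values (183) | 487 | 53.3%

ASCII
Value | Count | Frequency (%)
( | 47 | 41.2%
) | 47 | 41.2%
(space) | 12 | 10.5%
S | 2 | 1.8%
K | 2 | 1.8%
, | 1 | 0.9%
T | 1 | 0.9%
C | 1 | 0.9%
D | 1 | 0.9%

None
Value | Count | Frequency (%)
(missing) | 14 | 100.0%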

주소
Text

Distinct: 154
Distinct (%): 97.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 42
Median length: 29
Mean length: 20.753165
Min length: 12

Characters and Unicode

Total characters: 3279
Distinct characters: 221
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 152
Unique (%): 96.2%

Sample

1st row: 충청북도 청주시 청원구 상당로 314, 341호(내덕동,청주첨단문화산업단지)
2nd row: 충청북도 청주시 청원구 오창읍 각리1길 33
3rd row: 충청북도 청주시 흥덕구 옥산면 옥산산단로 27
4th row: 충청북도 음성군 맹동면 맹동산단로 37-20
5th row: 충청북도 진천군 덕산읍 신척산단1로 47
Value | Count | Frequency (%)
충청북도 | 43 | 5.6%
경기 | 32 | 4.2%
청주시 | 23 | 3.0%
부산 | 13 | 1.7%
흥덕구 | 10 | 1.3%
서울 | 10 | 1.3%
음성군 | 9 | 1.2%
경남 | 8 | 1.0%
강원 | 7 | 0.9%
진천군 | 7 | 0.9%
Other values (467) | 607 | 78.9%

Most occurring characters

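Value | Count | Frequency (%)
(space) | 632 | 19.3%
(missing) | 133 | 4.1%
1 | 119 | 3.6%
(missing) | 90 | 2.7%
2 | 77 | 2.3%
(missing) | 74 | 2.3%
3 | 73 | 2.2%
(missing) | 71 | 2.2%
(missing) | 64 | 2.0%
(missing) | 63 | 1.9%
Other values (211) | 1883 | 57.4%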

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1972 | 60.1%
Space Separator | 632 | 19.3%
Decimal Number | 573 | 17.5%
Dash Punctuation | 43 | 1.3%
Open Punctuation | 22 | 0.7%
Close Punctuation | 22 | 0.7%
Other Punctuation | 12 | 0.4%
Uppercase Letter | 2 | 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(missing) | 133 | 6.7%
(missing) | 90 | 4.6%
(missing) | 74 | 3.8%
(missing) | 71 | 3.6%
(missing) | 64 | 3.2%
(missing) | 63 | 3.2%
(missing) | 60 | 3.0%
(missing) | 56 | 2.8%
(missing) | 53 | 2.7%
(missing) | 53 | 2.7%
Other values (193) | 1255 | 63.6%

Decimal Number
Value | Count | Frequency (%)
1 | 119 | 20.8%
2 | 77 | 13.4%
3 | 73 | 12.7%
4 | 60 | 10.5%
5 | 57 | 9.9%
6 | 44 | 7.7%
0 | 43 | 7.5%
7 | 38 | 6.6%
8 | 34 | 5.9%
9 | 28 | 4.9%

Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 50.0%
A | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 632 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 43 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 22 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 22 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 12 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1972 | 60.1%
Common | 1305 | 39.8%
Latin | 2 | 0.1%

Most frequent character per script

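Hangul
Value | Count | Frequency (%)
(missing) | 133 | 6.7%
(missing) | 90 | 4.6%
(missing) | 74 | 3.8%
(missing) | 71 | 3.6%
(missing) | 64 | 3.2%
(missing) | 63 | 3.2%
(missing) | 60 | 3.0%
(missing) | 56 | 2.8%
(missing) | 53 | 2.7%
(missing) | 53 | 2.7%
Other values (193) | 1255 | 63.6%

Common
Value | Count | Frequency (%)
(space) | 632 | 48.4%
1 | 119 | 9.1%
2 | 77 | 5.9%
3 | 73 | 5.6%
4 | 60 | 4.6%
5 | 57 | 4.4%
6 | 44 | 3.4%
0 | 43 | 3.3%
- | 43 | 3.3%
7 | 38 | 2.9%
Other values (6) | 119 | 9.1%

Latin
Value | Count | Frequency (%)
B | 1 | 50.0%
A | 1 | 50.0%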

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1972 | 60.1%
ASCII | 1307 | 39.9%

Most frequent character per block

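ASCII
Value | Count | Frequency (%)
(space) | 632 | 48.4%
1 | 119 | 9.1%
2 | 77 | 5.9%
3 | 73 | 5.6%
4 | 60 | 4.6%
5 | 57 | 4.4%
6 | 44 | 3.4%
0 | 43 | 3.3%
- | 43 | 3.3%
7 | 38 | 2.9%
Other values (8) | 121 | 9.3%

Hangul
Value | Count | Frequency (%)
(missing) | 133 | 6.7%
(missing) | 90 | 4.6%
(missing) | 74 | 3.8%
(missing) | 71 | 3.6%
(missing) | 64 | 3.2%
(missing) | 63 | 3.2%
(missing) | 60 | 3.0%
(missing) | 56 | 2.8%
(missing) | 53 | 2.7%
(missing) | 53 | 2.7%
Other values (193) | 1255 | 63.6%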

업종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 8
Distinct (%): 5.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
판매업: 115
제조업: 30
전문, 과학 및 기술 서비스업: 5
정보통신업: 3
서비스업: 2
Other values (3): 3

Length

Max length: 16
Median length: 3
Mean length: 3.5696203
Min length: 3

Unique

Unique: 3
Unique (%): 1.9%
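"Distinct" and "Unique" measure different things here: distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch below reproduces both figures for 업종 from the counts in its Common Values table; the variable names are illustrative.

```python
# Value counts for 업종, copied from its Common Values table.
value_counts = {
    "판매업": 115,
    "제조업": 30,
    "전문, 과학 및 기술 서비스업": 5,
    "정보통신업": 3,
    "서비스업": 2,
    "보건업 및 사회복지 서비스업": 1,
    "도매 및 소매업": 1,
    "금융업": 1,
}

n_rows = sum(value_counts.values())

# Distinct: number of different values that appear at all.
distinct = len(value_counts)
# Unique: number of values that appear exactly once.
unique = sum(1 for count in value_counts.values() if count == 1)

distinct_pct = round(100 * distinct / n_rows, 1)
unique_pct = round(100 * unique / n_rows, 1)
print(distinct, distinct_pct, unique, unique_pct)
```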

Sample

1st row: 정보통신업
2nd row: 제조업
3rd row: 제조업
4th row: 제조업
5th row: 전문, 과학 및 기술 서비스업

Common Values

Value | Count | Frequency (%)
판매업 | 115 | 72.8%
제조업 | 30 | 19.0%
전문, 과학 및 기술 서비스업 | 5 | 3.2%
정보통신업 | 3 | 1.9%
서비스업 | 2 | 1.3%
보건업 및 사회복지 서비스업 | 1 | 0.6%
도매 및 소매업 | 1 | 0.6%
금융업 | 1 | 0.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
판매업 | 115 | 62.8%
제조업 | 30 | 16.4%
서비스업 | 8 | 4.4%
및 | 7 | 3.8%
전문 | 5 | 2.7%
과학 | 5 | 2.7%
기술 | 5 | 2.7%
정보통신업 | 3 | 1.6%
보건업 | 1 | 0.5%
사회복지 | 1 | 0.5%
Other values (3) | 3 | 1.6%

Interactions


Correlations

Variable | 번호 | 구분 | 업종
번호 | 1.000 | 0.914 | 0.609
구분 | 0.914 | 1.000 | 0.847
업종 | 0.609 | 0.847 | 1.000
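The report does not state which association measure fills this matrix. For pairs involving categorical columns, one common choice is Cramér's V computed from a contingency table, and the profiling tool may additionally discretize numeric columns and apply a bias correction. The sketch below shows the plain, uncorrected Cramér's V as an assumption-laden illustration. The helper `cramers_v` and the toy tables are hypothetical and are not derived from this dataset, so the function is not expected to reproduce the values above.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table (list of rows).

    Hypothetical helper for illustration; returns a value in [0, 1].
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence model.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k <= 0:
        return 0.0
    return math.sqrt(chi2 / (n * k))


# Toy tables (not from the dataset): perfect association vs. independence.
perfect = [[10, 0], [0, 10]]
independent = [[5, 5], [5, 5]]
print(cramers_v(perfect), cramers_v(independent))
```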