Overview

Dataset statistics

Number of variables9
Number of observations259
Missing cells258
Missing cells (%)11.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.6 KiB
Average record size in memory73.5 B

Variable types

Numeric1
Categorical3
Text5

Dataset

Description인천시 관내 (예비)사회적기업의 기업명, 대표자, 사회적목적유형, 업종, 서비스유형(사업내용), 관할 구, 주소 등의 정보를 제공합니다.
Author인천광역시
URLhttps://www.incheon.go.kr/data/DATA010201/view?docId=15078032

Alerts

Unnamed: 8 has constant value ""Constant
연번 is highly overall correlated with 구분 and 1 other fieldsHigh correlation
구분 is highly overall correlated with 연번High correlation
군구 is highly overall correlated with 연번High correlation
Unnamed: 8 has 258 (99.6%) missing valuesMissing
연번 has unique valuesUnique
기업명 has unique valuesUnique

Reproduction

Analysis started2024-01-28 05:04:35.037579
Analysis finished2024-01-28 05:04:35.981102
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct259
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean130
Minimum1
Maximum259
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2024-01-28T14:04:36.099245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.9
Q165.5
median130
Q3194.5
95-th percentile246.1
Maximum259
Range258
Interquartile range (IQR)129

Descriptive statistics

Standard deviation74.911058
Coefficient of variation (CV)0.57623891
Kurtosis-1.2
Mean130
Median Absolute Deviation (MAD)65
Skewness0
Sum33670
Variance5611.6667
MonotonicityStrictly increasing
2024-01-28T14:04:36.263940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
164 1
 
0.4%
166 1
 
0.4%
167 1
 
0.4%
168 1
 
0.4%
169 1
 
0.4%
170 1
 
0.4%
171 1
 
0.4%
172 1
 
0.4%
173 1
 
0.4%
Other values (249) 249
96.1%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
259 1
0.4%
258 1
0.4%
257 1
0.4%
256 1
0.4%
255 1
0.4%
254 1
0.4%
253 1
0.4%
252 1
0.4%
251 1
0.4%
250 1
0.4%

구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
인증
190 
예비(지역형)
41 
예비(부처형)
23 
예비(지역형)+예비(부처형)
 
5

Length

Max length15
Median length2
Mean length3.4864865
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인증
2nd row인증
3rd row인증
4th row인증
5th row인증

Common Values

ValueCountFrequency (%)
인증 190
73.4%
예비(지역형) 41
 
15.8%
예비(부처형) 23
 
8.9%
예비(지역형)+예비(부처형) 5
 
1.9%

Length

2024-01-28T14:04:36.386867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T14:04:36.478878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인증 190
73.4%
예비(지역형 41
 
15.8%
예비(부처형 23
 
8.9%
예비(지역형)+예비(부처형 5
 
1.9%

기업명
Text

UNIQUE 

Distinct259
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-01-28T14:04:36.654897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length16
Mean length7.4671815
Min length3

Characters and Unicode

Total characters1934
Distinct characters360
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique259 ?
Unique (%)100.0%

Sample

1st row㈜정부물품재활용
2nd row㈜두손테크
3rd row㈜인천개항
4th row㈜해피크린
5th row사회적협동조합엠커뮤니티
ValueCountFrequency (%)
사회적협동조합 8
 
2.6%
협동조합 7
 
2.3%
농업회사법인 5
 
1.6%
3
 
1.0%
사회복지법인 2
 
0.6%
손과손 2
 
0.6%
사단법인 2
 
0.6%
㈜다사랑 2
 
0.6%
㈜더원아트코리아 1
 
0.3%
㈜파이코 1
 
0.3%
Other values (275) 275
89.3%
2024-01-28T14:04:36.972322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
204
 
10.5%
59
 
3.1%
54
 
2.8%
43
 
2.2%
42
 
2.2%
40
 
2.1%
40
 
2.1%
38
 
2.0%
37
 
1.9%
37
 
1.9%
Other values (350) 1340
69.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1623
83.9%
Other Symbol 204
 
10.5%
Space Separator 59
 
3.1%
Uppercase Letter 21
 
1.1%
Close Punctuation 10
 
0.5%
Open Punctuation 10
 
0.5%
Lowercase Letter 4
 
0.2%
Other Punctuation 2
 
0.1%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
3.3%
43
 
2.6%
42
 
2.6%
40
 
2.5%
40
 
2.5%
38
 
2.3%
37
 
2.3%
37
 
2.3%
37
 
2.3%
25
 
1.5%
Other values (331) 1230
75.8%
Uppercase Letter
ValueCountFrequency (%)
E 3
14.3%
O 3
14.3%
I 3
14.3%
A 2
9.5%
P 2
9.5%
R 2
9.5%
T 2
9.5%
Z 1
 
4.8%
J 1
 
4.8%
L 1
 
4.8%
Lowercase Letter
ValueCountFrequency (%)
c 2
50.0%
n 2
50.0%
Other Symbol
ValueCountFrequency (%)
204
100.0%
Space Separator
ValueCountFrequency (%)
59
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Decimal Number
ValueCountFrequency (%)
5 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1827
94.5%
Common 82
 
4.2%
Latin 25
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
204
 
11.2%
54
 
3.0%
43
 
2.4%
42
 
2.3%
40
 
2.2%
40
 
2.2%
38
 
2.1%
37
 
2.0%
37
 
2.0%
37
 
2.0%
Other values (332) 1255
68.7%
Latin
ValueCountFrequency (%)
E 3
12.0%
O 3
12.0%
I 3
12.0%
A 2
8.0%
P 2
8.0%
R 2
8.0%
c 2
8.0%
n 2
8.0%
T 2
8.0%
Z 1
 
4.0%
Other values (3) 3
12.0%
Common
ValueCountFrequency (%)
59
72.0%
) 10
 
12.2%
( 10
 
12.2%
. 2
 
2.4%
5 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1623
83.9%
None 204
 
10.5%
ASCII 107
 
5.5%

Most frequent character per block

None
ValueCountFrequency (%)
204
100.0%
ASCII
ValueCountFrequency (%)
59
55.1%
) 10
 
9.3%
( 10
 
9.3%
E 3
 
2.8%
O 3
 
2.8%
I 3
 
2.8%
A 2
 
1.9%
P 2
 
1.9%
R 2
 
1.9%
. 2
 
1.9%
Other values (8) 11
 
10.3%
Hangul
ValueCountFrequency (%)
54
 
3.3%
43
 
2.6%
42
 
2.6%
40
 
2.5%
40
 
2.5%
38
 
2.3%
37
 
2.3%
37
 
2.3%
37
 
2.3%
25
 
1.5%
Other values (331) 1230
75.8%
Distinct257
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-01-28T14:04:37.219397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length3
Mean length3.3320463
Min length2

Characters and Unicode

Total characters863
Distinct characters160
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique255 ?
Unique (%)98.5%

Sample

1st row윤성구
2nd row지금련,오상준
3rd row장미진
4th row방미호
5th row이명선
ValueCountFrequency (%)
이기선 2
 
0.7%
장영순 2
 
0.7%
허정문 1
 
0.4%
윤혜숙 1
 
0.4%
장슬아 1
 
0.4%
김태신 1
 
0.4%
장선영 1
 
0.4%
윤성구 1
 
0.4%
양경애 1
 
0.4%
호윤기 1
 
0.4%
Other values (259) 259
95.6%
2024-01-28T14:04:37.563767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
65
 
7.5%
37
 
4.3%
29
 
3.4%
28
 
3.2%
18
 
2.1%
16
 
1.9%
16
 
1.9%
16
 
1.9%
, 16
 
1.9%
15
 
1.7%
Other values (150) 607
70.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 833
96.5%
Other Punctuation 17
 
2.0%
Space Separator 13
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
 
7.8%
37
 
4.4%
29
 
3.5%
28
 
3.4%
18
 
2.2%
16
 
1.9%
16
 
1.9%
16
 
1.9%
15
 
1.8%
15
 
1.8%
Other values (147) 578
69.4%
Other Punctuation
ValueCountFrequency (%)
, 16
94.1%
. 1
 
5.9%
Space Separator
ValueCountFrequency (%)
13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 833
96.5%
Common 30
 
3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
 
7.8%
37
 
4.4%
29
 
3.5%
28
 
3.4%
18
 
2.2%
16
 
1.9%
16
 
1.9%
16
 
1.9%
15
 
1.8%
15
 
1.8%
Other values (147) 578
69.4%
Common
ValueCountFrequency (%)
, 16
53.3%
13
43.3%
. 1
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 833
96.5%
ASCII 30
 
3.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
65
 
7.8%
37
 
4.4%
29
 
3.5%
28
 
3.4%
18
 
2.2%
16
 
1.9%
16
 
1.9%
16
 
1.9%
15
 
1.8%
15
 
1.8%
Other values (147) 578
69.4%
ASCII
ValueCountFrequency (%)
, 16
53.3%
13
43.3%
. 1
 
3.3%

업종
Categorical

Distinct14
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
제조
59 
교육
37 
기타
29 
문화예술
28 
청소
22 
Other values (9)
84 

Length

Max length4
Median length2
Mean length2.4247104
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row재활용
2nd row청소
3rd row식품
4th row청소
5th row교육

Common Values

ValueCountFrequency (%)
제조 59
22.8%
교육 37
14.3%
기타 29
11.2%
문화예술 28
10.8%
청소 22
 
8.5%
식품 20
 
7.7%
도소매 19
 
7.3%
간병가사 12
 
4.6%
건설 9
 
3.5%
IT 8
 
3.1%
Other values (4) 16
 
6.2%

Length

2024-01-28T14:04:37.687879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제조 59
22.8%
교육 37
14.3%
기타 29
11.2%
문화예술 28
10.8%
청소 22
 
8.5%
식품 20
 
7.7%
도소매 19
 
7.3%
간병가사 12
 
4.6%
건설 9
 
3.5%
it 8
 
3.1%
Other values (4) 16
 
6.2%
Distinct256
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-01-28T14:04:37.922673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/