Overview

Dataset statistics

Number of variables: 7
Number of observations: 299
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 16.8 KiB
Average record size in memory: 57.4 B

Variable types

Numeric: 1
Categorical: 3
Text: 3

Dataset

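Description: Provides information on (preliminary) social enterprises located in Incheon, including organization name, industry, service type (business activities), administrative district, certification status, and address.
Author: Incheon Metropolitan City (인천광역시)
URL: https://www.data.go.kr/data/15099887/fileData.do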

Alerts

연번 is highly overall correlated with 구 분 (High correlation)
구 분 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
기 업 명 has unique values (Unique)
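
A high association between a running index and a categorical column is what one would expect from a file sorted by category. The sketch below is only an illustration, not the report's own computation: it assumes (hypothetically) that rows 1 to 213 are 인증, rows 214 to 264 are 예비(지역형) and rows 265 to 299 are 예비(부처형), bins 연번 into ten bins, and computes Cramér's V between the binned index and the category. The row order, the bin count, and the helper name `cramers_v` are assumptions; only the category counts come from the report.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Cramér's V for two equally long sequences of discrete labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    # Pearson chi-squared statistic over every cell of the contingency table.
    chi2 = 0.0
    for r, r_tot in row_totals.items():
        for c, c_tot in col_totals.items():
            expected = r_tot * c_tot / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k))

# ASSUMPTION: the file is sorted by category; counts are taken from the report.
categories = ["인증"] * 213 + ["예비(지역형)"] * 51 + ["예비(부처형)"] * 35
index = range(1, len(categories) + 1)   # 연번: 1..299
n_bins = 10                             # hypothetical bin count
bins = [(i - 1) * n_bins // len(categories) for i in index]

v = cramers_v(bins, categories)
print(f"Cramér's V = {v:.2f}")
```

Under these assumptions the value comes out close to 1, which is consistent with the alert; a shuffled row order would give a value near 0.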

Reproduction

Analysis started: 2024-03-23 03:41:31.945882
Analysis finished: 2024-03-23 03:41:42.212692
Duration: 10.27 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
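
The reported duration can be checked against the two timestamps. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-03-23 03:41:31.945882")
finished = datetime.fromisoformat("2024-03-23 03:41:42.212692")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```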

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 299
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 150
Minimum: 1
Maximum: 299
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 15.9
Q1: 75.5
Median: 150
Q3: 224.5
95-th percentile: 284.1
Maximum: 299
Range: 298
Interquartile range (IQR): 149
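
Since 연번 is the unbroken sequence 1 to 299, the quantiles above can be re-derived without the source file. The sketch below assumes linear interpolation between neighbouring order statistics, which reproduces the reported figures; the helper name `quantile` is illustrative.

```python
def quantile(sorted_values, q):
    """Quantile with linear interpolation between neighbouring order statistics."""
    pos = (len(sorted_values) - 1) * q
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

values = list(range(1, 300))  # 연번: 1..299

p05 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1

for label, value in [("5%", p05), ("Q1", q1), ("median", median),
                     ("Q3", q3), ("95%", p95), ("IQR", iqr)]:
    print(label, round(value, 1))
```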

Descriptive statistics

Standard deviation: 86.458082
Coefficient of variation (CV): 0.57638722
Kurtosis: -1.2
Mean: 150
Median Absolute Deviation (MAD): 75
Skewness: 0
Sum: 44850
Variance: 7475
Monotonicity: Strictly increasing
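
The descriptive statistics follow directly from the sequence 1 to 299. The following is a minimal sketch using Python's standard library; it assumes the report uses the sample variance (n - 1 denominator), which is consistent with the reported 7475 (for the integers 1..n the sample variance is n(n + 1)/12).

```python
import statistics
from math import sqrt

values = list(range(1, 300))  # 연번: 1..299 with no gaps

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = sqrt(variance)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```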
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.3%
207 | 1 | 0.3%
205 | 1 | 0.3%
204 | 1 | 0.3%
203 | 1 | 0.3%
202 | 1 | 0.3%
201 | 1 | 0.3%
200 | 1 | 0.3%
199 | 1 | 0.3%
198 | 1 | 0.3%
Other values (289) | 289 | 96.7%
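Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Maximum 10 values
Value | Count | Frequency (%)
299 | 1 | 0.3%
298 | 1 | 0.3%
297 | 1 | 0.3%
296 | 1 | 0.3%
295 | 1 | 0.3%
294 | 1 | 0.3%
293 | 1 | 0.3%
292 | 1 | 0.3%
291 | 1 | 0.3%
290 | 1 | 0.3%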

구 분 (classification)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB
인증: 213
예비(지역형): 51
예비(부처형): 35

Length

Max length: 7
Median length: 2
Mean length: 3.4381271
Min length: 2
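
The length statistics can be reproduced from the three category labels and their counts. This is a minimal sketch; it assumes length means the number of Unicode code points in each label (so 예비(지역형) counts as 7), and the counts are taken from this report.

```python
import statistics

# Category counts as listed in this report.
counts = {"인증": 213, "예비(지역형)": 51, "예비(부처형)": 35}

# One length per row: repeat each label's length by its count.
lengths = [len(label) for label, count in counts.items() for _ in range(count)]

max_len = max(lengths)
min_len = min(lengths)
median_len = statistics.median(lengths)
mean_len = sum(lengths) / len(lengths)

print(max_len, median_len, round(mean_len, 7), min_len)
```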

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 인증
2nd row: 인증
3rd row: 인증
4th row: 인증
5th row: 인증

Common Values

Value | Count | Frequency (%)
인증 | 213 | 71.2%
예비(지역형) | 51 | 17.1%
예비(부처형) | 35 | 11.7%
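
The frequency table, together with the Distinct and Unique figures for this variable, can be rebuilt with a simple counter. This is a sketch under the assumption that "Distinct" counts different values and "Unique" counts values that occur exactly once; the column is reconstructed from the reported counts, not read from the source file.

```python
from collections import Counter

# Column reconstructed from the reported counts (row order is irrelevant here).
column = ["인증"] * 213 + ["예비(지역형)"] * 51 + ["예비(부처형)"] * 35

freq = Counter(column)
n = len(column)

# (value, count, percentage) rows, most frequent first.
table = [(value, count, round(100 * count / n, 1))
         for value, count in freq.most_common()]

distinct = len(freq)                                   # different values
unique = sum(1 for c in freq.values() if c == 1)       # values seen exactly once

for row in table:
    print(row)
print("distinct:", distinct, "unique:", unique)
```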

Length

Histogram of lengths of the category

Common Values (Plot)

Bar chart of the category counts: 인증 213 (71.2%), 예비(지역형) 51 (17.1%), 예비(부처형) 35 (11.7%).

기 업 명 (company name)
Text

UNIQUE 

Distinct: 299
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Length

Max length: 27
Median length: 18
Mean length: 7.722408
Min length: 3

Characters and Unicode

Total characters: 2309
Distinct characters: 399
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
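
As an illustration of these character properties, the sketch below looks up the Unicode general category of each character in the first sample row listed for this variable, using Python's standard `unicodedata` module. The mapping from two-letter category codes to the long names used in this report is written out by hand and covers only the categories reported for this variable.

```python
import unicodedata
from collections import Counter

# Long names for the general categories that appear in this variable's report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "So": "Other Symbol", "Zs": "Space Separator",
    "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation", "Po": "Other Punctuation",
}

name = "㈜정부물품재활용"  # first sample row of 기 업 명

# Count characters per general category (e.g. "Lo", "So").
categories = Counter(unicodedata.category(ch) for ch in name)

for code, count in categories.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```

Here the enclosed symbol ㈜ falls under Other Symbol while the Hangul syllables fall under Other Letter, which matches the two largest categories reported for this variable.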

Unique

Unique: 299
Unique (%): 100.0%

Sample

1st row: ㈜정부물품재활용
2nd row: ㈜두손테크
3rd row: ㈜인천개항
4th row: ㈜해피크린
5th row: 사회적협동조합엠커뮤니티
Value | Count | Frequency (%)
사회적협동조합 | 13 | 3.7%
협동조합 | 9 | 2.6%
농업회사법인 | 4 | 1.1%
㈜다사랑 | 2 | 0.6%
손과손 | 2 | 0.6%
사회복지법인 | 2 | 0.6%
인천 | 2 | 0.6%
㈜스타아트아카데미 | 1 | 0.3%
㈜유니디자인경영연구소 | 1 | 0.3%
㈜몬스터레코드 | 1 | 0.3%
Other values (314) | 314 | 89.5%

Most occurring characters

ValueCountFrequency (%)
231
 
10.0%
66
 
2.9%
66
 
2.9%
55
 
2.4%
54
 
2.3%
54
 
2.3%
52
 
2.3%
50
 
2.2%
49
 
2.1%
43
 
1.9%
Other values (389) 1589
68.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1948
84.4%
Other Symbol 231
 
10.0%
Space Separator 66
 
2.9%
Uppercase Letter 32
 
1.4%
Open Punctuation 12
 
0.5%
Close Punctuation 12
 
0.5%
Other Punctuation 4
 
0.2%
Lowercase Letter 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
66
 
3.4%
55
 
2.8%
54
 
2.8%
54
 
2.8%
52
 
2.7%
50
 
2.6%
49
 
2.5%
43
 
2.2%
40
 
2.1%
30
 
1.5%
Other values (367) 1455
74.7%
Uppercase Letter
ValueCountFrequency (%)
O 6
18.8%
I 4
12.5%
P 4
12.5%
E 3
9.4%
L 3
9.4%
R 2
 
6.2%
C 2
 
6.2%
M 1
 
3.1%
N 1
 
3.1%
S 1
 
3.1%
Other values (5) 5
15.6%
Lowercase Letter
ValueCountFrequency (%)
n 2
50.0%
c 2
50.0%
Other Symbol
ValueCountFrequency (%)
231
100.0%
Space Separator
ValueCountFrequency (%)
66
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2179
94.4%
Common 94
 
4.1%
Latin 36
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
231
 
10.6%
66
 
3.0%
55
 
2.5%
54
 
2.5%
54
 
2.5%
52
 
2.4%
50
 
2.3%
49
 
2.2%
43
 
2.0%
40
 
1.8%
Other values (368) 1485
68.2%
Latin
ValueCountFrequency (%)
O 6
16.7%
I 4
11.1%
P 4
11.1%
E 3
8.3%
L 3
8.3%
R 2
 
5.6%
n 2
 
5.6%
c 2
 
5.6%
C 2
 
5.6%
M 1
 
2.8%
Other values (7) 7
19.4%
Common
ValueCountFrequency (%)
66
70.2%
( 12
 
12.8%
) 12
 
12.8%
. 4
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1948
84.4%
None 231
 
10.0%
ASCII 130
 
5.6%

Most frequent character per block

None
ValueCountFrequency (%)
231
100.0%
Hangul
ValueCountFrequency (%)
66
 
3.4%
55
 
2.8%
54
 
2.8%
54
 
2.8%
52
 
2.7%
50
 
2.6%
49
 
2.5%
43
 
2.2%
40
 
2.1%
30
 
1.5%
Other values (367) 1455
74.7%
ASCII
ValueCountFrequency (%)
66
50.8%
( 12
 
9.2%
) 12
 
9.2%
O 6
 
4.6%
. 4
 
3.1%
I 4
 
3.1%
P 4
 
3.1%
E 3
 
2.3%
L 3
 
2.3%
R 2
 
1.5%
Other values (11) 14
 
10.8%

업종 (industry)
Categorical

Distinct: 15
Distinct (%): 5.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB
제조: 66
교육: 52
기타: 44
문화예술: 30
도소매: 25
Other values (10): 82

Length

Max length: 4
Median length: 2
Mean length: 2.4080268
Min length: 2

Unique

Unique: 1
Unique (%): 0.3%

Sample

1st row: 재활용
2nd row: 청소
3rd row: 식품
4th row: 청소
5th row: 교육

Common Values

Value | Count | Frequency (%)
제조 | 66 | 22.1%
교육 | 52 | 17.4%
기타 | 44 | 14.7%
문화예술 | 30 | 10.0%
도소매 | 25 | 8.4%
청소 | 23 | 7.7%
식품 | 16 | 5.4%
간병가사 | 12 | 4.0%
건설 | 11 | 3.7%
재활용 | 6 | 2.0%
Other values (5) | 14 | 4.7%

Length

Histogram of lengths of the category
Distinct: 292
Distinct (%): 97.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Length

Max length: 111
Median length: 52
Mean length: 19.866221
Min length: 2

Characters and Unicode

Total characters: 5940
Distinct characters: 440
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 286
Unique (%): 95.7%

Sample

1st row: 공공기관물품재활용, 사무용가구 등
2nd row: 건물위생관리, 경비, 방역 등
3rd row: 카페, 관광기념품, 체험
4th row: 청소, 소독, 인테리어
5th row: 복지 서비스
ValueCountFrequency (%)
89
 
6.6%
47
 
3.5%
판매 32
 
2.4%
제조 31
 
2.3%
교육 25
 
1.9%
제공 20
 
1.5%
운영 16
 
1.2%
도소매 15
 
1.1%
개발 15
 
1.1%
서비스 11
 
0.8%
Other values (761) 1043
77.6%

Most occurring characters

ValueCountFrequency (%)
1073
 
18.1%
, 321
 
5.4%
146
 
2.5%
97
 
1.6%
91
 
1.5%
88
 
1.5%
87
 
1.5%
86
 
1.4%
85
 
1.4%
79
 
1.3%
Other values (430) 3787
63.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4404
74.1%
Space Separator 1073
 
18.1%
Other Punctuation 362
 
6.1%
Uppercase Letter 33
 
0.6%
Close Punctuation 27
 
0.5%
Open Punctuation 27
 
0.5%
Lowercase Letter 6
 
0.1%
Decimal Number 4
 
0.1%
Other Symbol 2
 
< 0.1%
Initial Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
146
 
3.3%
97
 
2.2%
91
 
2.1%
88
 
2.0%
87
 
2.0%
86
 
2.0%
85
 
1.9%
79
 
1.8%
75
 
1.7%
73
 
1.7%
Other values (393) 3497
79.4%
Uppercase Letter
ValueCountFrequency (%)
R 4
12.1%
C 3
9.1%
P 3
9.1%
E 3
9.1%
D 3
9.1%
T 2
 
6.1%
J 2
 
6.1%
V 2
 
6.1%
O 2
 
6.1%
L 2
 
6.1%
Other values (7) 7
21.2%
Lowercase Letter
ValueCountFrequency (%)
d 1
16.7%
l 1
16.7%
e 1
16.7%
t 1
16.7%
c 1
16.7%
i 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 321
88.7%
/ 22
 
6.1%
· 15
 
4.1%
& 3
 
0.8%
: 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
4 1
25.0%
2 1
25.0%
Space Separator
ValueCountFrequency (%)
1073
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Other Symbol
ValueCountFrequency (%)
© 2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4404
74.1%
Common 1497
 
25.2%
Latin 39
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
146
 
3.3%
97
 
2.2%
91
 
2.1%
88
 
2.0%
87
 
2.0%
86
 
2.0%
85
 
1.9%
79
 
1.8%
75
 
1.7%
73
 
1.7%
Other values (393) 3497
79.4%
Latin
ValueCountFrequency (%)
R 4
 
10.3%
C 3
 
7.7%
P 3
 
7.7%
E 3
 
7.7%
D 3
 
7.7%
T 2
 
5.1%
J 2
 
5.1%
V 2
 
5.1%
O 2
 
5.1%
L 2
 
5.1%
Other values (13) 13
33.3%
Common
ValueCountFrequency (%)
1073
71.7%
, 321
 
21.4%
) 27
 
1.8%
( 27
 
1.8%
/ 22
 
1.5%
· 15
 
1.0%
& 3
 
0.2%
1 2
 
0.1%
© 2
 
0.1%
4 1
 
0.1%
Other values (4) 4
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4404
74.1%
ASCII 1517
 
25.5%
None 17
 
0.3%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1073
70.7%
, 321
 
21.2%
) 27
 
1.8%
( 27
 
1.8%
/ 22
 
1.5%
R 4
 
0.3%
& 3
 
0.2%
C 3
 
0.2%
P 3
 
0.2%
E 3
 
0.2%
Other values (23) 31
 
2.0%
Hangul
ValueCountFrequency (%)
146
 
3.3%
97
 
2.2%
91
 
2.1%
88
 
2.0%
87
 
2.0%
86
 
2.0%
85
 
1.9%
79
 
1.8%
75
 
1.7%
73
 
1.7%
Other values (393) 3497
79.4%
None
ValueCountFrequency (%)
· 15
88.2%
© 2
 
11.8%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%

군구 (district)
Categorical

Distinct: 10
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB
남동구: 69
서구: 61
미추홀구: 36
연수구: 35
부평구: 30
Other values (5): 68

Length

Max length: 4
Median length: 3
Mean length: 2.812709
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 중구
2nd row: 중구
3rd row: 중구
4th row: 중구
5th row: 중구

Common Values

Value | Count | Frequency (%)
남동구 | 69 | 23.1%
서구 | 61 | 20.4%
미추홀구 | 36 | 12.0%
연수구 | 35 | 11.7%
부평구 | 30 | 10.0%
계양구 | 20 | 6.7%
중구 | 17 | 5.7%
동구 | 14 | 4.7%
강화군 | 11 | 3.7%
옹진군 | 6 | 2.0%
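
The variable overview for 군구 lists only the five most frequent districts and folds the rest into "Other values (5)". The sketch below shows one way that summary could be derived from the full table; the cut-off of five is read off the overview, and the variable names are illustrative.

```python
from collections import Counter

# District counts from the Common Values table.
counts = Counter({
    "남동구": 69, "서구": 61, "미추홀구": 36, "연수구": 35, "부평구": 30,
    "계양구": 20, "중구": 17, "동구": 14, "강화군": 11, "옹진군": 6,
})

top = counts.most_common(5)                     # five most frequent districts
total = sum(counts.values())
other_values = len(counts) - len(top)           # how many districts were folded
other_count = total - sum(c for _, c in top)    # rows covered by the folded ones

for district, count in top:
    print(district, count)
print(f"Other values ({other_values})", other_count)
```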

Length

Histogram of lengths of the category

Common Values (Plot)

Bar chart of the category counts listed under Common Values above, from 남동구 (69, 23.1%) down to 옹진군 (6, 2.0%).
Distinct: 295
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB