Overview

Dataset statistics

Number of variables: 7
Number of observations: 300
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 16.8 KiB
Average record size in memory: 57.4 B

Variable types

Numeric: 1
Categorical: 3
Text: 3

Dataset

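Description: Provides the organization name, industry, service type (business description), governing district, certification status, address, and related information for (preliminary) social enterprises within Incheon.
Author: Incheon Metropolitan City (인천광역시)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15099887&srcSe=7661IVAWM27C61E190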

Alerts

연번 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
기업명 has unique values (Unique)

Reproduction

Analysis started: 2024-04-06 09:37:31.132294
Analysis finished: 2024-04-06 09:37:33.935406
Duration: 2.8 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
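
As a quick consistency check, the reported duration can be recomputed from the two timestamps. This is a minimal sketch using only the Python standard library; the variable names are illustrative and are not part of the report or of ydata-profiling.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of this report.
started = datetime.fromisoformat("2024-04-06 09:37:31.132294")
finished = datetime.fromisoformat("2024-04-06 09:37:33.935406")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()

print(f"Duration: {duration:.1f} seconds")
```

Rounded to one decimal place, the difference agrees with the 2.8 seconds shown in the report.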

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 300
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 150.5
Minimum: 1
Maximum: 300
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 15.95
Q1: 75.75
Median: 150.5
Q3: 225.25
95-th percentile: 285.05
Maximum: 300
Range: 299
Interquartile range (IQR): 149.5

Descriptive statistics

Standard deviation: 86.746758
Coefficient of variation (CV): 0.57639042
Kurtosis: -1.2
Mean: 150.5
Median Absolute Deviation (MAD): 75
Skewness: 0
Sum: 45150
Variance: 7525
Monotonicity: Strictly increasing
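
The quantile and descriptive statistics above are consistent with 연번 being the integers 1 through 300, which the minimum, maximum, distinct count, and strictly increasing monotonicity all suggest. The sketch below recomputes several of them under that assumption, using linear interpolation for percentiles and the sample (n - 1) variance. The helper `percentile` is illustrative and is not part of ydata-profiling.

```python
import math
import statistics

# Assumption: the column holds exactly the integers 1..300.
values = list(range(1, 301))

def percentile(sorted_vals, q):
    """Percentile by linear interpolation between neighbours (q in [0, 1])."""
    pos = q * (len(sorted_vals) - 1)
    lo = math.floor(pos)
    hi = math.ceil(pos)
    return sorted_vals[lo] + (pos - lo) * (sorted_vals[hi] - sorted_vals[lo])

mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)      # sample variance, divides by n - 1
std = math.sqrt(variance)
cv = std / mean                             # coefficient of variation
mad = statistics.median([abs(v - median) for v in values])

p05 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)

print(sum(values), mean, median, variance, round(std, 6), round(cv, 8), mad)
print(round(p05, 2), q1, q3, round(p95, 2), q3 - q1)
```

Under these assumptions the recomputed values agree with the figures reported above, which supports reading 연번 as a plain row counter.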
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.3%
208 | 1 | 0.3%
206 | 1 | 0.3%
205 | 1 | 0.3%
204 | 1 | 0.3%
203 | 1 | 0.3%
202 | 1 | 0.3%
201 | 1 | 0.3%
200 | 1 | 0.3%
199 | 1 | 0.3%
Other values (290) | 290 | 96.7%

Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Value | Count | Frequency (%)
300 | 1 | 0.3%
299 | 1 | 0.3%
298 | 1 | 0.3%
297 | 1 | 0.3%
296 | 1 | 0.3%
295 | 1 | 0.3%
294 | 1 | 0.3%
293 | 1 | 0.3%
292 | 1 | 0.3%
291 | 1 | 0.3%

구분 (classification)
Categorical

HIGH CORRELATION 

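Distinct: 3
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Value | Count
인증 (certified) | 204
예비(지역형) (preliminary, regional type) | 52
예비(부처형) (preliminary, ministry type) | 44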

Length

Max length: 7
Median length: 2
Mean length: 3.6
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 인증
2nd row: 인증
3rd row: 인증
4th row: 인증
5th row: 인증

Common Values

Value | Count | Frequency (%)
인증 (certified) | 204 | 68.0%
예비(지역형) (preliminary, regional type) | 52 | 17.3%
예비(부처형) (preliminary, ministry type) | 44 | 14.7%
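
The frequency column is each category's count divided by the 300 observations. A minimal sketch of that arithmetic follows, with the counts typed in from the table above; the variable names are illustrative only.

```python
from collections import Counter

# Counts copied from the Common Values table for 구분.
counts = Counter({"인증": 204, "예비(지역형)": 52, "예비(부처형)": 44})

total = sum(counts.values())

# Share of each category as a percentage, rounded to one decimal place.
shares = {label: round(100 * n / total, 1) for label, n in counts.most_common()}

for label, pct in shares.items():
    print(f"{label}: {counts[label]} ({pct}%)")
```

The counts sum to the 300 observations, and the rounded shares agree with the percentages in the table.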

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

기업명 (company name)
Text

UNIQUE 

Distinct: 300
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Length

Max length: 27
Median length: 18
Mean length: 7.6166667
Min length: 3

Characters and Unicode

Total characters: 2285
Distinct characters: 389
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
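
As an illustration of how such character properties can be read, the sketch below counts Unicode general categories for the five sample rows of 기업명 listed in this report, using Python's standard `unicodedata` module. It is not the profiler's own implementation; the function name and the small code-to-name mapping are assumptions made for this example.

```python
import unicodedata
from collections import Counter

# The five sample rows shown for 기업명 in this report.
sample_names = [
    "㈜정부물품재활용",
    "㈜두손테크",
    "㈜인천개항",
    "㈜해피크린",
    "사회적협동조합엠커뮤니티",
]

# Long names for the two-letter general-category codes that occur here.
CATEGORY_NAMES = {"Lo": "Other Letter", "So": "Other Symbol"}

def category_counts(strings):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in strings:
        # NFC keeps Hangul syllables as single precomposed code points.
        for ch in unicodedata.normalize("NFC", text):
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

counts = category_counts(sample_names)
for name, n in counts.most_common():
    print(name, n)
```

For these five rows the sketch counts 31 Other Letter characters (the Hangul syllables) and 4 Other Symbol characters (the ㈜ sign). Those are the same two categories that dominate the full column.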

Unique

Unique: 300
Unique (%): 100.0%

Sample

1st row: ㈜정부물품재활용
2nd row: ㈜두손테크
3rd row: ㈜인천개항
4th row: ㈜해피크린
5th row: 사회적협동조합엠커뮤니티

Value | Count | Frequency (%)
사회적협동조합 (social cooperative) | 13 | 3.7%
협동조합 (cooperative) | 9 | 2.5%
농업회사법인 (agricultural corporation) | 4 | 1.1%
사회복지법인 (social welfare corporation) | 2 | 0.6%
㈜다사랑 | 2 | 0.6%
인천 (Incheon) | 2 | 0.6%
손과손 | 2 | 0.6%
㈜원클릭글로벌마켓 | 1 | 0.3%
㈜위드마 | 1 | 0.3%
엠아이에스씨사회적협동조합 | 1 | 0.3%
Other values (316) | 316 | 89.5%

Most occurring characters

ValueCountFrequency (%)
233
 
10.2%
67
 
2.9%
64
 
2.8%
59
 
2.6%
52
 
2.3%
50
 
2.2%
47
 
2.1%
46
 
2.0%
45
 
2.0%
43
 
1.9%
Other values (379) 1579
69.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1920 | 84.0%
Other Symbol | 233 | 10.2%
Space Separator | 67 | 2.9%
Uppercase Letter | 29 | 1.3%
Open Punctuation | 14 | 0.6%
Close Punctuation | 14 | 0.6%
Other Punctuation | 4 | 0.2%
Lowercase Letter | 4 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
64
 
3.3%
59
 
3.1%
52
 
2.7%
50
 
2.6%
47
 
2.4%
46
 
2.4%
45
 
2.3%
43
 
2.2%
41
 
2.1%
34
 
1.8%
Other values (358) 1439
74.9%
Uppercase Letter
ValueCountFrequency (%)
O 6
20.7%
I 4
13.8%
E 3
10.3%
P 3
10.3%
L 3
10.3%
R 2
 
6.9%
T 1
 
3.4%
D 1
 
3.4%
C 1
 
3.4%
N 1
 
3.4%
Other values (4) 4
13.8%
Lowercase Letter
ValueCountFrequency (%)
n 2
50.0%
c 2
50.0%
Other Symbol
ValueCountFrequency (%)
233
100.0%
Space Separator
ValueCountFrequency (%)
67
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2153 | 94.2%
Common | 99 | 4.3%
Latin | 33 | 1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
233
 
10.8%
64
 
3.0%
59
 
2.7%
52
 
2.4%
50
 
2.3%
47
 
2.2%
46
 
2.1%
45
 
2.1%
43
 
2.0%
41
 
1.9%
Other values (359) 1473
68.4%
Latin
ValueCountFrequency (%)
O 6
18.2%
I 4
12.1%
E 3
9.1%
P 3
9.1%
L 3
9.1%
R 2
 
6.1%
n 2
 
6.1%
c 2
 
6.1%
T 1
 
3.0%
D 1
 
3.0%
Other values (6) 6
18.2%
Common
ValueCountFrequency (%)
67
67.7%
( 14
 
14.1%
) 14
 
14.1%
. 4
 
4.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1920 | 84.0%
None | 233 | 10.2%
ASCII | 132 | 5.8%

Most frequent character per block

None
ValueCountFrequency (%)
233
100.0%
ASCII
ValueCountFrequency (%)
67
50.8%
( 14
 
10.6%
) 14
 
10.6%
O 6
 
4.5%
. 4
 
3.0%
I 4
 
3.0%
E 3
 
2.3%
P 3
 
2.3%
L 3
 
2.3%
R 2
 
1.5%
Other values (10) 12
 
9.1%
Hangul
ValueCountFrequency (%)
64
 
3.3%
59
 
3.1%
52
 
2.7%
50
 
2.6%
47
 
2.4%
46
 
2.4%
45
 
2.3%
43
 
2.2%
41
 
2.1%
34
 
1.8%
Other values (358) 1439
74.9%

업종 (industry)
Categorical

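Distinct: 15
Distinct (%): 5.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Value | Count
제조 (manufacturing) | 71
교육 (education) | 45
기타 (other) | 42
문화예술 (culture and arts) | 30
도소매 (wholesale and retail) | 28
Other values (10) | 84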

Length

Max length: 4
Median length: 2
Mean length: 2.42
Min length: 2

Unique

Unique: 1
Unique (%): 0.3%

Sample

1st row: 재활용 (recycling)
2nd row: 청소 (cleaning)
3rd row: 식품 (food)
4th row: 청소 (cleaning)
5th row: 교육 (education)

Common Values

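Value | Count | Frequency (%)
제조 (manufacturing) | 71 | 23.7%
교육 (education) | 45 | 15.0%
기타 (other) | 42 | 14.0%
문화예술 (culture and arts) | 30 | 10.0%
도소매 (wholesale and retail) | 28 | 9.3%
청소 (cleaning) | 22 | 7.3%
식품 (food) | 17 | 5.7%
간병가사 (caregiving and housekeeping) | 12 | 4.0%
건설 (construction) | 11 | 3.7%
재활용 (recycling) | 7 | 2.3%
Other values (5) | 15 | 5.0%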

Length

[Figure: histogram of lengths of the category]

Distinct: 297
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Length

Max length: 111
Median length: 53
Mean length: 20.113333
Min length: 2

Characters and Unicode

Total characters: 6034
Distinct characters: 442
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 294
Unique (%): 98.0%

Sample

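1st row: 공공기관물품재활용, 사무용가구 등 (recycling of public-institution goods, office furniture, etc.)
2nd row: 건물위생관리, 경비, 방역 등 (building sanitation management, security, disinfection, etc.)
3rd row: 카페, 관광기념품, 체험 (café, tourist souvenirs, hands-on experiences)
4th row: 청소, 소독, 인테리어 (cleaning, disinfection, interior work)
5th row: 복지 서비스 (welfare services)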
ValueCountFrequency (%)
81
 
5.9%
44
 
3.2%
판매 37
 
2.7%
제조 36
 
2.6%
교육 25
 
1.8%
제공 22
 
1.6%
운영 19
 
1.4%
도소매 17
 
1.2%
개발 15
 
1.1%
제작 13
 
0.9%
Other values (776) 1070
77.6%

Most occurring characters

ValueCountFrequency (%)
1112
 
18.4%
, 323
 
5.4%
153
 
2.5%
95
 
1.6%
92
 
1.5%
92
 
1.5%
85
 
1.4%
84
 
1.4%
83
 
1.4%
81
 
1.3%
Other values (432) 3834
63.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4462 | 73.9%
Space Separator | 1112 | 18.4%
Other Punctuation | 367 | 6.1%
Uppercase Letter | 32 | 0.5%
Close Punctuation | 24 | 0.4%
Open Punctuation | 24 | 0.4%
Lowercase Letter | 6 | 0.1%
Decimal Number | 5 | 0.1%
Final Punctuation | 1 | < 0.1%
Initial Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
153
 
3.4%
95
 
2.1%
92
 
2.1%
92
 
2.1%
85
 
1.9%
84
 
1.9%
83
 
1.9%
81
 
1.8%
77
 
1.7%
73
 
1.6%
Other values (395) 3547
79.5%
Uppercase Letter
ValueCountFrequency (%)
R 4
12.5%
D 4
12.5%
C 3
9.4%
E 3
9.4%
T 2
 
6.2%
P 2
 
6.2%
L 2
 
6.2%
O 2
 
6.2%
V 2
 
6.2%
H 1
 
3.1%
Other values (7) 7
21.9%
Lowercase Letter
ValueCountFrequency (%)
l 1
16.7%
e 1
16.7%
d 1
16.7%
i 1
16.7%
c 1
16.7%
t 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 323
88.0%
/ 22
 
6.0%
· 16
 
4.4%
& 5
 
1.4%
: 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 2
40.0%
3 1
20.0%
2 1
20.0%
4 1
20.0%
Space Separator
ValueCountFrequency (%)
1112
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4462 | 73.9%
Common | 1534 | 25.4%
Latin | 38 | 0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
153
 
3.4%
95
 
2.1%
92
 
2.1%
92
 
2.1%
85
 
1.9%
84
 
1.9%
83
 
1.9%
81
 
1.8%
77
 
1.7%
73
 
1.6%
Other values (395) 3547
79.5%
Latin
ValueCountFrequency (%)
R 4
 
10.5%
D 4
 
10.5%
C 3
 
7.9%
E 3
 
7.9%
T 2
 
5.3%
P 2
 
5.3%
L 2
 
5.3%
O 2
 
5.3%
V 2
 
5.3%
l 1
 
2.6%
Other values (13) 13
34.2%
Common
ValueCountFrequency (%)
1112
72.5%
, 323
 
21.1%
) 24
 
1.6%
( 24
 
1.6%
/ 22
 
1.4%
· 16
 
1.0%
& 5
 
0.3%
1 2
 
0.1%
3 1
 
0.1%
2 1
 
0.1%
Other values (4) 4
 
0.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4462 | 73.9%
ASCII | 1554 | 25.8%
None | 16 | 0.3%
Punctuation | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1112
71.6%
, 323
 
20.8%
) 24
 
1.5%
( 24
 
1.5%
/ 22
 
1.4%
& 5
 
0.3%
R 4
 
0.3%
D 4
 
0.3%
C 3
 
0.2%
E 3
 
0.2%
Other values (24) 30
 
1.9%
Hangul
ValueCountFrequency (%)
153
 
3.4%
95
 
2.1%
92
 
2.1%
92
 
2.1%
85
 
1.9%
84
 
1.9%
83
 
1.9%
81
 
1.8%
77
 
1.7%
73
 
1.6%
Other values (395) 3547
79.5%
None
ValueCountFrequency (%)
· 16
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%

군구 (district)
Categorical

Distinct: 10
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Value | Count
남동구 | 66
서구 | 59
미추홀구 | 39
연수구 | 38
부평구 | 30
Other values (5) | 68

Length

Max length: 4
Median length: 3
Mean length: 2.8266667
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 중구
2nd row: 중구
3rd row: 중구
4th row: 중구
5th row: 중구

Common Values

Value | Count | Frequency (%)
남동구 | 66 | 22.0%
서구 | 59 | 19.7%
미추홀구 | 39 | 13.0%
연수구 | 38 | 12.7%
부평구 | 30 | 10.0%
계양구 | 20 | 6.7%
중구 | 19 | 6.3%
동구 | 13 | 4.3%
강화군 | 10 | 3.3%
옹진군 | 6 | 2.0%
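
The Length statistics reported for 군구 can be cross-checked against this table, since each district name contributes its character length once per row. The sketch below expands the counts into a list of lengths and summarizes it. The counts are typed in from the table, and the variable names are illustrative only.

```python
import statistics

# Counts copied from the Common Values table for 군구.
district_counts = {
    "남동구": 66, "서구": 59, "미추홀구": 39, "연수구": 38, "부평구": 30,
    "계양구": 20, "중구": 19, "동구": 13, "강화군": 10, "옹진군": 6,
}

# One length entry per observation (300 in total).
lengths = [len(name) for name, n in district_counts.items() for _ in range(n)]

min_len = min(lengths)
max_len = max(lengths)
median_len = statistics.median(lengths)
mean_len = statistics.mean(lengths)

print(len(lengths), min_len, median_len, round(mean_len, 7), max_len)
```

The result agrees with the reported minimum of 2, median of 3, mean of 2.8266667, and maximum of 4.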

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Distinct: 296
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB