Overview

Dataset statistics

Number of variables16
Number of observations39
Missing cells24
Missing cells (%)3.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.1 KiB
Average record size in memory134.4 B

Variable types

Numeric3
Text8
Categorical2
Boolean2
DateTime1

Dataset

Description인천광역시 미추홀구 사회적기업 현황에 관한 데이터로 기관명, 분류, 유형, 사업내용, 인증여부, 인증일자, 대표자명, 전화번호, 주소, 위도, 경도 등을 제공합니다.
URLhttps://www.data.go.kr/data/15085542/fileData.do

Alerts

연번 is highly overall correlated with 인증여부 and 1 other fieldsHigh correlation
인증여부 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
예비여부 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
분류 is highly imbalanced (60.6%)Imbalance
전화번호 has 3 (7.7%) missing valuesMissing
홈페이지주소 has 21 (53.8%) missing valuesMissing
연번 has unique valuesUnique
기관명 has unique valuesUnique
인증번호 has unique valuesUnique
대표자명 has unique valuesUnique
도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:32:26.962230
Analysis finished2023-12-12 14:32:29.562290
Duration2.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct39
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20
Minimum1
Maximum39
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size483.0 B
2023-12-12T23:32:29.649124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.9
Q110.5
median20
Q329.5
95-th percentile37.1
Maximum39
Range38
Interquartile range (IQR)19

Descriptive statistics

Standard deviation11.401754
Coefficient of variation (CV)0.57008771
Kurtosis-1.2
Mean20
Median Absolute Deviation (MAD)10
Skewness0
Sum780
Variance130
MonotonicityStrictly increasing
2023-12-12T23:32:29.870895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
1 1
 
2.6%
2 1
 
2.6%
23 1
 
2.6%
24 1
 
2.6%
25 1
 
2.6%
26 1
 
2.6%
27 1
 
2.6%
28 1
 
2.6%
29 1
 
2.6%
30 1
 
2.6%
Other values (29) 29
74.4%
ValueCountFrequency (%)
1 1
2.6%
2 1
2.6%
3 1
2.6%
4 1
2.6%
5 1
2.6%
6 1
2.6%
7 1
2.6%
8 1
2.6%
9 1
2.6%
10 1
2.6%
ValueCountFrequency (%)
39 1
2.6%
38 1
2.6%
37 1
2.6%
36 1
2.6%
35 1
2.6%
34 1
2.6%
33 1
2.6%
32 1
2.6%
31 1
2.6%
30 1
2.6%

기관명
Text

UNIQUE 

Distinct39
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size444.0 B
2023-12-12T23:32:30.141674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length12
Mean length7.2051282
Min length2

Characters and Unicode

Total characters281
Distinct characters137
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)100.0%

Sample

1st row㈜최고의환한미소
2nd row모씨네사회적협동조합
3rd row사회복지법인 손사손사업단 예림일터
4th row㈜재미난나
5th row인천로컬푸드생산자협동조합
ValueCountFrequency (%)
사회적협동조합 2
 
4.3%
㈜최고의환한미소 1
 
2.2%
행복도시락 1
 
2.2%
㈜주 1
 
2.2%
㈜로하스컴퍼니 1
 
2.2%
㈜채가 1
 
2.2%
도서관학교 1
 
2.2%
한국렌탈판매협동조합 1
 
2.2%
㈜은하수팩토리 1
 
2.2%
㈜예솜 1
 
2.2%
Other values (35) 35
76.1%
2023-12-12T23:32:30.568646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
 
10.0%
12
 
4.3%
9
 
3.2%
7
 
2.5%
6
 
2.1%
6
 
2.1%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.8%
Other values (127) 190
67.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 242
86.1%
Other Symbol 28
 
10.0%
Space Separator 7
 
2.5%
Open Punctuation 2
 
0.7%
Close Punctuation 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
5.0%
9
 
3.7%
6
 
2.5%
6
 
2.5%
6
 
2.5%
6
 
2.5%
6
 
2.5%
5
 
2.1%
4
 
1.7%
4
 
1.7%
Other values (123) 178
73.6%
Other Symbol
ValueCountFrequency (%)
28
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 270
96.1%
Common 11
 
3.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
 
10.4%
12
 
4.4%
9
 
3.3%
6
 
2.2%
6
 
2.2%
6
 
2.2%
6
 
2.2%
6
 
2.2%
5
 
1.9%
4
 
1.5%
Other values (124) 182
67.4%
Common
ValueCountFrequency (%)
7
63.6%
( 2
 
18.2%
) 2
 
18.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 242
86.1%
None 28
 
10.0%
ASCII 11
 
3.9%

Most frequent character per block

None
ValueCountFrequency (%)
28
100.0%
Hangul
ValueCountFrequency (%)
12
 
5.0%
9
 
3.7%
6
 
2.5%
6
 
2.5%
6
 
2.5%
6
 
2.5%
6
 
2.5%
5
 
2.1%
4
 
1.7%
4
 
1.7%
Other values (123) 178
73.6%
ASCII
ValueCountFrequency (%)
7
63.6%
( 2
 
18.2%
) 2
 
18.2%

분류
Categorical

IMBALANCE 

Distinct5
Distinct (%)12.8%
Missing0
Missing (%)0.0%
Memory size444.0 B
기타
33 
간병및가사지원
 
2
청소
 
2
예술·관광
 
1
보육
 
1

Length

Max length7
Median length2
Mean length2.3333333
Min length2

Unique

Unique2 ?
Unique (%)5.1%

Sample

1st row기타
2nd row예술·관광
3rd row기타
4th row기타
5th row기타

Common Values

ValueCountFrequency (%)
기타 33
84.6%
간병및가사지원 2
 
5.1%
청소 2
 
5.1%
예술·관광 1
 
2.6%
보육 1
 
2.6%

Length

2023-12-12T23:32:30.750253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:32:30.885015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 33
84.6%
간병및가사지원 2
 
5.1%
청소 2
 
5.1%
예술·관광 1
 
2.6%
보육 1
 
2.6%

유형
Categorical

Distinct4
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Memory size444.0 B
일자리제공형
26 
창의혁신형(기타)
10 
혼합형
 
2
지역사회공헌형
 
1

Length

Max length9
Median length6
Mean length6.6410256
Min length3

Unique

Unique1 ?
Unique (%)2.6%

Sample

1st row일자리제공형
2nd row일자리제공형
3rd row일자리제공형
4th row창의혁신형(기타)
5th row창의혁신형(기타)

Common Values

ValueCountFrequency (%)
일자리제공형 26
66.7%
창의혁신형(기타) 10
 
25.6%
혼합형 2
 
5.1%
지역사회공헌형 1
 
2.6%

Length

2023-12-12T23:32:31.023322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:32:31.176950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일자리제공형 26
66.7%
창의혁신형(기타 10
 
25.6%
혼합형 2
 
5.1%
지역사회공헌형 1
 
2.6%
Distinct38
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size444.0 B
2023-12-12T23:32:31.426688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length10.102564
Min length4

Characters and Unicode

Total characters394
Distinct characters134
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)94.9%

Sample

1st row빈집 리모델링
2nd row상영 미디어교육 영상제작등
3rd row종이컵생산
4th row목재 이용한 가구제작
5th row농산물가공 및 위탁판매
ValueCountFrequency (%)
13
 
12.7%
판매 5
 
4.9%
제조 3
 
2.9%
제작 3
 
2.9%
도시락제조 2
 
2.0%
대안교육 2
 
2.0%
교육 2
 
2.0%
전자상거래 2
 
2.0%
행사 2
 
2.0%
빈집 2
 
2.0%
Other values (65) 66
64.7%
2023-12-12T23:32:31.939646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
63
 
16.0%
14
 
3.6%
13
 
3.3%
13
 
3.3%
10
 
2.5%
9
 
2.3%
8
 
2.0%
8
 
2.0%
7
 
1.8%
6
 
1.5%
Other values (124) 243
61.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 329
83.5%
Space Separator 63
 
16.0%
Open Punctuation 1
 
0.3%
Close Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
4.3%
13
 
4.0%
13
 
4.0%
10
 
3.0%
9
 
2.7%
8
 
2.4%
8
 
2.4%
7
 
2.1%
6
 
1.8%
6
 
1.8%
Other values (121) 235
71.4%
Space Separator
ValueCountFrequency (%)
63
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 329
83.5%
Common 65
 
16.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
4.3%
13
 
4.0%
13
 
4.0%
10
 
3.0%
9
 
2.7%
8
 
2.4%
8
 
2.4%
7
 
2.1%
6
 
1.8%
6
 
1.8%
Other values (121) 235
71.4%
Common
ValueCountFrequency (%)
63
96.9%
( 1
 
1.5%
) 1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 329
83.5%
ASCII 65
 
16.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
63
96.9%
( 1
 
1.5%
) 1
 
1.5%
Hangul
ValueCountFrequency (%)
14
 
4.3%
13
 
4.0%
13
 
4.0%
10
 
3.0%
9
 
2.7%
8
 
2.4%
8
 
2.4%
7
 
2.1%
6
 
1.8%
6
 
1.8%
Other values (121) 235
71.4%

인증여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size171.0 B
True
33 
False
ValueCountFrequency (%)
True 33
84.6%
False 6
 
15.4%
2023-12-12T23:32:32.092342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

예비여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size171.0 B
False
33 
True
ValueCountFrequency (%)
False 33
84.6%
True 6
 
15.4%
2023-12-12T23:32:32.201908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct32
Distinct (%)82.1%
Missing0
Missing (%)0.0%
Memory size444.0 B
Minimum2007-10-29 00:00:00
Maximum2023-06-15 00:00:00
2023-12-12T23:32:32.345563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:32:32.499502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)

인증번호
Text

UNIQUE 

Distinct39
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size444.0 B
2023-12-12T23:32:32.740494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/