Overview

Dataset statistics

Number of variables: 9
Number of observations: 75
Missing cells: 1
Missing cells (%): 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.5 KiB
Average record size in memory: 75.8 B
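
The missing-cell percentage follows from the counts in the table above. The sketch below is only an illustrative check: the variable names are hypothetical, and it assumes the percentage is the number of missing cells divided by variables times observations.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 9
n_observations = 75
missing_cells = 1

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of cells that are missing

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

A single missing cell out of 675 is roughly 0.15%, which the report shows as 0.1%.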

Variable types

Numeric: 2
Categorical: 2
Text: 4
DateTime: 1

Dataset

Description: Details of the tenant companies in the agro-industrial complexes of Geumsan-gun (company name, address, telephone number, factory registration date, etc.).
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=396&beforeMenuCd=DOM_000000201001001000&publicdatapk=15028982

Alerts

데이터기준일자 (data reference date) has constant value "" [Constant]
순번 (sequence number) is highly overall correlated with 산업단지명 (industrial complex name) [High correlation]
종업원수 (number of employees) is highly overall correlated with 산업단지명 [High correlation]
산업단지명 is highly overall correlated with 순번 and 1 other field [High correlation]
종업원수 has 1 (1.3%) missing value [Missing]
순번 has unique values [Unique]
회사명 (company name) has unique values [Unique]

Reproduction

Analysis started: 2024-01-09 21:25:39.883268
Analysis finished: 2024-01-09 21:25:40.913606
Duration: 1.03 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 75
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 38
Minimum: 1
Maximum: 75
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 807.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.7
Q1: 19.5
Median: 38
Q3: 56.5
95-th percentile: 71.3
Maximum: 75
Range: 74
Interquartile range (IQR): 37

Descriptive statistics

Standard deviation: 21.794495
Coefficient of variation (CV): 0.57353933
Kurtosis: -1.2
Mean: 38
Median Absolute Deviation (MAD): 19
Skewness: 0
Sum: 2850
Variance: 475
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
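
The quantile and descriptive statistics for 순번 can be reproduced with the Python standard library. This is a rough sketch, not the profiler's own code. It assumes the column holds the integers 1 to 75, as the minimum, maximum, distinct count and monotonicity suggest. The choice of sample variance and of the "inclusive" quantile method is also an assumption, made because those settings agree with the reported figures.

```python
import math
import statistics

# Assumption: 순번 is simply the integers 1..75.
values = list(range(1, 76))

mean = statistics.mean(values)
variance = statistics.variance(values)      # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean                             # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(v - median) for v in values)

# Linear-interpolation quantiles that include both end points.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

print("sum:", sum(values), "mean:", mean, "variance:", variance)
print("std:", round(std, 6), "cv:", round(cv, 8), "mad:", mad)
print("q1:", q1, "q3:", q3, "iqr:", q3 - q1, "p5:", p5, "p95:", p95)
```

Skewness and kurtosis are left out because the standard library has no direct function for them.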

Common values

Value              Count  Frequency (%)
1                  1      1.3%
49                 1      1.3%
56                 1      1.3%
55                 1      1.3%
54                 1      1.3%
53                 1      1.3%
52                 1      1.3%
51                 1      1.3%
50                 1      1.3%
48                 1      1.3%
Other values (65)  65     86.7%

Minimum 10 values

Value  Count  Frequency (%)
1      1      1.3%
2      1      1.3%
3      1      1.3%
4      1      1.3%
5      1      1.3%
6      1      1.3%
7      1      1.3%
8      1      1.3%
9      1      1.3%
10     1      1.3%

Maximum 10 values

Value  Count  Frequency (%)
75     1      1.3%
74     1      1.3%
73     1      1.3%
72     1      1.3%
71     1      1.3%
70     1      1.3%
69     1      1.3%
68     1      1.3%
67     1      1.3%
66     1      1.3%

산업단지명
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 732.0 B

추부농공단지: 32
금성농공단지: 22
복수농공단지: 14
인삼약초특화농공단지: 6
금산일반산업단지: 1

Length

Max length: 10
Median length: 6
Mean length: 6.3466667
Min length: 6

Unique

Unique: 1
Unique (%): 1.3%

Sample

1st row: 금산일반산업단지
2nd row: 금성농공단지
3rd row: 금성농공단지
4th row: 금성농공단지
5th row: 금성농공단지

Common Values

Value                 Count  Frequency (%)
추부농공단지          32     42.7%
금성농공단지          22     29.3%
복수농공단지          14     18.7%
인삼약초특화농공단지  6      8.0%
금산일반산업단지      1      1.3%
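
The frequencies, the "Unique" figure and the mean category length for 산업단지명 all derive from the five counts in the table above. The following sketch recomputes them. It is illustrative only: the variable names are hypothetical, and length is assumed to mean the number of Unicode code points in each category name.

```python
# Counts copied from the "Common Values" table of 산업단지명.
counts = {
    "추부농공단지": 32,
    "금성농공단지": 22,
    "복수농공단지": 14,
    "인삼약초특화농공단지": 6,
    "금산일반산업단지": 1,
}

total = sum(counts.values())  # number of observations

# Relative frequency of each category, formatted to one decimal place.
shares = {name: f"{100 * n / total:.1f}%" for name, n in counts.items()}

# Categories that occur exactly once drive the "Unique" figure.
singletons = [name for name, n in counts.items() if n == 1]

# Mean string length, weighted by how often each category occurs.
mean_length = sum(len(name) * n for name, n in counts.items()) / total

for name, share in shares.items():
    print(name, share)
print("unique:", len(singletons), f"({100 * len(singletons) / total:.1f}%)")
print("mean length:", round(mean_length, 7))
```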

Length

Histogram of lengths of the category

Common Values (Plot)


회사명
Text

UNIQUE 

Distinct: 75
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 732.0 B

Length

Max length: 12
Median length: 10
Mean length: 6.1466667
Min length: 3

Characters and Unicode

Total characters: 461
Distinct characters: 153
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
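
As a minimal illustration of that idea, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. It runs only on the five sample rows of 회사명 shown in this report, so its totals will not match the full-column figures. The helper name `category_counts` is hypothetical. The two-letter codes correspond to the category names used in the report: `Lo` is Other Letter, `So` is Other Symbol and `Zs` is Space Separator.

```python
import unicodedata
from collections import Counter

# Sample values copied from the first five rows of 회사명 in this report.
samples = [
    "한국타이어㈜ 금산공장",
    "태형산업㈜",
    "한국생약영농조합법인",
    "㈜한영계기",
    "㈜부광케미컬",
]

def category_counts(texts):
    """Count characters by Unicode general category (e.g. 'Lo', 'So')."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)

counts = category_counts(samples)
total = sum(counts.values())

# Print each category with its share of all characters in the samples.
for category, n in counts.most_common():
    print(f"{category}: {n} ({100 * n / total:.1f}%)")
```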

Unique

Unique: 75
Unique (%): 100.0%

Sample

1st row: 한국타이어㈜ 금산공장
2nd row: 태형산업㈜
3rd row: 한국생약영농조합법인
4th row: ㈜한영계기
5th row: ㈜부광케미컬

Value              Count  Frequency (%)
농업회사법인       2      2.4%
한국타이어㈜       1      1.2%
주)b               1      1.2%
주)이엑스쏠라      1      1.2%
주)미래기전        1      1.2%
주)더드림솔루션    1      1.2%
주)에스코알티에스  1      1.2%
진테크             1      1.2%
c                  1      1.2%
d                  1      1.2%
Other values (71)  71     86.6%

Most occurring characters

Value               Count  Frequency (%)
[?]                 37     8.0%
[?]                 23     5.0%
[?]                 21     4.6%
)                   19     4.1%
[?]                 18     3.9%
[?]                 11     2.4%
[?]                 11     2.4%
[?]                 9      2.0%
[?]                 9      2.0%
[?]                 8      1.7%
Other values (143)  295    64.0%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       382    82.9%
Other Symbol       37     8.0%
Close Punctuation  19     4.1%
Uppercase Letter   9      2.0%
Space Separator    7      1.5%
Open Punctuation   6      1.3%
Other Punctuation  1      0.2%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
[?]                 23     6.0%
[?]                 21     5.5%
[?]                 18     4.7%
[?]                 11     2.9%
[?]                 11     2.9%
[?]                 9      2.4%
[?]                 9      2.4%
[?]                 8      2.1%
[?]                 8      2.1%
[?]                 7      1.8%
Other values (130)  257    67.3%

Uppercase Letter
Value  Count  Frequency (%)
D      2      22.2%
J      1      11.1%
S      1      11.1%
C      1      11.1%
B      1      11.1%
P      1      11.1%
E      1      11.1%
I      1      11.1%

Other Symbol
Value  Count  Frequency (%)
[?]    37     100.0%

Close Punctuation
Value  Count  Frequency (%)
)      19     100.0%

Space Separator
Value    Count  Frequency (%)
(space)  7      100.0%

Open Punctuation
Value  Count  Frequency (%)
(      6      100.0%

Other Punctuation
Value  Count  Frequency (%)
&      1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  419    90.9%
Common  33     7.2%
Latin   9      2.0%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
[?]                 37     8.8%
[?]                 23     5.5%
[?]                 21     5.0%
[?]                 18     4.3%
[?]                 11     2.6%
[?]                 11     2.6%
[?]                 9      2.1%
[?]                 9      2.1%
[?]                 8      1.9%
[?]                 8      1.9%
Other values (131)  264    63.0%

Latin
Value  Count  Frequency (%)
D      2      22.2%
J      1      11.1%
S      1      11.1%
C      1      11.1%
B      1      11.1%
P      1      11.1%
E      1      11.1%
I      1      11.1%

Common
Value    Count  Frequency (%)
)        19     57.6%
(space)  7      21.2%
(        6      18.2%
&        1      3.0%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  382    82.9%
ASCII   42     9.1%
None    37     8.0%

Most frequent character per block

None
Value  Count  Frequency (%)
[?]    37     100.0%

Hangul
Value               Count  Frequency (%)
[?]                 23     6.0%
[?]                 21     5.5%
[?]                 18     4.7%
[?]                 11     2.9%
[?]                 11     2.9%
[?]                 9      2.4%
[?]                 9      2.4%
[?]                 8      2.1%
[?]                 8      2.1%
[?]                 7      1.8%
Other values (130)  257    67.3%

ASCII
Value             Count  Frequency (%)
)                 19     45.2%
(space)           7      16.7%
(                 6      14.3%
D                 2      4.8%
J                 1      2.4%
S                 1      2.4%
C                 1      2.4%
B                 1      2.4%
P                 1      2.4%
E                 1      2.4%
Other values (2)  2      4.8%

Distinct: 71
Distinct (%): 94.7%
Missing: 0
Missing (%): 0.0%
Memory size: 732.0 B
Minimum: 1988-08-15 00:00:00
Maximum: 2022-08-22 00:00:00
Histogram with fixed size bins (bins=50)

Distinct: 74
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Memory size: 732.0 B

Length

Max length: 13
Median length: 12
Mean length: 11.973333
Min length: 9

Characters and Unicode

Total characters: 898
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 73
Unique (%): 97.3%

Sample

1st row: 041-750-5101
2nd row: 041-754-6672
3rd row: 041-751-4400
4th row: 041-751-1051
5th row: 041-751-3205

Value              Count  Frequency (%)
041-752-1029       2      2.7%
070-4038-7187      1      1.3%
041-752-0399       1      1.3%
041-751-4803       1      1.3%
041-754-0501       1      1.3%
041-751-6262       1      1.3%
041-752-1022       1      1.3%
041-751-1470       1      1.3%
041-751-9599       1      1.3%
041-754-7041       1      1.3%
Other values (64)  64     85.3%

Most occurring characters

Value  Count  Frequency (%)
-      148    16.5%
0      140    15.6%
1      130    14.5%
4      114    12.7%
7      106    11.8%
5      101    11.2%
2      49     5.5%
3      35     3.9%
9      29     3.2%
8      26     2.9%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    750    83.5%
Dash Punctuation  148    16.5%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      140    18.7%
1      130    17.3%
4      114    15.2%
7      106    14.1%
5      101    13.5%
2      49     6.5%
3      35     4.7%
9      29     3.9%
8      26     3.5%
6      20     2.7%

Dash Punctuation
Value  Count  Frequency (%)
-      148    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  898    100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
-      148    16.5%
0      140    15.6%
1      130    14.5%
4      114    12.7%
7      106    11.8%
5      101    11.2%
2      49     5.5%
3      35     3.9%
9      29     3.2%
8      26     2.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  898    100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
-      148    16.5%
0      140    15.6%
1      130    14.5%
4      114    12.7%
7      106    11.8%
5      101    11.2%
2      49     5.5%
3      35     3.9%
9      29     3.2%
8      26     2.9%

종업원수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 38
Distinct (%): 51.4%
Missing: 1
Missing (%): 1.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 58.391892
Minimum: 1
Maximum: 2978
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 807.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 8
Median: 16
Q3: 24.75
95-th percentile: 47.05
Maximum: 2978
Range: 2977
Interquartile range (IQR): 16.75

Descriptive statistics

Standard deviation: 344.41234
Coefficient of variation (CV): 5.8982904
Kurtosis: 73.673242
Mean: 58.391892
Median Absolute Deviation (MAD): 8.5
Skewness: 8.5744516
Sum: 4321
Variance: 118619.86
Monotonicity: Not monotonic
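
The gap between the mean (about 58) and the median (16) of 종업원수 comes from a single very large value. The sketch below works through the arithmetic using only figures reported above. The variable names are hypothetical. The maximum of 2978 must occur exactly once, because two such rows would already exceed the reported sum of 4321.

```python
# Figures copied from the 종업원수 statistics in this report.
n_observations = 75
n_missing = 1
total_employees = 4321     # "Sum"
largest = 2978             # "Maximum"
std = 344.41234            # "Standard deviation"

n_valid = n_observations - n_missing      # rows with a recorded value
mean = total_employees / n_valid          # agrees with the reported mean
cv = std / mean                           # coefficient of variation

# Mean of the remaining rows once the single largest value is set aside.
mean_without_largest = (total_employees - largest) / (n_valid - 1)

# Share of all recorded employees accounted for by the largest value.
largest_share = largest / total_employees

print(f"mean: {mean:.6f}")
print(f"cv: {cv:.4f}")
print(f"mean without largest value: {mean_without_largest:.2f}")
print(f"share of largest value: {largest_share:.1%}")
```

With the largest value set aside, the mean of the other 73 rows falls to roughly 18, close to the median. This is consistent with the high skewness and kurtosis reported for this variable.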