Overview

Dataset statistics

Number of variables13
Number of observations52
Missing cells2
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.6 KiB
Average record size in memory110.5 B

Variable types

Numeric4
Categorical2
DateTime2
Text5

Alerts

상태정보 has constant value ""Constant
데이터기준일 has constant value ""Constant
대지면적 is highly overall correlated with 건물면적 and 1 other fieldsHigh correlation
건물면적 is highly overall correlated with 대지면적 and 1 other fieldsHigh correlation
정비형태 is highly overall correlated with 대지면적 and 1 other fieldsHigh correlation
정비형태 is highly imbalanced (66.8%)Imbalance
대지면적 has 1 (1.9%) missing valuesMissing
건물면적 has 1 (1.9%) missing valuesMissing
순번 has unique valuesUnique
관리사업등록번호 has unique valuesUnique
사업자 상호(명칭) has unique valuesUnique
사업장주소 has unique valuesUnique
대표자명 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:10:04.061724
Analysis finished2024-01-09 22:10:05.957383
Duration1.9 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.5
Minimum1
Maximum52
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size600.0 B
2024-01-10T07:10:06.017493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.55
Q113.75
median26.5
Q339.25
95-th percentile49.45
Maximum52
Range51
Interquartile range (IQR)25.5

Descriptive statistics

Standard deviation15.154757
Coefficient of variation (CV)0.57187763
Kurtosis-1.2
Mean26.5
Median Absolute Deviation (MAD)13
Skewness0
Sum1378
Variance229.66667
MonotonicityStrictly increasing
2024-01-10T07:10:06.122437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.9%
28 1
 
1.9%
30 1
 
1.9%
31 1
 
1.9%
32 1
 
1.9%
33 1
 
1.9%
34 1
 
1.9%
35 1
 
1.9%
36 1
 
1.9%
37 1
 
1.9%
Other values (42) 42
80.8%
ValueCountFrequency (%)
1 1
1.9%
2 1
1.9%
3 1
1.9%
4 1
1.9%
5 1
1.9%
6 1
1.9%
7 1
1.9%
8 1
1.9%
9 1
1.9%
10 1
1.9%
ValueCountFrequency (%)
52 1
1.9%
51 1
1.9%
50 1
1.9%
49 1
1.9%
48 1
1.9%
47 1
1.9%
46 1
1.9%
45 1
1.9%
44 1
1.9%
43 1
1.9%

상태정보
Categorical

CONSTANT 

Distinct1
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size548.0 B
영업
52 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 52
100.0%

Length

2024-01-10T07:10:06.219009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:10:06.295474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 52
100.0%
Distinct45
Distinct (%)86.5%
Missing0
Missing (%)0.0%
Memory size548.0 B
Minimum1994-10-11 00:00:00
Maximum2016-01-21 00:00:00
2024-01-10T07:10:06.395022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:10:06.530877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size548.0 B
2024-01-10T07:10:06.727408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length14
Min length14

Characters and Unicode

Total characters728
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row01-4426-000080
2nd row01-4426-000081
3rd row01-4426-000041
4th row01-4426-000037
5th row01-4426-000042
ValueCountFrequency (%)
01-4426-000080 1
 
1.9%
01-4426-000081 1
 
1.9%
01-4426-000040 1
 
1.9%
01-4426-000060 1
 
1.9%
01-4426-000061 1
 
1.9%
01-4426-000062 1
 
1.9%
01-4426-000058 1
 
1.9%
01-4426-000065 1
 
1.9%
01-4426-000064 1
 
1.9%
01-4426-000066 1
 
1.9%
Other values (42) 42
80.8%
2024-01-10T07:10:07.001602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 266
36.5%
4 118
16.2%
- 104
 
14.3%
6 66
 
9.1%
2 60
 
8.2%
1 58
 
8.0%
7 17
 
2.3%
8 13
 
1.8%
5 13
 
1.8%
3 8
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 624
85.7%
Dash Punctuation 104
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 266
42.6%
4 118
18.9%
6 66
 
10.6%
2 60
 
9.6%
1 58
 
9.3%
7 17
 
2.7%
8 13
 
2.1%
5 13
 
2.1%
3 8
 
1.3%
9 5
 
0.8%
Dash Punctuation
ValueCountFrequency (%)
- 104
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 728
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 266
36.5%
4 118
16.2%
- 104
 
14.3%
6 66
 
9.1%
2 60
 
8.2%
1 58
 
8.0%
7 17
 
2.3%
8 13
 
1.8%
5 13
 
1.8%
3 8
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 728
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 266
36.5%
4 118
16.2%
- 104
 
14.3%
6 66
 
9.1%
2 60
 
8.2%
1 58
 
8.0%
7 17
 
2.3%
8 13
 
1.8%
5 13
 
1.8%
3 8
 
1.1%
Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size548.0 B
2024-01-10T07:10:07.200409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11.5
Mean length6.9423077
Min length3

Characters and Unicode

Total characters361
Distinct characters96
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row반도자동차공업사
2nd row태안 현대서비스
3rd row안면자동차공업사
4th row태안기아서비스
5th row평천동양카독크
ValueCountFrequency (%)
태안점 2
 
3.3%
현대자동차 2
 
3.3%
차사랑자동차정비공업사 1
 
1.6%
차사랑카센타 1
 
1.6%
보광카센타 1
 
1.6%
천일카센타 1
 
1.6%
스피드카 1
 
1.6%
성윤카 1
 
1.6%
창기카센타 1
 
1.6%
홍반장카센타 1
 
1.6%
Other values (49) 49
80.3%
2024-01-10T07:10:07.501267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
 
7.8%
22
 
6.1%
20
 
5.5%
18
 
5.0%
16
 
4.4%
14
 
3.9%
14
 
3.9%
11
 
3.0%
10
 
2.8%
9
 
2.5%
Other values (86) 199
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 350
97.0%
Space Separator 9
 
2.5%
Other Punctuation 2
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
 
8.0%
22
 
6.3%
20
 
5.7%
18
 
5.1%
16
 
4.6%
14
 
4.0%
14
 
4.0%
11
 
3.1%
10
 
2.9%
8
 
2.3%
Other values (84) 189
54.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Other Punctuation
ValueCountFrequency (%)
? 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 350
97.0%
Common 11
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
 
8.0%
22
 
6.3%
20
 
5.7%
18
 
5.1%
16
 
4.6%
14
 
4.0%
14
 
4.0%
11
 
3.1%
10
 
2.9%
8
 
2.3%
Other values (84) 189
54.0%
Common
ValueCountFrequency (%)
9
81.8%
? 2
 
18.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 350
97.0%
ASCII 11
 
3.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
28
 
8.0%
22
 
6.3%
20
 
5.7%
18
 
5.1%
16
 
4.6%
14
 
4.0%
14
 
4.0%
11
 
3.1%
10
 
2.9%
8
 
2.3%
Other values (84) 189
54.0%
ASCII
ValueCountFrequency (%)
9
81.8%
? 2
 
18.2%

사업장주소
Text

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size548.0 B
2024-01-10T07:10:07.695676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length16.519231
Min length15

Characters and Unicode

Total characters859
Distinct characters59
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row 태안군 태안읍 환동로 11
2nd row 태안군 태안읍 원이로 79
3rd row 태안군 안면읍 장터로 181-5
4th row 태안군 태안읍 서해로 2068-16
5th row 태안군 태안읍 서해로 2028
ValueCountFrequency (%)
태안군 52
24.9%
태안읍 37
17.7%
동백로 8
 
3.8%
중앙로 7
 
3.3%
안면읍 7
 
3.3%
장터로 5
 
2.4%
원이로 4
 
1.9%
환동로 4
 
1.9%
근흥면 3
 
1.4%
서해로 3
 
1.4%
Other values (73) 79
37.8%
2024-01-10T07:10:07.987468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/