Overview

Dataset statistics

Number of variables: 33
Number of observations: 229
Missing cells: 2605
Missing cells (%): 34.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 63.6 KiB
Average record size in memory: 284.6 B
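
The missing-cell percentage follows from the three counts reported above. The sketch below recomputes it; the variable names are illustrative and do not come from the report.

```python
# Recompute the share of missing cells from the reported counts.
n_variables = 33
n_observations = 229
missing_cells = 2605

total_cells = n_variables * n_observations       # every cell in the table
missing_pct = 100 * missing_cells / total_cells  # share of cells that are empty

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Roughly a third of the table is empty, so most of the missing cells should be concentrated in a handful of columns rather than spread evenly.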

Variable types

Numeric: 11
Categorical: 7
Unsupported: 8
Text: 6
DateTime: 1

Dataset

Description: 6270000_대구광역시_09_28_14_P_특정고압가스업_12월
Author: 대구광역시
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=DMI_0000083399&dataSetDetailId=DDI_0000083459&provdMethod=FILE

Alerts

개방서비스명 has constant value "특정고압가스업" (Constant)
개방서비스ID has constant value "09_28_14_P" (Constant)
인허가취소일자 has 229 (100.0%) missing values (Missing)
폐업일자 has 205 (89.5%) missing values (Missing)
휴업시작일자 has 229 (100.0%) missing values (Missing)
휴업종료일자 has 229 (100.0%) missing values (Missing)
재개업일자 has 229 (100.0%) missing values (Missing)
소재지전화 has 158 (69.0%) missing values (Missing)
소재지면적 has 229 (100.0%) missing values (Missing)
소재지우편번호 has 229 (100.0%) missing values (Missing)
소재지전체주소 has 37 (16.2%) missing values (Missing)
도로명전체주소 has 48 (21.0%) missing values (Missing)
도로명우편번호 has 159 (69.4%) missing values (Missing)
업태구분명 has 229 (100.0%) missing values (Missing)
좌표정보(X) has 42 (18.3%) missing values (Missing)
좌표정보(Y) has 42 (18.3%) missing values (Missing)
사용목적 has 19 (8.3%) missing values (Missing)
사용방법 has 30 (13.1%) missing values (Missing)
월사용량 has 8 (3.5%) missing values (Missing)
수용정원수 has 25 (10.9%) missing values (Missing)
시설사용여부 has 229 (100.0%) missing values (Missing)
번호 has unique values (Unique)
관리번호 has unique values (Unique)
인허가취소일자 is an unsupported type; check if it needs cleaning or further analysis (Unsupported)
휴업시작일자 is an unsupported type; check if it needs cleaning or further analysis (Unsupported)
휴업종료일자 is an unsupported type; check if it needs cleaning or further analysis (Unsupported)
재개업일자 is an unsupported type; check if it needs cleaning or further analysis (Unsupported)
소재지면적 is an unsupported type; check if it needs cleaning or further analysis (Unsupported)
소재지우편번호 is an unsupported type; check if it needs cleaning or further analysis (Unsupported)
업태구분명 is an unsupported type; check if it needs cleaning or further analysis (Unsupported)
시설사용여부 is an unsupported type; check if it needs cleaning or further analysis (Unsupported)
수용정원수 has 6 (2.6%) zeros (Zeros)
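
The per-column missing counts in the alerts can be cross-checked against the overall figure of 2605 missing cells. The sketch below copies the counts from the alerts into a dictionary (the dictionary and variable names are illustrative) and lists the columns that are empty in every row, which are candidates for dropping before analysis.

```python
# Missing-value counts copied from the alerts (column name -> missing rows).
missing_by_column = {
    "인허가취소일자": 229, "폐업일자": 205, "휴업시작일자": 229,
    "휴업종료일자": 229, "재개업일자": 229, "소재지전화": 158,
    "소재지면적": 229, "소재지우편번호": 229, "소재지전체주소": 37,
    "도로명전체주소": 48, "도로명우편번호": 159, "업태구분명": 229,
    "좌표정보(X)": 42, "좌표정보(Y)": 42, "사용목적": 19,
    "사용방법": 30, "월사용량": 8, "수용정원수": 25, "시설사용여부": 229,
}
N_ROWS = 229

# Columns with no observed value at all.
fully_missing = [col for col, n in missing_by_column.items() if n == N_ROWS]
total_missing = sum(missing_by_column.values())

print(len(fully_missing), "fully missing columns:", fully_missing)
print("total missing cells:", total_missing)
```

The eight fully missing columns are the same eight that the alerts flag as Unsupported, and together they account for 1832 of the missing cells.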

Reproduction

Analysis started: 2024-04-21 23:58:39.014974
Analysis finished: 2024-04-21 23:58:39.647228
Duration: 0.63 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 229
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 115
Minimum: 1
Maximum: 229
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 12.4
Q1: 58
median: 115
Q3: 172
95-th percentile: 217.6
Maximum: 229
Range: 228
Interquartile range (IQR): 114

Descriptive statistics

Standard deviation: 66.250786
Coefficient of variation (CV): 0.57609379
Kurtosis: -1.2
Mean: 115
Median Absolute Deviation (MAD): 57
Skewness: 0
Sum: 26335
Variance: 4389.1667
Monotonicity: Strictly increasing
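
Because 번호 is a gap-free row counter from 1 to 229, its descriptive statistics can be reproduced exactly. The sketch below uses Python's `statistics` module; it is a sanity check rather than the profiler's own code. The reported variance matches the sample formula (denominator n - 1).

```python
import statistics

values = list(range(1, 230))  # 1, 2, ..., 229

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, divides by n - 1
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation
total = sum(values)

print(mean, total, round(variance, 4), round(std, 6), round(cv, 8))
```

These numbers describe the row index only and carry no information about the facilities themselves.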
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
1 | 1 | 0.4%
145 | 1 | 0.4%
147 | 1 | 0.4%
148 | 1 | 0.4%
149 | 1 | 0.4%
150 | 1 | 0.4%
151 | 1 | 0.4%
152 | 1 | 0.4%
153 | 1 | 0.4%
154 | 1 | 0.4%
Other values (219) | 219 | 95.6%

Value | Count | Frequency (%)
1 | 1 | 0.4%
2 | 1 | 0.4%
3 | 1 | 0.4%
4 | 1 | 0.4%
5 | 1 | 0.4%
6 | 1 | 0.4%
7 | 1 | 0.4%
8 | 1 | 0.4%
9 | 1 | 0.4%
10 | 1 | 0.4%

Value | Count | Frequency (%)
229 | 1 | 0.4%
228 | 1 | 0.4%
227 | 1 | 0.4%
226 | 1 | 0.4%
225 | 1 | 0.4%
224 | 1 | 0.4%
223 | 1 | 0.4%
222 | 1 | 0.4%
221 | 1 | 0.4%
220 | 1 | 0.4%

개방서비스명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
특정고압가스업: 229

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 특정고압가스업
2nd row: 특정고압가스업
3rd row: 특정고압가스업
4th row: 특정고압가스업
5th row: 특정고압가스업

Common Values

Value | Count | Frequency (%)
특정고압가스업 | 229 | 100.0%

개방서비스ID
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
09_28_14_P: 229

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 09_28_14_P
2nd row: 09_28_14_P
3rd row: 09_28_14_P
4th row: 09_28_14_P
5th row: 09_28_14_P

Common Values

Value | Count | Frequency (%)
09_28_14_P | 229 | 100.0%

개방자치단체코드
Real number (ℝ)

Distinct: 8
Distinct (%): 3.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3454235.8
Minimum: 3410000
Maximum: 3480000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 3410000
5-th percentile: 3420000
Q1: 3430000
median: 3470000
Q3: 3470000
95-th percentile: 3480000
Maximum: 3480000
Range: 70000
Interquartile range (IQR): 40000

Descriptive statistics

Standard deviation: 22844.945
Coefficient of variation (CV): 0.0066136032
Kurtosis: -1.0770772
Mean: 3454235.8
Median Absolute Deviation (MAD): 10000
Skewness: -0.64962187
Sum: 7.9102 × 10^8
Variance: 5.2189152 × 10^8
Monotonicity: Increasing

Histogram with fixed size bins (bins=8)

Value | Count | Frequency (%)
3470000 | 81 | 35.4%
3420000 | 41 | 17.9%
3480000 | 35 | 15.3%
3450000 | 29 | 12.7%
3460000 | 18 | 7.9%
3410000 | 10 | 4.4%
3430000 | 10 | 4.4%
3440000 | 5 | 2.2%

Value | Count | Frequency (%)
3410000 | 10 | 4.4%
3420000 | 41 | 17.9%
3430000 | 10 | 4.4%
3440000 | 5 | 2.2%
3450000 | 29 | 12.7%
3460000 | 18 | 7.9%
3470000 | 81 | 35.4%
3480000 | 35 | 15.3%

Value | Count | Frequency (%)
3480000 | 35 | 15.3%
3470000 | 81 | 35.4%
3460000 | 18 | 7.9%
3450000 | 29 | 12.7%
3440000 | 5 | 2.2%
3430000 | 10 | 4.4%
3420000 | 41 | 17.9%
3410000 | 10 | 4.4%
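
The reported Mean and Sum for 개방자치단체코드 can be rebuilt from the frequency table alone, which confirms that the table is complete. The sketch below does that with plain Python; the `counts` dictionary is copied from the table. Since the values are local-government codes rather than measurements, the mode is probably the more useful summary, and treating the column as categorical may be preferable (an interpretation, not something the report states).

```python
# Frequency table for 개방자치단체코드 (code -> number of rows).
counts = {
    3470000: 81, 3420000: 41, 3480000: 35, 3450000: 29,
    3460000: 18, 3410000: 10, 3430000: 10, 3440000: 5,
}

n = sum(counts.values())                              # number of rows covered
total = sum(code * k for code, k in counts.items())   # weighted sum of the codes
mean = total / n
mode = max(counts, key=counts.get)                    # most frequent code

print(n, total, round(mean, 1), mode)
```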

관리번호
Real number (ℝ)

UNIQUE 

Distinct: 229
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.011376 × 10^18
Minimum: 1.979342 × 10^18
Maximum: 2.019348 × 10^18
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 1.979342 × 10^18
5-th percentile: 2.000347 × 10^18
Q1: 2.003348 × 10^18
median: 2.014345 × 10^18
Q3: 2.018341 × 10^18
95-th percentile: 2.019347 × 10^18
Maximum: 2.019348 × 10^18
Range: 4.0006031 × 10^16
Interquartile range (IQR): 1.4993005 × 10^16

Descriptive statistics

Standard deviation: 7.2555698 × 10^15
Coefficient of variation (CV): 0.0036072667
Kurtosis: 0.074832843
Mean: 2.011376 × 10^18
Median Absolute Deviation (MAD): 4.9969813 × 10^15
Skewness: -0.72459413
Sum: -5.6349682 × 10^17
Variance: 5.2643293 × 10^31
Monotonicity: Not monotonic
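
The reported Sum for 관리번호 is negative even though every value is positive. One plausible explanation, which the report itself does not confirm, is signed 64-bit overflow: 229 values of about 2 × 10^18 add up to far more than the int64 maximum of roughly 9.22 × 10^18. The sketch below emulates that wraparound with Python's arbitrary-precision integers, using the rounded Mean as a stand-in for the individual values; `wrap_int64` is a hypothetical helper written for this illustration.

```python
INT64_MOD = 2**64
INT64_HALF = 2**63

def wrap_int64(x: int) -> int:
    """Return the value a signed 64-bit integer would hold after overflow."""
    return (x + INT64_HALF) % INT64_MOD - INT64_HALF

n = 229
approx_mean = 2_011_376 * 10**12  # reported Mean, rounded to 7 significant digits
true_sum = n * approx_mean        # exact in Python, about 4.6 × 10^20
wrapped = wrap_int64(true_sum)    # what an int64 accumulator would report

print(f"true sum    = {true_sum:.7e}")
print(f"wrapped sum = {wrapped:.7e}")
```

The wrapped value lands close to the reported Sum, with the small difference explained by the rounding of the Mean. Since 관리번호 is an identifier, reading it as a string would avoid the issue altogether.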
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
2014341010512200001 | 1 | 0.4%
2015347012412200002 | 1 | 0.4%
2016347012412200007 | 1 | 0.4%
2016347012412200006 | 1 | 0.4%
2016347012412200004 | 1 | 0.4%
2017347012412200002 | 1 | 0.4%
2016347012412200012 | 1 | 0.4%
2016347012412200010 | 1 | 0.4%
2016347012412200008 | 1 | 0.4%
2003347007502700002 | 1 | 0.4%
Other values (219) | 219 | 95.6%

Value | Count | Frequency (%)
1979342005802702015 | 1 | 0.4%
1994342005802700029 | 1 | 0.4%
2000341004602700001 | 1 | 0.4%
2000342005802700028 | 1 | 0.4%
2000347001302700001 | 1 | 0.4%
2000347001302700002 | 1 | 0.4%
2000347001302700003 | 1 | 0.4%
2000347001302700004 | 1 | 0.4%
2000347001302700005 | 1 | 0.4%
2000347001302700006 | 1 | 0.4%

Value | Count | Frequency (%)
2019348036512200004 | 1 | 0.4%
2019348036512200003 | 1 | 0.4%
2019348036512200002 | 1 | 0.4%
2019348036512200001 | 1 | 0.4%
2019347018012200012 | 1 | 0.4%
2019347018012200011 | 1 | 0.4%
2019347018012200010 | 1 | 0.4%
2019347018012200009 | 1 | 0.4%
2019347018012200008 | 1 | 0.4%
2019347018012200007 | 1 | 0.4%

인허가일자
Real number (ℝ)

Distinct: 195
Distinct (%): 85.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20109200
Minimum: 19791113
Maximum: 20191122
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 19791113
5-th percentile: 19990101
Q1: 20031231
median: 20140401
Q3: 20180220
95-th percentile: 20190723
Maximum: 20191122
Range: 400009
Interquartile range (IQR): 148989

Descriptive statistics

Standard deviation: 75275.34
Coefficient of variation (CV): 0.0037433285
Kurtosis: -0.1079976
Mean: 20109200
Median Absolute Deviation (MAD): 49688
Skewness: -0.73496866
Sum: 4.6050067 × 10^9
Variance: 5.6663768 × 10^9
Monotonicity: Not monotonic
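
인허가일자 holds dates encoded as YYYYMMDD integers, so the Range, Mean, and Variance above are computed on the integer encoding, not on calendar time. The sketch below parses the reported Minimum and Maximum into real dates and measures the span in days; `parse_yyyymmdd` is a hypothetical helper for this illustration.

```python
from datetime import date, datetime

def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 19791113 into a calendar date."""
    return datetime.strptime(str(value), "%Y%m%d").date()

first = parse_yyyymmdd(19791113)  # reported Minimum
last = parse_yyyymmdd(20191122)   # reported Maximum
span_days = (last - first).days   # elapsed calendar days between the two

print(first, last, span_days)
```

The span is about 40 years, whereas the integer Range of 400009 has no direct calendar meaning.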
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
19990101 | 13 | 5.7%
20180910 | 5 | 2.2%
20160930 | 4 | 1.7%
20190723 | 3 | 1.3%
20180911 | 3 | 1.3%
20161014 | 2 | 0.9%
20161005 | 2 | 0.9%
20190719 | 2 | 0.9%
20161024 | 2 | 0.9%
20160216 | 2 | 0.9%
Other values (185) | 191 | 83.4%

Value | Count | Frequency (%)
19791113 | 1 | 0.4%
19941024 | 1 | 0.4%
19960426 | 1 | 0.4%
19990101 | 13 | 5.7%
19990126 | 1 | 0.4%
19990202 | 1 | 0.4%
19990219 | 1 | 0.4%
19990413 | 1 | 0.4%
19991124 | 1 | 0.4%
20000412 | 1 | 0.4%

Value | Count | Frequency (%)
20191122 | 1 | 0.4%
20191029 | 1 | 0.4%
20190925 | 1 | 0.4%
20190919 | 1 | 0.4%
20190909 | 1 | 0.4%
20190905 | 1 | 0.4%
20190904 | 1 | 0.4%
20190902 | 1 | 0.4%
20190822 | 1 | 0.4%
20190809 | 1 | 0.4%

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 229
Missing (%): 100.0%
Memory size: 2.1 KiB

Distinct: 3
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
1: 135
2: 69
3: 25

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 2
3rd row: 2
4th row: 2
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 135 | 59.0%
2 | 69 | 30.1%
3 | 25 | 10.9%

영업상태명
Categorical

Distinct: 3
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
영업/정상: 135
휴업: 69
폐업: 25

Length

Max length: 5
Median length: 5
Mean length: 3.768559
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 휴업
2nd row: 휴업
3rd row: 휴업
4th row: 휴업
5th row: 영업/정상

Common Values

Value | Count | Frequency (%)
영업/정상 | 135 | 59.0%
휴업 | 69 | 30.1%
폐업 | 25 | 10.9%

Distinct: 3
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
BBBB: 135
1: 69
2: 25

Length

Max length: 4
Median length: 4
Mean length: 2.768559
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: BBBB

Common Values

Value | Count | Frequency (%)
BBBB | 135 | 59.0%
1 | 69 | 30.1%
2 | 25 | 10.9%

Distinct: 3
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
<NA>: 135
휴업처리: 69
폐업처리: 25

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 휴업처리
2nd row: 휴업처리
3rd row: 휴업처리
4th row: 휴업처리
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 135 | 59.0%
휴업처리 | 69 | 30.1%
폐업처리 | 25 | 10.9%

폐업일자
Real number (ℝ)

MISSING 

Distinct: 24
Distinct (%): 100.0%
Missing: 205
Missing (%): 89.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 20160388
Minimum: 20060207
Maximum: 20191023
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 20060207
5-th percentile: 20093873
Q1: 20161022
median: 20170265
Q3: 20181014
95-th percentile: 20190998
Maximum: 20191023
Range: 130816
Interquartile range (IQR): 19991.25

Descriptive statistics

Standard deviation: 32663.961
Coefficient of variation (CV): 0.001620205
Kurtosis: 3.265008
Mean: 20160388
Median Absolute Deviation (MAD): 10699
Skewness: -1.8237134
Sum: 4.8384931 × 10^8
Variance: 1.0669344 × 10^9
Monotonicity: Not monotonic
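
Because 폐업일자 is profiled as a number, its interpolated quantiles are not guaranteed to be real dates. The sketch below checks each reported quantile by splitting the integer into year, month, and day; `is_valid_yyyymmdd` is a hypothetical helper, and the `reported` dictionary is copied from the quantile statistics above.

```python
from datetime import date

def is_valid_yyyymmdd(value: int) -> bool:
    """Return True if the integer decodes to a real calendar date."""
    year, rest = divmod(value, 10_000)
    month, day = divmod(rest, 100)
    try:
        date(year, month, day)
    except ValueError:
        return False
    return True

reported = {
    "Minimum": 20060207, "5-th percentile": 20093873, "Q1": 20161022,
    "median": 20170265, "Q3": 20181014, "95-th percentile": 20190998,
    "Maximum": 20191023,
}
invalid = [name for name, value in reported.items() if not is_valid_yyyymmdd(value)]

print(invalid)
```

The 5-th percentile, median, and 95-th percentile decode to impossible months or days, so quantiles for this column are better computed after converting it to a date type.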
Histogram with fixed size bins (bins=24)

Value | Count | Frequency (%)
20170228 | 1 | 0.4%
20170208 | 1 | 0.4%
20181025 | 1 | 0.4%
20170405 | 1 | 0.4%
20191023 | 1 | 0.4%
20191014 | 1 | 0.4%
20181010 | 1 | 0.4%
20060207 | 1 | 0.4%
20131226 | 1 | 0.4%
20161024 | 1 | 0.4%
Other values (14) | 14 | 6.1%
(Missing) | 205 | 89.5%

Value | Count | Frequency (%)
20060207 | 1 | 0.4%
20090811 | 1 | 0.4%
20111223 | 1 | 0.4%
20131226 | 1 | 0.4%
20140716 | 1 | 0.4%
20161018 | 1 | 0.4%
20161024 | 1 | 0.4%
20161025 | 1 | 0.4%
20161026 | 1 | 0.4%
20161121 | 1 | 0.4%

Value | Count | Frequency (%)
20191023 | 1 | 0.4%
20191014 | 1 | 0.4%
20190904 | 1 | 0.4%
20190102 | 1 | 0.4%
20181226 | 1 | 0.4%
20181025 | 1 | 0.4%
20181010 | 1 | 0.4%
20180918 | 1 | 0.4%
20171220 | 1 | 0.4%
20170405 | 1 | 0.4%

휴업시작일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 229
Missing (%): 100.0%
Memory size: 2.1 KiB

휴업종료일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 229
Missing (%): 100.0%
Memory size: 2.1 KiB

재개업일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 229
Missing (%): 100.0%
Memory size: 2.1 KiB

소재지전화
Text

MISSING 

Distinct: 70
Distinct (%): 98.6%
Missing: 158
Missing (%): 69.0%
Memory size: 1.9 KiB

Length

Max length: 12
Median length: 11
Mean length: 10.915493
Min length: 3

Characters and Unicode

Total characters: 775
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
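
The category counts for 소재지전화 can be reproduced with Python's `unicodedata` module, which exposes the Unicode general category of each character (`Nd` is Decimal Number, `Zs` is Space Separator). The sketch below applies it to the five sample rows listed for this variable, not to the full column.

```python
import unicodedata
from collections import Counter

# The five sample rows shown for 소재지전화.
samples = [
    "053 2123000",
    "053 6053798",
    "053 2547801",
    "053 9826923",
    "053 9407087",
]

# Count characters by Unicode general category across all samples.
category_counts = Counter(
    unicodedata.category(ch) for row in samples for ch in row
)

print(dict(category_counts))
```

Each sample contributes ten digits and one space, which matches the two categories reported for the whole column.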

Unique

Unique: 69
Unique (%): 97.2%

Sample

1st row: 053 2123000
2nd row: 053 6053798
3rd row: 053 2547801
4th row: 053 9826923
5th row: 053 9407087

Value | Count | Frequency (%)
053 | 68 | 48.9%
5913707 | 2 | 1.4%
5838162 | 1 | 0.7%
5828955 | 1 | 0.7%
5899145 | 1 | 0.7%
5813711 | 1 | 0.7%
5816912 | 1 | 0.7%
853 | 1 | 0.7%
0533578 | 1 | 0.7%
6361771 | 1 | 0.7%
Other values (61) | 61 | 43.9%

Most occurring characters

Value | Count | Frequency (%)
0 | 151 | 19.5%
5 | 129 | 16.6%
3 | 113 | 14.6%
(space) | 68 | 8.8%
1 | 64 | 8.3%
9 | 49 | 6.3%
2 | 47 | 6.1%
7 | 41 | 5.3%
8 | 41 | 5.3%
6 | 38 | 4.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 707 | 91.2%
Space Separator | 68 | 8.8%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 151 | 21.4%
5 | 129 | 18.2%
3 | 113 | 16.0%
1 | 64 | 9.1%
9 | 49 | 6.9%
2 | 47 | 6.6%
7 | 41 | 5.8%
8 | 41 | 5.8%
6 | 38 | 5.4%
4 | 34 | 4.8%

Space Separator
Value | Count | Frequency (%)
(space) | 68 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 775 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 151 | 19.5%
5 | 129 | 16.6%
3 | 113 | 14.6%
(space) | 68 | 8.8%
1 | 64 | 8.3%
9 | 49 | 6.3%
2 | 47 | 6.1%
7 | 41 | 5.3%
8 | 41 | 5.3%
6 | 38 | 4.9%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 775 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 151 | 19.5%
5 | 129 | 16.6%
3 | 113 | 14.6%
(space) | 68 | 8.8%
1 | 64 | 8.3%
9 | 49 | 6.3%
2 | 47 | 6.1%
7 | 41 | 5.3%
8 | 41 | 5.3%
6 | 38 | 4.9%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 229
Missing (%): 100.0%
Memory size: 2.1 KiB

소재지우편번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 229
Missing (%): 100.0%
Memory size: 2.1 KiB

소재지전체주소
Text

MISSING 

Distinct: 180
Distinct (%): 93.8%
Missing: 37
Missing (%): 16.2%
Memory size: 1.9 KiB