Overview

Dataset statistics

Number of variables14
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.5 KiB
Average record size in memory119.3 B

Variable types

Categorical12
Numeric2

Alerts

지역화폐명 has constant value ""Constant
이용년월 has constant value ""Constant
주중_주말 has constant value ""Constant
가맹점_업종 대분류 is highly overall correlated with 가맹점_업종 소분류High correlation
고객거주지_읍면동 is highly overall correlated with 고객 거주지_시군구 and 2 other fieldsHigh correlation
가맹점_시군구 is highly overall correlated with 고객 거주지_시군구 and 2 other fieldsHigh correlation
가맹점_업종 소분류 is highly overall correlated with 연령대 and 1 other fieldsHigh correlation
고객 거주지_시군구 is highly overall correlated with 고객거주지_읍면동 and 2 other fieldsHigh correlation
가맹점_읍면동 is highly overall correlated with 고객 거주지_시군구 and 2 other fieldsHigh correlation
연령대 is highly overall correlated with 가맹점_업종 소분류High correlation
성별 is highly imbalanced (53.1%)Imbalance
가맹점_업종 대분류 is highly imbalanced (57.4%)Imbalance
가맹점_업종 소분류 is highly imbalanced (61.7%)Imbalance
이용금액 합계 has unique valuesUnique

Reproduction

Analysis started2024-03-03 19:44:33.194877
Analysis finished2024-03-03 19:44:36.079866
Duration2.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역화폐명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size368.0 B
울산페이
30 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산페이
2nd row울산페이
3rd row울산페이
4th row울산페이
5th row울산페이

Common Values

ValueCountFrequency (%)
울산페이 30
100.0%

Length

2024-03-04T04:44:36.202222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-04T04:44:36.377212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
울산페이 30
100.0%

이용년월
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size368.0 B
202308
30 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row202308
2nd row202308
3rd row202308
4th row202308
5th row202308

Common Values

ValueCountFrequency (%)
202308 30
100.0%

Length

2024-03-04T04:44:36.632605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-04T04:44:36.804409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
202308 30
100.0%

주중_주말
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size368.0 B
주중
30 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주중
2nd row주중
3rd row주중
4th row주중
5th row주중

Common Values

ValueCountFrequency (%)
주중 30
100.0%

Length

2024-03-04T04:44:36.969026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-04T04:44:37.131784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주중 30
100.0%

이용시간대
Categorical

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size368.0 B
14시-18시
18 
18시-21시
10시-14시

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row14시-18시
2nd row18시-21시
3rd row14시-18시
4th row14시-18시
5th row10시-14시

Common Values

ValueCountFrequency (%)
14시-18시 18
60.0%
18시-21시 8
26.7%
10시-14시 4
 
13.3%

Length

2024-03-04T04:44:37.298220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-04T04:44:37.470274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
14시-18시 18
60.0%
18시-21시 8
26.7%
10시-14시 4
 
13.3%

연령대
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size368.0 B
50대
18 
60대
11 
40대
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st row50대
2nd row60대
3rd row50대
4th row60대
5th row60대

Common Values

ValueCountFrequency (%)
50대 18
60.0%
60대 11
36.7%
40대 1
 
3.3%

Length

2024-03-04T04:44:37.725981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-04T04:44:37.921943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
50대 18
60.0%
60대 11
36.7%
40대 1
 
3.3%

성별
Categorical

IMBALANCE 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size368.0 B
여성
27 
남성

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row여성
2nd row여성
3rd row여성
4th row여성
5th row여성

Common Values

ValueCountFrequency (%)
여성 27
90.0%
남성 3
 
10.0%

Length

2024-03-04T04:44:38.104703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-04T04:44:38.269791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
여성 27
90.0%
남성 3
 
10.0%

고객 거주지_시군구
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size368.0 B
울산 남구
울산 울주군
울산 북구
울산 중구
울산 동구

Length

Max length6
Median length5
Mean length5.2333333
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산 중구
2nd row울산 남구
3rd row울산 남구
4th row울산 동구
5th row울산 동구

Common Values

ValueCountFrequency (%)
울산 남구 8
26.7%
울산 울주군 7
23.3%
울산 북구 6
20.0%
울산 중구 5
16.7%
울산 동구 4
13.3%

Length

2024-03-04T04:44:38.447600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-04T04:44:38.650235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
울산 30
50.0%
남구 8
 
13.3%
울주군 7
 
11.7%
북구 6
 
10.0%
중구 5
 
8.3%
동구 4
 
6.7%

고객거주지_읍면동
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)46.7%
Missing0
Missing (%)0.0%
Memory size368.0 B
범서읍
옥동
다운동
방어동
농소3동
Other values (9)
12 

Length

Max length4
Median length3
Mean length3.0666667
Min length2

Unique

Unique6 ?
Unique (%)20.0%

Sample

1st row다운동
2nd row옥동
3rd row삼산동
4th row전하2동
5th row방어동

Common Values

ValueCountFrequency (%)
범서읍 5
16.7%
옥동 4
13.3%
다운동 3
10.0%
방어동 3
10.0%
농소3동 3
10.0%
삼산동 2
 
6.7%
송정동 2
 
6.7%
언양읍 2
 
6.7%
전하2동 1
 
3.3%
반구1동 1
 
3.3%
Other values (4) 4
13.3%

Length

2024-03-04T04:44:38.953915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
범서읍 5
16.7%
옥동 4
13.3%
다운동 3
10.0%
방어동 3
10.0%
농소3동 3
10.0%
삼산동 2
 
6.7%
송정동 2
 
6.7%
언양읍 2
 
6.7%
전하2동 1
 
3.3%
반구1동 1
 
3.3%
Other values (4) 4
13.3%

가맹점_시군구
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size368.0 B
울산 남구
울산 울주군
울산 북구
울산 중구
울산 동구

Length

Max length6
Median length5
Mean length5.2333333
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산 중구
2nd row울산 남구
3rd row울산 남구
4th row울산 동구
5th row울산 동구

Common Values

ValueCountFrequency (%)
울산 남구 8
26.7%
울산 울주군 7
23.3%
울산 북구 6
20.0%
울산 중구 5
16.7%
울산 동구 4
13.3%

Length

2024-03-04T04:44:39.184072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-04T04:44:39.370143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
울산 30
50.0%
남구 8
 
13.3%
울주군 7
 
11.7%
북구 6
 
10.0%
중구 5
 
8.3%
동구 4
 
6.7%

가맹점_읍면동
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)46.7%
Missing0
Missing (%)0.0%
Memory size368.0 B
범서읍
옥동
다운동
방어동
농소3동
Other values (9)
12 

Length

Max length4
Median length3
Mean length3.0666667
Min length2

Unique

Unique6 ?
Unique (%)20.0%

Sample

1st row다운동
2nd row옥동
3rd row삼산동
4th row전하2동
5th row방어동

Common Values

ValueCountFrequency (%)
범서읍 5
16.7%
옥동 4
13.3%
다운동 3
10.0%
방어동 3
10.0%
농소3동 3
10.0%
삼산동 2
 
6.7%
송정동 2
 
6.7%
언양읍 2
 
6.7%
전하2동 1
 
3.3%
반구1동 1
 
3.3%
Other values (4) 4
13.3%

Length

2024-03-04T04:44:39.592410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
범서읍 5
16.7%
옥동 4
13.3%
다운동 3
10.0%
방어동 3
10.0%
농소3동 3
10.0%
삼산동 2
 
6.7%
송정동 2
 
6.7%
언양읍 2
 
6.7%
전하2동 1
 
3.3%
반구1동 1
 
3.3%
Other values (4) 4
13.3%

가맹점_업종 대분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size368.0 B
소매
26 
일반음식점
의료기관/제약
 
1

Length

Max length7
Median length2
Mean length2.4666667
Min length2

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st row소매
2nd row소매
3rd row소매
4th row소매
5th row소매

Common Values

ValueCountFrequency (%)
소매 26
86.7%
일반음식점 3
 
10.0%
의료기관/제약 1
 
3.3%

Length

2024-03-04T04:44:39.813944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-04T04:44:39.990844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소매 26
86.7%
일반음식점 3
 
10.0%
의료기관/제약 1
 
3.3%

가맹점_업종 소분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size368.0 B
슈퍼마켓
26 
일반음식점 기타
 
2
약국
 
1
양식
 
1

Length

Max length8
Median length4
Mean length4.1333333
Min length2

Unique

Unique2 ?
Unique (%)6.7%

Sample

1st row슈퍼마켓
2nd row슈퍼마켓
3rd row슈퍼마켓
4th row슈퍼마켓
5th row슈퍼마켓

Common Values

ValueCountFrequency (%)
슈퍼마켓 26
86.7%
일반음식점 기타 2
 
6.7%
약국 1
 
3.3%
양식 1
 
3.3%

Length

2024-03-04T04:44:40.291998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-04T04:44:40.481264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
슈퍼마켓 26
81.2%
일반음식점 2
 
6.2%
기타 2
 
6.2%
약국 1
 
3.1%
양식 1
 
3.1%

이용금액 합계
Real number (ℝ)

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean210993.17
Minimum76860
Maximum422700
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size398.0 B
2024-03-04T04:44:40.666946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum76860
5-th percentile100825
Q1172235
median202295
Q3244817.5
95-th percentile355404.25
Maximum422700
Range345840
Interquartile range (IQR)72582.5

Descriptive statistics

Standard deviation79192.866
Coefficient of variation (CV)0.3753338
Kurtosis0.76436288
Mean210993.17
Median Absolute Deviation (MAD)41400
Skewness0.675521
Sum6329795
Variance6.2715101 × 109
MonotonicityNot monotonic
2024-03-04T04:44:40.881947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
177250 1
 
3.3%
121700 1
 
3.3%
212150 1
 
3.3%
422700 1
 
3.3%
241000 1
 
3.3%
171190 1
 
3.3%
241450 1
 
3.3%
93400 1
 
3.3%
76860 1
 
3.3%
109900 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
76860 1
3.3%
93400 1
3.3%
109900 1
3.3%
110480 1
3.3%
121700 1
3.3%
141070 1
3.3%
147920 1
3.3%
171190 1
3.3%
175370 1
3.3%
177250 1
3.3%
ValueCountFrequency (%)
422700 1
3.3%
369190 1
3.3%
338555 1
3.3%
282850 1
3.3%
276620 1
3.3%
271800 1
3.3%
266150 1
3.3%
245940 1
3.3%
241450 1
3.3%
241000 1
3.3%

이용건수 합계
Real number (ℝ)

Distinct9
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.866667
Minimum12
Maximum20
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size398.0 B
2024-03-04T04:44:41.088732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12
5-th percentile12
Q113
median14
Q317.75
95-th percentile19.55
Maximum20
Range8
Interquartile range (IQR)4.75

Descriptive statistics

Standard deviation2.7131015
Coefficient of variation (CV)0.18249561
Kurtosis-1.0773695
Mean14.866667
Median Absolute Deviation (MAD)2
Skewness0.65018999
Sum446
Variance7.3609195
MonotonicityDecreasing
2024-03-04T04:44:41.395201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
13 8
26.7%
12 6
20.0%
18 4
13.3%
14 4
13.3%
20 2
 
6.7%
19 2
 
6.7%
16 2
 
6.7%
17 1
 
3.3%
15 1
 
3.3%
ValueCountFrequency (%)
12 6
20.0%
13 8
26.7%
14 4
13.3%
15 1
 
3.3%
16 2
 
6.7%
17 1
 
3.3%
18 4
13.3%
19 2
 
6.7%
20 2
 
6.7%
ValueCountFrequency (%)
20 2
 
6.7%
19 2
 
6.7%
18 4
13.3%
17 1
 
3.3%
16 2
 
6.7%
15 1
 
3.3%
14 4
13.3%
13 8
26.7%
12 6
20.0%

Interactions

2024-03-04T04:44:34.780192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-04T04:44:34.266591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-04T04:44:35.038393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-04T04:44:34.517340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-04T04:44:41.596467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
이용시간대연령대성별고객 거주지_시군구고객거주지_읍면동가맹점_시군구가맹점_읍면동가맹점_업종 대분류가맹점_업종 소분류이용금액 합계이용건수 합계
이용시간대1.0000.0000.1050.0000.0000.0000.0000.6170.3320.4650.000
연령대0.0001.0000.0000.3520.0000.3520.0000.6970.6670.0000.000
성별0.1050.0001.0000.0440.0000.0440.0000.0000.0000.5990.000
고객 거주지_시군구0.0000.3520.0441.0001.0001.0001.0000.4400.2440.3190.000
고객거주지_읍면동0.0000.0000.0001.0001.0001.0001.0000.0000.0000.0000.290
가맹점_시군구0.0000.3520.0441.0001.0001.0001.0000.4400.2440.3190.000
가맹점_읍면동0.0000.0000.0001.0001.0001.0001.0000.0000.0000.0000.290
가맹점_업종 대분류0.6170.6970.0000.4400.0000.4400.0001.0001.0000.8160.000
가맹점_업종 소분류0.3320.6670.0000.2440.0000.2440.0001.0001.0000.6740.000
이용금액 합계0.4650.0000.5990.3190.0000.3190.0000.8160.6741.0000.598
이용건수 합계0.0000.0000.0000.0000.2900.0000.2900.0000.0000.5981.000
2024-03-04T04:44:41.834074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가맹점_업종 대분류고객거주지_읍면동연령대가맹점_시군구가맹점_업종 소분류고객 거주지_시군구성별가맹점_읍면동이용시간대
가맹점_업종 대분류1.0000.0000.3510.3510.9810.3510.0000.0000.281
고객거주지_읍면동0.0001.0000.0000.8000.0000.8000.0001.0000.000
연령대0.3510.0001.0000.2650.6810.2650.0000.0000.000
가맹점_시군구0.3510.8000.2651.0000.1851.0000.0000.8000.000
가맹점_업종 소분류0.9810.0000.6810.1851.0000.1850.0000.0000.309
고객 거주지_시군구0.3510.8000.2651.0000.1851.0000.0000.8000.000
성별0.0000.0000.0000.0000.0000.0001.0000.0000.163
가맹점_읍면동0.0001.0000.0000.8000.0000.8000.0001.0000.000
이용시간대0.2810.0000.0000.0000.3090.0000.1630.0001.000
2024-03-04T04:44:42.052943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
이용금액 합계이용건수 합계이용시간대연령대성별고객 거주지_시군구고객거주지_읍면동가맹점_시군구가맹점_읍면동가맹점_업종 대분류가맹점_업종 소분류
이용금액 합계1.0000.0710.0740.0000.3650.1540.0000.1540.0000.4470.418
이용건수 합계0.0711.0000.0000.0000.0000.0000.0000.0000.0000.0000.000
이용시간대0.0740.0001.0000.0000.1630.0000.0000.0000.0000.2810.309
연령대0.0000.0000.0001.0000.0000.2650.0000.2650.0000.3510.681
성별0.3650.0000.1630.0001.0000.0000.0000.0000.0000.0000.000
고객 거주지_시군구0.1540.0000.0000.2650.0001.0000.8001.0000.8000.3510.185
고객거주지_읍면동0.0000.0000.0000.0000.0000.8001.0000.8001.0000.0000.000
가맹점_시군구0.1540.0000.0000.2650.0001.0000.8001.0000.8000.3510.185
가맹점_읍면동0.0000.0000.0000.0000.0000.8001.0000.8001.0000.0000.000
가맹점_업종 대분류0.4470.0000.2810.3510.0000.3510.0000.3510.0001.0000.981
가맹점_업종 소분류0.4180.0000.3090.6810.0000.1850.0000.1850.0000.9811.000

Missing values

2024-03-04T04:44:35.411402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-04T04:44:35.953755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/