Overview

Dataset statistics

Number of variables14
Number of observations1494
Missing cells3005
Missing cells (%)14.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory181.0 KiB
Average record size in memory124.1 B

Variable types

Numeric12
Categorical1
Text1

Dataset

Description2022년 4월~6월 중 강릉, 경주, 목포, 부산, 안동, 영주, 여수, 전주를 방문한 철도 이용객의 거주지 시군구 단위로 발매처별 철도회원의 카드이용 관련 정보 (발매처별 중복 포함)
Author한국철도공사
URLhttps://www.data.go.kr/data/15111377/fileData.do

Alerts

법정동시도코드 is highly overall correlated with 시도명High correlation
골프이용_1 is highly overall correlated with 골프이용_2 and 7 other fieldsHigh correlation
골프이용_2 is highly overall correlated with 골프이용_1 and 7 other fieldsHigh correlation
골프이용_3 is highly overall correlated with 골프이용_1 and 7 other fieldsHigh correlation
반려관심_1 is highly overall correlated with 골프이용_1 and 7 other fieldsHigh correlation
반려관심_2 is highly overall correlated with 골프이용_1 and 7 other fieldsHigh correlation
반려관심_3 is highly overall correlated with 골프이용_1 and 7 other fieldsHigh correlation
여행이용_1 is highly overall correlated with 골프이용_1 and 7 other fieldsHigh correlation
여행이용_2 is highly overall correlated with 골프이용_1 and 7 other fieldsHigh correlation
여행이용_3 is highly overall correlated with 골프이용_1 and 7 other fieldsHigh correlation
시도명 is highly overall correlated with 법정동시도코드High correlation
골프이용_1 has 440 (29.5%) missing valuesMissing
골프이용_2 has 416 (27.8%) missing valuesMissing
골프이용_3 has 408 (27.3%) missing valuesMissing
반려관심_1 has 306 (20.5%) missing valuesMissing
반려관심_2 has 274 (18.3%) missing valuesMissing
반려관심_3 has 289 (19.3%) missing valuesMissing
여행이용_1 has 305 (20.4%) missing valuesMissing
여행이용_2 has 300 (20.1%) missing valuesMissing
여행이용_3 has 267 (17.9%) missing valuesMissing
골프이용_1 has 356 (23.8%) zerosZeros
골프이용_2 has 360 (24.1%) zerosZeros
골프이용_3 has 354 (23.7%) zerosZeros
반려관심_1 has 370 (24.8%) zerosZeros
반려관심_2 has 380 (25.4%) zerosZeros
반려관심_3 has 363 (24.3%) zerosZeros
여행이용_1 has 368 (24.6%) zerosZeros
여행이용_2 has 360 (24.1%) zerosZeros
여행이용_3 has 351 (23.5%) zerosZeros

Reproduction

Analysis started2023-12-12 07:59:01.867249
Analysis finished2023-12-12 07:59:20.235211
Duration18.37 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

법정동시도코드
Real number (ℝ)

HIGH CORRELATION 

Distinct17
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.678046
Minimum11
Maximum50
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:20.288776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11
5-th percentile11
Q129
median42
Q346
95-th percentile48
Maximum50
Range39
Interquartile range (IQR)17

Descriptive statistics

Standard deviation11.259175
Coefficient of variation (CV)0.29882587
Kurtosis0.47812197
Mean37.678046
Median Absolute Deviation (MAD)4
Skewness-1.273072
Sum56291
Variance126.76902
MonotonicityNot monotonic
2023-12-12T16:59:20.408770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
41 252
16.9%
11 150
10.0%
47 143
9.6%
48 131
8.8%
46 130
8.7%
42 108
7.2%
26 96
 
6.4%
44 96
 
6.4%
45 88
 
5.9%
43 84
 
5.6%
Other values (7) 216
14.5%
ValueCountFrequency (%)
11 150
10.0%
26 96
 
6.4%
27 48
 
3.2%
28 60
 
4.0%
29 30
 
2.0%
30 30
 
2.0%
31 30
 
2.0%
36 6
 
0.4%
41 252
16.9%
42 108
7.2%
ValueCountFrequency (%)
50 12
 
0.8%
48 131
8.8%
47 143
9.6%
46 130
8.7%
45 88
 
5.9%
44 96
 
6.4%
43 84
 
5.6%
42 108
7.2%
41 252
16.9%
36 6
 
0.4%

시도명
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size11.8 KiB
경기
252 
서울
150 
경북
143 
경남
131 
전남
130 
Other values (12)
688 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기
2nd row경기
3rd row경기
4th row경기
5th row경기

Common Values

ValueCountFrequency (%)
경기 252
16.9%
서울 150
10.0%
경북 143
9.6%
경남 131
8.8%
전남 130
8.7%
강원 108
7.2%
충남 96
 
6.4%
부산 96
 
6.4%
전북 88
 
5.9%
충북 84
 
5.6%
Other values (7) 216
14.5%

Length

2023-12-12T16:59:20.558652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 252
16.9%
서울 150
10.0%
경북 143
9.6%
경남 131
8.8%
전남 130
8.7%
강원 108
7.2%
충남 96
 
6.4%
부산 96
 
6.4%
전북 88
 
5.9%
충북 84
 
5.6%
Other values (7) 216
14.5%

법정동시군구코드
Real number (ℝ)

Distinct101
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean434.62249
Minimum110
Maximum940
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:20.698033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum110
5-th percentile111
Q1170
median310
Q3737.5
95-th percentile860
Maximum940
Range830
Interquartile range (IQR)567.5

Descriptive statistics

Standard deviation281.33436
Coefficient of variation (CV)0.64730741
Kurtosis-1.5543023
Mean434.62249
Median Absolute Deviation (MAD)189
Skewness0.32841999
Sum649326
Variance79149.021
MonotonicityNot monotonic
2023-12-12T16:59:20.837210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
110 66
 
4.4%
170 60
 
4.0%
140 48
 
3.2%
230 48
 
3.2%
200 48
 
3.2%
710 47
 
3.1%
720 41
 
2.7%
130 36
 
2.4%
150 36
 
2.4%
800 36
 
2.4%
Other values (91) 1028
68.8%
ValueCountFrequency (%)
110 66
4.4%
111 24
 
1.6%
112 6
 
0.4%
113 24
 
1.6%
114 6
 
0.4%
115 6
 
0.4%
117 6
 
0.4%
121 6
 
0.4%
123 6
 
0.4%
125 6
 
0.4%
ValueCountFrequency (%)
940 5
0.3%
930 6
0.4%
920 6
0.4%
910 5
0.3%
900 12
0.8%
890 12
0.8%
880 12
0.8%
870 12
0.8%
860 12
0.8%
850 12
0.8%
Distinct228
Distinct (%)15.3%
Missing0
Missing (%)0.0%
Memory size11.8 KiB
2023-12-12T16:59:21.149112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.4698795
Min length2

Characters and Unicode

Total characters5184
Distinct characters143
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수원시 장안구
2nd row수원시 장안구
3rd row수원시 장안구
4th row수원시 장안구
5th row수원시 장안구
ValueCountFrequency (%)
동구 36
 
2.1%
중구 36
 
2.1%
서구 30
 
1.8%
남구 30
 
1.8%
북구 30
 
1.8%
창원시 30
 
1.8%
수원시 24
 
1.4%
청주시 24
 
1.4%
고양시 18
 
1.1%
용인시 18
 
1.1%
Other values (227) 1410
83.6%
2023-12-12T16:59:21.699091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
636
 
12.3%
600
 
11.6%
504
 
9.7%
192
 
3.7%
144
 
2.8%
138
 
2.7%
138
 
2.7%
131
 
2.5%
126
 
2.4%
120
 
2.3%
Other values (133) 2455
47.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4992
96.3%
Space Separator 192
 
3.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
636
 
12.7%
600
 
12.0%
504
 
10.1%
144
 
2.9%
138
 
2.8%
138
 
2.8%
131
 
2.6%
126
 
2.5%
120
 
2.4%
108
 
2.2%
Other values (132) 2347
47.0%
Space Separator
ValueCountFrequency (%)
192
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4992
96.3%
Common 192
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
636
 
12.7%
600
 
12.0%
504
 
10.1%
144
 
2.9%
138
 
2.8%
138
 
2.8%
131
 
2.6%
126
 
2.5%
120
 
2.4%
108
 
2.2%
Other values (132) 2347
47.0%
Common
ValueCountFrequency (%)
192
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4992
96.3%
ASCII 192
 
3.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
636
 
12.7%
600
 
12.0%
504
 
10.1%
144
 
2.9%
138
 
2.8%
138
 
2.8%
131
 
2.6%
126
 
2.5%
120
 
2.4%
108
 
2.2%
Other values (132) 2347
47.0%
ASCII
ValueCountFrequency (%)
192
100.0%

발매처코드
Real number (ℝ)

Distinct6
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.51004
Minimum11
Maximum16
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:21.863570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11
5-th percentile11
Q112
median14
Q315
95-th percentile16
Maximum16
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7044423
Coefficient of variation (CV)0.12616116
Kurtosis-1.2643455
Mean13.51004
Median Absolute Deviation (MAD)1
Skewness-0.0049973236
Sum20184
Variance2.9051235
MonotonicityNot monotonic
2023-12-12T16:59:22.322655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
16 250
16.7%
14 250
16.7%
12 250
16.7%
13 250
16.7%
15 250
16.7%
11 244
16.3%
ValueCountFrequency (%)
11 244
16.3%
12 250
16.7%
13 250
16.7%
14 250
16.7%
15 250
16.7%
16 250
16.7%
ValueCountFrequency (%)
16 250
16.7%
15 250
16.7%
14 250
16.7%
13 250
16.7%
12 250
16.7%
11 244
16.3%

골프이용_1
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct184
Distinct (%)17.5%
Missing440
Missing (%)29.5%
Infinite0
Infinite (%)0.0%
Mean45.242884
Minimum0
Maximum713
Zeros356
Zeros (%)23.8%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:22.481483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median12
Q335.75
95-th percentile246.1
Maximum713
Range713
Interquartile range (IQR)35.75

Descriptive statistics

Standard deviation99.844731
Coefficient of variation (CV)2.2068604
Kurtosis15.195639
Mean45.242884
Median Absolute Deviation (MAD)12
Skewness3.732148
Sum47686
Variance9968.9704
MonotonicityNot monotonic
2023-12-12T16:59:22.633770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 356
23.8%
6 36
 
2.4%
8 32
 
2.1%
7 30
 
2.0%
11 28
 
1.9%
14 23
 
1.5%
9 22
 
1.5%
12 21
 
1.4%
10 20
 
1.3%
13 19
 
1.3%
Other values (174) 467
31.3%
(Missing) 440
29.5%
ValueCountFrequency (%)
0 356
23.8%
6 36
 
2.4%
7 30
 
2.0%
8 32
 
2.1%
9 22
 
1.5%
10 20
 
1.3%
11 28
 
1.9%
12 21
 
1.4%
13 19
 
1.3%
14 23
 
1.5%
ValueCountFrequency (%)
713 1
0.1%
679 1
0.1%
671 1
0.1%
666 1
0.1%
633 1
0.1%
603 1
0.1%
587 1
0.1%
583 1
0.1%
542 1
0.1%
537 1
0.1%

골프이용_2
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct193
Distinct (%)17.9%
Missing416
Missing (%)27.8%
Infinite0
Infinite (%)0.0%
Mean49.165121
Minimum0
Maximum838
Zeros360
Zeros (%)24.1%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:22.810904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median12
Q338
95-th percentile285.15
Maximum838
Range838
Interquartile range (IQR)38

Descriptive statistics

Standard deviation109.21343
Coefficient of variation (CV)2.22136
Kurtosis17.432789
Mean49.165121
Median Absolute Deviation (MAD)12
Skewness3.9064376
Sum53000
Variance11927.573
MonotonicityNot monotonic
2023-12-12T16:59:22.981965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 360
24.1%
6 38
 
2.5%
7 28
 
1.9%
8 27
 
1.8%
9 25
 
1.7%
11 24
 
1.6%
13 23
 
1.5%
10 21
 
1.4%
20 20
 
1.3%
22 19
 
1.3%
Other values (183) 493
33.0%
(Missing) 416
27.8%
ValueCountFrequency (%)
0 360
24.1%
6 38
 
2.5%
7 28
 
1.9%
8 27
 
1.8%
9 25
 
1.7%
10 21
 
1.4%
11 24
 
1.6%
12 19
 
1.3%
13 23
 
1.5%
14 15
 
1.0%
ValueCountFrequency (%)
838 1
0.1%
811 1
0.1%
765 1
0.1%
732 1
0.1%
727 1
0.1%
699 1
0.1%
693 1
0.1%
685 1
0.1%
680 1
0.1%
670 1
0.1%

골프이용_3
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct232
Distinct (%)21.4%
Missing408
Missing (%)27.3%
Infinite0
Infinite (%)0.0%
Mean68.654696
Minimum0
Maximum1481
Zeros354
Zeros (%)23.7%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:23.158340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median15
Q352.75
95-th percentile364.25
Maximum1481
Range1481
Interquartile range (IQR)52.75

Descriptive statistics

Standard deviation158.47121
Coefficient of variation (CV)2.3082355
Kurtosis21.888555
Mean68.654696
Median Absolute Deviation (MAD)15
Skewness4.2545233
Sum74559
Variance25113.124
MonotonicityNot monotonic
2023-12-12T16:59:23.379600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 354
23.7%
7 26
 
1.7%
6 23
 
1.5%
10 22
 
1.5%
9 20
 
1.3%
8 20
 
1.3%
15 20
 
1.3%
14 18
 
1.2%
12 17
 
1.1%
21 14
 
0.9%
Other values (222) 552
36.9%
(Missing) 408
27.3%
ValueCountFrequency (%)
0 354
23.7%
6 23
 
1.5%
7 26
 
1.7%
8 20
 
1.3%
9 20
 
1.3%
10 22
 
1.5%
11 12
 
0.8%
12 17
 
1.1%
13 14
 
0.9%
14 18
 
1.2%
ValueCountFrequency (%)
1481 1
0.1%
1357 1
0.1%
1115 1
0.1%
1105 1
0.1%
1076 1
0.1%
1029 1
0.1%
979 1
0.1%
952 1
0.1%
860 1
0.1%
834 1
0.1%

반려관심_1
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct207
Distinct (%)17.4%
Missing306
Missing (%)20.5%
Infinite0
Infinite (%)0.0%
Mean58.626263
Minimum0
Maximum1134
Zeros370
Zeros (%)24.8%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:23.629388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median14
Q339
95-th percentile343
Maximum1134
Range1134
Interquartile range (IQR)39

Descriptive statistics

Standard deviation138.794
Coefficient of variation (CV)2.3674373
Kurtosis16.834406
Mean58.626263
Median Absolute Deviation (MAD)14
Skewness3.9003875
Sum69648
Variance19263.774
MonotonicityNot monotonic
2023-12-12T16:59:23.822521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 370
24.8%
6 37
 
2.5%
7 37
 
2.5%
9 30
 
2.0%
8 28
 
1.9%
11 28
 
1.9%
10 28
 
1.9%
19 22
 
1.5%
15 21
 
1.4%
18 21
 
1.4%
Other values (197) 566
37.9%
(Missing) 306
20.5%
ValueCountFrequency (%)
0 370
24.8%
6 37
 
2.5%
7 37
 
2.5%
8 28
 
1.9%
9 30
 
2.0%
10 28
 
1.9%
11 28
 
1.9%
12 20
 
1.3%
13 14
 
0.9%
14 16
 
1.1%
ValueCountFrequency (%)
1134 1
0.1%
1065 1
0.1%
984 1
0.1%
858 1
0.1%
842 1
0.1%
827 1
0.1%
824 1
0.1%
782 1
0.1%
765 1
0.1%
746 1
0.1%

반려관심_2
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct228
Distinct (%)18.7%
Missing274
Missing (%)18.3%
Infinite0
Infinite (%)0.0%
Mean69.347541
Minimum0
Maximum1278
Zeros380
Zeros (%)25.4%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:23.991180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median16
Q348
95-th percentile417.2
Maximum1278
Range1278
Interquartile range (IQR)48

Descriptive statistics

Standard deviation161.0504
Coefficient of variation (CV)2.3223665
Kurtosis15.639994
Mean69.347541
Median Absolute Deviation (MAD)16
Skewness3.7921217
Sum84604
Variance25937.233
MonotonicityNot monotonic
2023-12-12T16:59:24.170435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 380
25.4%
6 35
 
2.3%
7 29
 
1.9%
8 26
 
1.7%
12 23
 
1.5%
15 22
 
1.5%
10 22
 
1.5%
9 19
 
1.3%
17 19
 
1.3%
34 18
 
1.2%
Other values (218) 627
42.0%
(Missing) 274
18.3%
ValueCountFrequency (%)
0 380
25.4%
6 35
 
2.3%
7 29
 
1.9%
8 26
 
1.7%
9 19
 
1.3%
10 22
 
1.5%
11 16
 
1.1%
12 23
 
1.5%
13 15
 
1.0%
14 16
 
1.1%
ValueCountFrequency (%)
1278 1
0.1%
1063 1
0.1%
1046 1
0.1%
1045 1
0.1%
1014 1
0.1%
999 1
0.1%
993 1
0.1%
939 1
0.1%
919 1
0.1%
901 1
0.1%

반려관심_3
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct234
Distinct (%)19.4%
Missing289
Missing (%)19.3%
Infinite0
Infinite (%)0.0%
Mean78.308714
Minimum0
Maximum1642
Zeros363
Zeros (%)24.3%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:24.360877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median17
Q356
95-th percentile464.6
Maximum1642
Range1642
Interquartile range (IQR)56

Descriptive statistics

Standard deviation183.44878
Coefficient of variation (CV)2.3426356
Kurtosis17.111353
Mean78.308714
Median Absolute Deviation (MAD)17
Skewness3.8980696
Sum94362
Variance33653.454
MonotonicityNot monotonic
2023-12-12T16:59:24.509960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 363
24.3%
6 29
 
1.9%
7 27
 
1.8%
8 26
 
1.7%
10 24
 
1.6%
12 23
 
1.5%
16 21
 
1.4%
14 19
 
1.3%
9 19
 
1.3%
15 18
 
1.2%
Other values (224) 636
42.6%
(Missing) 289
19.3%
ValueCountFrequency (%)
0 363
24.3%
6 29
 
1.9%
7 27
 
1.8%
8 26
 
1.7%
9 19
 
1.3%
10 24
 
1.6%
11 14
 
0.9%
12 23
 
1.5%
13 10
 
0.7%
14 19
 
1.3%
ValueCountFrequency (%)
1642 1
0.1%
1350 1
0.1%
1189 1
0.1%
1091 1
0.1%
1090 1
0.1%
1080 1
0.1%
1070 1
0.1%
1047 2
0.1%
1027 1
0.1%
972 1
0.1%

여행이용_1
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct220
Distinct (%)18.5%
Missing305
Missing (%)20.4%
Infinite0
Infinite (%)0.0%
Mean69.340622
Minimum0
Maximum1443
Zeros368
Zeros (%)24.6%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:24.656539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median15
Q346
95-th percentile397.4
Maximum1443
Range1443
Interquartile range (IQR)46

Descriptive statistics

Standard deviation167.09372
Coefficient of variation (CV)2.4097522
Kurtosis19.254508
Mean69.340622
Median Absolute Deviation (MAD)15
Skewness4.12687
Sum82446
Variance27920.311
MonotonicityNot monotonic
2023-12-12T16:59:24.807682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 368
24.6%
6 43
 
2.9%
7 30
 
2.0%
11 29
 
1.9%
8 27
 
1.8%
10 22
 
1.5%
12 20
 
1.3%
9 20
 
1.3%
21 19
 
1.3%
23 17
 
1.1%
Other values (210) 594
39.8%
(Missing) 305
20.4%
ValueCountFrequency (%)
0 368
24.6%
6 43
 
2.9%
7 30
 
2.0%
8 27
 
1.8%
9 20
 
1.3%
10 22
 
1.5%
11 29
 
1.9%
12 20
 
1.3%
13 13
 
0.9%
14 16
 
1.1%
ValueCountFrequency (%)
1443 1
0.1%
1241 1
0.1%
1220 1
0.1%
1210 1
0.1%
1038 1
0.1%
999 1
0.1%
990 1
0.1%
966 1
0.1%
958 1
0.1%
943 1
0.1%

여행이용_2
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct245
Distinct (%)20.5%
Missing300
Missing (%)20.1%
Infinite0
Infinite (%)0.0%
Mean82.393635
Minimum0
Maximum1723
Zeros360
Zeros (%)24.1%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:24.942004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median17.5
Q357
95-th percentile440.7
Maximum1723
Range1723
Interquartile range (IQR)57

Descriptive statistics

Standard deviation199.54736
Coefficient of variation (CV)2.4218783
Kurtosis20.717343
Mean82.393635
Median Absolute Deviation (MAD)17.5
Skewness4.2623954
Sum98378
Variance39819.148
MonotonicityNot monotonic
2023-12-12T16:59:25.094421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 360
24.1%
6 31
 
2.1%
8 24
 
1.6%
14 24
 
1.6%
10 23
 
1.5%
9 21
 
1.4%
13 21
 
1.4%
18 20
 
1.3%
11 18
 
1.2%
15 17
 
1.1%
Other values (235) 635
42.5%
(Missing) 300
20.1%
ValueCountFrequency (%)
0 360
24.1%
6 31
 
2.1%
7 16
 
1.1%
8 24
 
1.6%
9 21
 
1.4%
10 23
 
1.5%
11 18
 
1.2%
12 16
 
1.1%
13 21
 
1.4%
14 24
 
1.6%
ValueCountFrequency (%)
1723 1
0.1%
1509 1
0.1%
1507 1
0.1%
1381 1
0.1%
1291 1
0.1%
1283 1
0.1%
1282 1
0.1%
1226 1
0.1%
1206 1
0.1%
1188 1
0.1%

여행이용_3
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct308
Distinct (%)25.1%
Missing267
Missing (%)17.9%
Infinite0
Infinite (%)0.0%
Mean127.63814
Minimum0
Maximum3056
Zeros351
Zeros (%)23.5%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2023-12-12T16:59:25.226994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median23
Q382
95-th percentile660.8
Maximum3056
Range3056
Interquartile range (IQR)82

Descriptive statistics

Standard deviation335.10552
Coefficient of variation (CV)2.6254341
Kurtosis28.726968
Mean127.63814
Median Absolute Deviation (MAD)23
Skewness4.9095479
Sum156612
Variance112295.71
MonotonicityNot monotonic
2023-12-12T16:59:25.359674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/