Overview

Dataset statistics

Number of variables8
Number of observations6451
Missing cells3328
Missing cells (%)6.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory447.4 KiB
Average record size in memory71.0 B

Variable types

Text1
Numeric7

Dataset

Description인천광역시 시내버스 정류장별 이용승객 현황으로 <br/>정류소명, 정류소아이디, 승차건수, 하차건수 등을 제공합니다.
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15048264&srcSe=7661IVAWM27C61E190

Alerts

총승차건수 is highly overall correlated with 총하차건수 and 4 other fieldsHigh correlation
총하차건수 is highly overall correlated with 총승차건수 and 4 other fieldsHigh correlation
승차건수(카드) is highly overall correlated with 총승차건수 and 4 other fieldsHigh correlation
하차건수(카드) is highly overall correlated with 총승차건수 and 4 other fieldsHigh correlation
승차건수(현금) is highly overall correlated with 총승차건수 and 4 other fieldsHigh correlation
일평균승하차건수 is highly overall correlated with 총승차건수 and 4 other fieldsHigh correlation
총승차건수 has 169 (2.6%) missing valuesMissing
총하차건수 has 160 (2.5%) missing valuesMissing
승차건수(카드) has 183 (2.8%) missing valuesMissing
하차건수(카드) has 160 (2.5%) missing valuesMissing
승차건수(현금) has 2656 (41.2%) missing valuesMissing
일평균승하차건수 has 514 (8.0%) zerosZeros

Reproduction

Analysis started2024-05-17 21:28:04.137294
Analysis finished2024-05-17 21:28:20.300407
Duration16.16 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3757
Distinct (%)58.2%
Missing0
Missing (%)0.0%
Memory size50.5 KiB
2024-05-18T06:28:20.660525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length18
Mean length6.6595877
Min length2

Characters and Unicode

Total characters42961
Distinct characters628
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1532 ?
Unique (%)23.7%

Sample

1st row(구)국제여객터미널
2nd row(구)국제여객터미널
3rd row(구)시민회관사거리
4th row(구)시민회관사거리
5th row(주)경동세라믹스
ValueCountFrequency (%)
대동아파트 13
 
0.2%
현대아파트 12
 
0.2%
부대앞 11
 
0.2%
한국아파트 11
 
0.2%
입구 10
 
0.2%
쌍용아파트 9
 
0.1%
광명아파트 9
 
0.1%
풍림아파트 8
 
0.1%
신동아아파트 8
 
0.1%
뉴서울아파트 8
 
0.1%
Other values (3753) 6393
98.5%
2024-05-18T06:28:21.419222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1104
 
2.6%
1059
 
2.5%
942
 
2.2%
882
 
2.1%
882
 
2.1%
830
 
1.9%
820
 
1.9%
803
 
1.9%
691
 
1.6%
535
 
1.2%
Other values (618) 34413
80.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39921
92.9%
Decimal Number 1270
 
3.0%
Close Punctuation 515
 
1.2%
Open Punctuation 515
 
1.2%
Uppercase Letter 327
 
0.8%
Other Punctuation 319
 
0.7%
Space Separator 51
 
0.1%
Lowercase Letter 37
 
0.1%
Dash Punctuation 5
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1104
 
2.8%
1059
 
2.7%
942
 
2.4%
882
 
2.2%
882
 
2.2%
830
 
2.1%
820
 
2.1%
803
 
2.0%
691
 
1.7%
535
 
1.3%
Other values (568) 31373
78.6%
Uppercase Letter
ValueCountFrequency (%)
K 42
12.8%
S 40
12.2%
T 40
12.2%
C 31
9.5%
L 29
8.9%
G 27
8.3%
H 25
7.6%
A 17
 
5.2%
I 15
 
4.6%
B 14
 
4.3%
Other values (12) 47
14.4%
Decimal Number
ValueCountFrequency (%)
1 397
31.3%
2 296
23.3%
3 167
13.1%
0 100
 
7.9%
4 87
 
6.9%
5 73
 
5.7%
6 49
 
3.9%
7 45
 
3.5%
9 30
 
2.4%
8 26
 
2.0%
Lowercase Letter
ValueCountFrequency (%)
e 24
64.9%
s 4
 
10.8%
k 2
 
5.4%
f 1
 
2.7%
y 1
 
2.7%
t 1
 
2.7%
n 1
 
2.7%
i 1
 
2.7%
m 1
 
2.7%
g 1
 
2.7%
Other Punctuation
ValueCountFrequency (%)
. 275
86.2%
· 42
 
13.2%
, 2
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 515
100.0%
Open Punctuation
ValueCountFrequency (%)
( 515
100.0%
Space Separator
ValueCountFrequency (%)
51
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39921
92.9%
Common 2676
 
6.2%
Latin 364
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1104
 
2.8%
1059
 
2.7%
942
 
2.4%
882
 
2.2%
882
 
2.2%
830
 
2.1%
820
 
2.1%
803
 
2.0%
691
 
1.7%
535
 
1.3%
Other values (568) 31373
78.6%
Latin
ValueCountFrequency (%)
K 42
11.5%
S 40
11.0%
T 40
11.0%
C 31
8.5%
L 29
 
8.0%
G 27
 
7.4%
H 25
 
6.9%
e 24
 
6.6%
A 17
 
4.7%
I 15
 
4.1%
Other values (22) 74
20.3%
Common
ValueCountFrequency (%)
) 515
19.2%
( 515
19.2%
1 397
14.8%
2 296
11.1%
. 275
10.3%
3 167
 
6.2%
0 100
 
3.7%
4 87
 
3.3%
5 73
 
2.7%
51
 
1.9%
Other values (8) 200
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39921
92.9%
ASCII 2998
 
7.0%
None 42
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1104
 
2.8%
1059
 
2.7%
942
 
2.4%
882
 
2.2%
882
 
2.2%
830
 
2.1%
820
 
2.1%
803
 
2.0%
691
 
1.7%
535
 
1.3%
Other values (568) 31373
78.6%
ASCII
ValueCountFrequency (%)
) 515
17.2%
( 515
17.2%
1 397
13.2%
2 296
9.9%
. 275
9.2%
3 167
 
5.6%
0 100
 
3.3%
4 87
 
2.9%
5 73
 
2.4%
51
 
1.7%
Other values (39) 522
17.4%
None
ValueCountFrequency (%)
· 42
100.0%

정류소아이디
Real number (ℝ)

Distinct6438
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46978.475
Minimum11002
Maximum92076
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size56.8 KiB
2024-05-18T06:28:21.676337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11002
5-th percentile35302.5
Q138349.5
median40830
Q343518.5
95-th percentile90093.5
Maximum92076
Range81074
Interquartile range (IQR)5169

Descriptive statistics

Standard deviation17434.943
Coefficient of variation (CV)0.37112621
Kurtosis1.8423011
Mean46978.475
Median Absolute Deviation (MAD)2606
Skewness1.8234066
Sum3.0305814 × 108
Variance3.0397725 × 108
MonotonicityNot monotonic
2024-05-18T06:28:22.058897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35761 2
 
< 0.1%
35396 2
 
< 0.1%
35147 2
 
< 0.1%
35610 2
 
< 0.1%
35073 2
 
< 0.1%
35097 2
 
< 0.1%
35219 2
 
< 0.1%
35214 2
 
< 0.1%
35074 2
 
< 0.1%
35246 2
 
< 0.1%
Other values (6428) 6431
99.7%
ValueCountFrequency (%)
11002 1
< 0.1%
11013 1
< 0.1%
11014 1
< 0.1%
11015 1
< 0.1%
11020 1
< 0.1%
11021 1
< 0.1%
11187 1
< 0.1%
14001 1
< 0.1%
14002 1
< 0.1%
14149 1
< 0.1%
ValueCountFrequency (%)
92076 1
< 0.1%
92075 1
< 0.1%
92074 1
< 0.1%
92071 1
< 0.1%
92070 1
< 0.1%
92069 1
< 0.1%
92068 1
< 0.1%
92067 1
< 0.1%
92066 1
< 0.1%
92065 1
< 0.1%

총승차건수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct3415
Distinct (%)54.4%
Missing169
Missing (%)2.6%
Infinite0
Infinite (%)0.0%
Mean3720.2813
Minimum1
Maximum166865
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size56.8 KiB
2024-05-18T06:28:22.428700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5
Q173.25
median875.5
Q33908.5
95-th percentile15207.4
Maximum166865
Range166864
Interquartile range (IQR)3835.25

Descriptive statistics

Standard deviation8666.774
Coefficient of variation (CV)2.3296018
Kurtosis98.062929
Mean3720.2813
Median Absolute Deviation (MAD)863.5
Skewness7.7703155
Sum23370807
Variance75112971
MonotonicityNot monotonic
2024-05-18T06:28:22.692727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 99
 
1.5%
3 72
 
1.1%
2 67
 
1.0%
4 63
 
1.0%
5 51
 
0.8%
12 48
 
0.7%
6 46
 
0.7%
15 45
 
0.7%
8 44
 
0.7%
22 43
 
0.7%
Other values (3405) 5704
88.4%
(Missing) 169
 
2.6%
ValueCountFrequency (%)
1 99
1.5%
2 67
1.0%
3 72
1.1%
4 63
1.0%
5 51
0.8%
6 46
0.7%
7 33
 
0.5%
8 44
0.7%
9 41
0.6%
10 38
 
0.6%
ValueCountFrequency (%)
166865 1
< 0.1%
165753 1
< 0.1%
158398 1
< 0.1%
143214 1
< 0.1%
131019 1
< 0.1%
107944 1
< 0.1%
103692 1
< 0.1%
101955 1
< 0.1%
91436 1
< 0.1%
91210 1
< 0.1%

총하차건수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct3493
Distinct (%)55.5%
Missing160
Missing (%)2.5%
Infinite0
Infinite (%)0.0%
Mean3532.9638
Minimum1
Maximum156090
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size56.8 KiB
2024-05-18T06:28:22.969901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5
Q190
median1012
Q33794
95-th percentile14342
Maximum156090
Range156089
Interquartile range (IQR)3704

Descriptive statistics

Standard deviation7804.6121
Coefficient of variation (CV)2.2090836
Kurtosis83.352485
Mean3532.9638
Median Absolute Deviation (MAD)994
Skewness7.1892962
Sum22225875
Variance60911971
MonotonicityNot monotonic
2024-05-18T06:28:23.286065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 89
 
1.4%
2 68
 
1.1%
3 62
 
1.0%
5 55
 
0.9%
4 43
 
0.7%
10 42
 
0.7%
8 42
 
0.7%
9 39
 
0.6%
6 37
 
0.6%
12 35
 
0.5%
Other values (3483) 5779
89.6%
(Missing) 160
 
2.5%
ValueCountFrequency (%)
1 89
1.4%
2 68
1.1%
3 62
1.0%
4 43
0.7%
5 55
0.9%
6 37
0.6%
7 31
 
0.5%
8 42
0.7%
9 39
0.6%
10 42
0.7%
ValueCountFrequency (%)
156090 1
< 0.1%
135085 1
< 0.1%
121709 1
< 0.1%
115546 1
< 0.1%
110231 1
< 0.1%
107576 1
< 0.1%
90560 1
< 0.1%
86908 1
< 0.1%
77374 1
< 0.1%
76558 1
< 0.1%

승차건수(카드)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct3385
Distinct (%)54.0%
Missing183
Missing (%)2.8%
Infinite0
Infinite (%)0.0%
Mean3710.1878
Minimum1
Maximum166349
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size56.8 KiB
2024-05-18T06:28:23.658846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q171
median874
Q33903
95-th percentile15110.25
Maximum166349
Range166348
Interquartile range (IQR)3832

Descriptive statistics

Standard deviation8645.5693
Coefficient of variation (CV)2.3302242
Kurtosis98.304709
Mean3710.1878
Median Absolute Deviation (MAD)862
Skewness7.7803509
Sum23255457
Variance74745869
MonotonicityNot monotonic
2024-05-18T06:28:24.110939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 103
 
1.6%
3 78
 
1.2%
2 78
 
1.2%
4 58
 
0.9%
5 49
 
0.8%
6 48
 
0.7%
12 46
 
0.7%
15 45
 
0.7%
14 43
 
0.7%
10 42
 
0.7%
Other values (3375) 5678
88.0%
(Missing) 183
 
2.8%
ValueCountFrequency (%)
1 103
1.6%
2 78
1.2%
3 78
1.2%
4 58
0.9%
5 49
0.8%
6 48
0.7%
7 41
 
0.6%
8 41
 
0.6%
9 38
 
0.6%
10 42
0.7%
ValueCountFrequency (%)
166349 1
< 0.1%
165634 1
< 0.1%
157869 1
< 0.1%
143214 1
< 0.1%
130546 1
< 0.1%
107592 1
< 0.1%
103240 1
< 0.1%
101781 1
< 0.1%
91356 1
< 0.1%
91014 1
< 0.1%

하차건수(카드)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct3493
Distinct (%)55.5%
Missing160
Missing (%)2.5%
Infinite0
Infinite (%)0.0%
Mean3532.9638
Minimum1
Maximum156090
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size56.8 KiB
2024-05-18T06:28:24.366712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5
Q190
median1012
Q33794
95-th percentile14342
Maximum156090
Range156089
Interquartile range (IQR)3704

Descriptive statistics

Standard deviation7804.6121
Coefficient of variation (CV)2.2090836
Kurtosis83.352485
Mean3532.9638
Median Absolute Deviation (MAD)994
Skewness7.1892962
Sum22225875
Variance60911971
MonotonicityNot monotonic
2024-05-18T06:28:24.768687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 89
 
1.4%
2 68
 
1.1%
3 62
 
1.0%
5 55
 
0.9%
4 43
 
0.7%
10 42
 
0.7%
8 42
 
0.7%
9 39
 
0.6%
6 37
 
0.6%
12 35
 
0.5%
Other values (3483) 5779
89.6%
(Missing) 160
 
2.5%
ValueCountFrequency (%)
1 89
1.4%
2 68
1.1%
3 62
1.0%
4 43
0.7%
5 55
0.9%
6 37
0.6%
7 31
 
0.5%
8 42
0.7%
9 39
0.6%
10 42
0.7%
ValueCountFrequency (%)
156090 1
< 0.1%
135085 1
< 0.1%
121709 1
< 0.1%
115546 1
< 0.1%
110231 1
< 0.1%
107576 1
< 0.1%
90560 1
< 0.1%
86908 1
< 0.1%
77374 1
< 0.1%
76558 1
< 0.1%

승차건수(현금)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct229
Distinct (%)6.0%
Missing2656
Missing (%)41.2%
Infinite0
Infinite (%)0.0%
Mean30.395257
Minimum1
Maximum1881
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size56.8 KiB
2024-05-18T06:28:25.182534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median11
Q333
95-th percentile119.3
Maximum1881
Range1880
Interquartile range (IQR)29

Descriptive statistics

Standard deviation60.611415
Coefficient of variation (CV)1.9941077
Kurtosis249.50121
Mean30.395257
Median Absolute Deviation (MAD)9
Skewness10.738585
Sum115350
Variance3673.7436
MonotonicityNot monotonic
2024-05-18T06:28:25.616729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 349
 
5.4%
2 309
 
4.8%
3 231
 
3.6%
4 195
 
3.0%
6 158
 
2.4%
5 130
 
2.0%
7 125
 
1.9%
8 110
 
1.7%
9 106
 
1.6%
11 101
 
1.6%
Other values (219) 1981
30.7%
(Missing) 2656
41.2%
ValueCountFrequency (%)
1 349
5.4%
2 309
4.8%
3 231
3.6%
4 195
3.0%
5 130
 
2.0%
6 158
2.4%
7 125
 
1.9%
8 110
 
1.7%
9 106
 
1.6%
10 95
 
1.5%
ValueCountFrequency (%)
1881 1
< 0.1%
691 1
< 0.1%
647 1
< 0.1%
625 1
< 0.1%
601 1
< 0.1%
573 1
< 0.1%
529 1
< 0.1%
516 1
< 0.1%
473 1
< 0.1%
463 2
< 0.1%

일평균승하차건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1050
Distinct (%)16.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean235.13223
Minimum0
Maximum10765
Zeros514
Zeros (%)8.0%
Negative0
Negative (%)0.0%
Memory size56.8 KiB
2024-05-18T06:28:26.042401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q17
median74
Q3262
95-th percentile946
Maximum10765
Range10765
Interquartile range (IQR)255

Descriptive statistics

Standard deviation513.20351
Coefficient of variation (CV)2.1826166
Kurtosis95.904369
Mean235.13223
Median Absolute Deviation (MAD)73
Skewness7.6311013
Sum1516838
Variance263377.84
MonotonicityNot monotonic
2024-05-18T06:28:26.423317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 514
 
8.0%
1 377
 
5.8%
2 248
 
3.8%
3 154
 
2.4%
4 117
 
1.8%
5 95
 
1.5%
6 84
 
1.3%
7 73
 
1.1%
10 60
 
0.9%
9 56
 
0.9%
Other values (1040) 4673
72.4%
ValueCountFrequency (%)
0 514
8.0%
1 377
5.8%
2 248
3.8%
3 154
 
2.4%
4 117
 
1.8%
5 95
 
1.5%
6 84
 
1.3%
7 73
 
1.1%
8 51
 
0.8%
9 56
 
0.9%
ValueCountFrequency (%)
10765 1
< 0.1%
9131 1
< 0.1%
8830 1
< 0.1%
8041 1
< 0.1%
7798 1
< 0.1%
6656 1
< 0.1%
6495 1
< 0.1%
5852 1
< 0.1%
5536 1
< 0.1%
5371 1
< 0.1%

Interactions

2024-05-18T06:28:18.007101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T06:28:06.261560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T06:28:08.360887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T06:28:10.732709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T06:28:12.823572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/