Overview

Dataset statistics

Number of variables8
Number of observations25
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory72.3 B

Variable types

Categorical6
Boolean1
Numeric1

Dataset

Description부산광역시_지방세납세자현황_20201231
Author부산광역시
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15079372

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
과세년도 has constant value ""Constant

Reproduction

Analysis started2023-12-10 16:38:22.004817
Analysis finished2023-12-10 16:38:22.974288
Duration0.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
부산광역시
25 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 25
100.0%

Length

2023-12-11T01:38:23.124146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:38:23.258779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 25
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
부산광역시
25 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 25
100.0%

Length

2023-12-11T01:38:23.409003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:38:23.553061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 25
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
26000
25 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row26000
2nd row26000
3rd row26000
4th row26000
5th row26000

Common Values

ValueCountFrequency (%)
26000 25
100.0%

Length

2023-12-11T01:38:23.705638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:38:23.872268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
26000 25
100.0%

과세년도
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2020
25 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 25
100.0%

Length

2023-12-11T01:38:24.039995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:38:24.192855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 25
100.0%

세목명
Categorical

Distinct8
Distinct (%)32.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
지방소득세
주민세
취득세
자동차세
담배소비세
Other values (3)

Length

Max length5
Median length4
Mean length4.12
Min length3

Unique

Unique2 ?
Unique (%)8.0%

Sample

1st row등록면허세
2nd row등록면허세
3rd row등록면허세
4th row지방소득세
5th row지방소득세

Common Values

ValueCountFrequency (%)
지방소득세 4
16.0%
주민세 4
16.0%
취득세 4
16.0%
자동차세 4
16.0%
담배소비세 4
16.0%
등록면허세 3
12.0%
지방소비세 1
 
4.0%
등록세 1
 
4.0%

Length

2023-12-11T01:38:24.397088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:38:24.614049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지방소득세 4
16.0%
주민세 4
16.0%
취득세 4
16.0%
자동차세 4
16.0%
담배소비세 4
16.0%
등록면허세 3
12.0%
지방소비세 1
 
4.0%
등록세 1
 
4.0%

납세자유형
Categorical

Distinct2
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
개인
13 
법인
12 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인
3rd row법인
4th row개인
5th row개인

Common Values

ValueCountFrequency (%)
개인 13
52.0%
법인 12
48.0%

Length

2023-12-11T01:38:24.823941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:38:25.008751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 13
52.0%
법인 12
48.0%
Distinct2
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size157.0 B
True
14 
False
11 
ValueCountFrequency (%)
True 14
56.0%
False 11
44.0%
2023-12-11T01:38:25.145203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

납세자수
Real number (ℝ)

Distinct22
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7618.32
Minimum1
Maximum142511
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-11T01:38:25.310432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.2
Q19
median52
Q3167
95-th percentile31039.4
Maximum142511
Range142510
Interquartile range (IQR)158

Descriptive statistics

Standard deviation29069.718
Coefficient of variation (CV)3.8157649
Kurtosis21.461731
Mean7618.32
Median Absolute Deviation (MAD)44
Skewness4.554495
Sum190458
Variance8.450485 × 108
MonotonicityNot monotonic
2023-12-11T01:38:25.482814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
1 2
 
8.0%
8 2
 
8.0%
9 2
 
8.0%
142511 1
 
4.0%
6 1
 
4.0%
89 1
 
4.0%
91 1
 
4.0%
52 1
 
4.0%
20 1
 
4.0%
167 1
 
4.0%
Other values (12) 12
48.0%
ValueCountFrequency (%)
1 2
8.0%
2 1
4.0%
6 1
4.0%
8 2
8.0%
9 2
8.0%
12 1
4.0%
20 1
4.0%
28 1
4.0%
31 1
4.0%
52 1
4.0%
ValueCountFrequency (%)
142511 1
4.0%
36885 1
4.0%
7657 1
4.0%
1781 1
4.0%
540 1
4.0%
269 1
4.0%
167 1
4.0%
148 1
4.0%
91 1
4.0%
89 1
4.0%

Interactions

2023-12-11T01:38:22.292068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:38:25.630495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명납세자유형관내_관외납세자수
세목명1.0000.0000.0000.000
납세자유형0.0001.0000.0000.010
관내_관외0.0000.0001.0000.031
납세자수0.0000.0100.0311.000
2023-12-11T01:38:25.766440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
납세자유형세목명관내_관외
납세자유형1.0000.0000.000
세목명0.0001.0000.000
관내_관외0.0000.0001.000
2023-12-11T01:38:25.882260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
납세자수세목명납세자유형관내_관외
납세자수1.0000.0000.0000.000
세목명0.0001.0000.0000.000
납세자유형0.0000.0001.0000.000
관내_관외0.0000.0000.0001.000

Missing values

2023-12-11T01:38:22.546224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:38:22.816757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명납세자유형관내_관외납세자수
0부산광역시부산광역시260002020등록면허세개인N1
1부산광역시부산광역시260002020등록면허세개인Y12
2부산광역시부산광역시260002020등록면허세법인Y8
3부산광역시부산광역시260002020지방소득세개인N31
4부산광역시부산광역시260002020지방소득세개인Y540
5부산광역시부산광역시260002020지방소득세법인N66
6부산광역시부산광역시260002020지방소득세법인Y148
7부산광역시부산광역시260002020지방소비세법인Y1
8부산광역시부산광역시260002020등록세개인Y2
9부산광역시부산광역시260002020주민세개인N9
시도명시군구명자치단체코드과세년도세목명납세자유형관내_관외납세자수
15부산광역시부산광역시260002020취득세법인N1781
16부산광역시부산광역시260002020취득세법인Y7657
17부산광역시부산광역시260002020자동차세개인N9
18부산광역시부산광역시260002020자동차세개인Y167
19부산광역시부산광역시260002020자동차세법인N20
20부산광역시부산광역시260002020자동차세법인Y52
21부산광역시부산광역시260002020담배소비세개인N91
22부산광역시부산광역시260002020담배소비세개인Y89
23부산광역시부산광역시260002020담배소비세법인N6
24부산광역시부산광역시260002020담배소비세법인Y8