Overview

Dataset statistics

Number of variables11
Number of observations35
Missing cells26
Missing cells (%)6.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory92.8 B

Variable types

Text4
Numeric1
Categorical5
Boolean1

Dataset

Description작품명,설치연도,기관1,기관2,기관3,작품주소,상세주소,작품상세,작품유형
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-21241/S/1/datasetView.do

Alerts

기관1 has constant value ""Constant
기관2 has constant value ""Constant
기관3 has constant value ""Constant
공개여부 has constant value ""Constant
작품유형 has constant value ""Constant
관리코드 has constant value ""Constant
작품상세 has 26 (74.3%) missing valuesMissing

Reproduction

Analysis started2024-07-13 18:08:54.043329
Analysis finished2024-07-13 18:08:55.054271
Duration1.01 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct34
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size412.0 B
2024-07-14T03:08:55.336649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length16
Mean length11.314286
Min length2

Characters and Unicode

Total characters396
Distinct characters172
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)94.3%

Sample

1st row쉘터오브메모리즈
2nd row쉘터오브메모리즈
3rd row남산의 생태
4th row회화적 몽타주
5th row휴식
ValueCountFrequency (%)
5
 
6.0%
사색의 4
 
4.8%
자리 4
 
4.8%
쉘터오브메모리즈 2
 
2.4%
아름다운 2
 
2.4%
변조기 1
 
1.2%
조용하고 1
 
1.2%
가장 1
 
1.2%
수화-세상에서 1
 
1.2%
화분(1-29 1
 
1.2%
Other values (61) 61
73.5%
2024-07-14T03:08:56.126973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48
 
12.1%
0 14
 
3.5%
12
 
3.0%
r 10
 
2.5%
a 10
 
2.5%
- 8
 
2.0%
Z 7
 
1.8%
( 7
 
1.8%
) 7
 
1.8%
8 7
 
1.8%
Other values (162) 266
67.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 235
59.3%
Space Separator 48
 
12.1%
Lowercase Letter 37
 
9.3%
Decimal Number 31
 
7.8%
Uppercase Letter 14
 
3.5%
Dash Punctuation 8
 
2.0%
Other Punctuation 8
 
2.0%
Open Punctuation 7
 
1.8%
Close Punctuation 7
 
1.8%
Math Symbol 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
5.1%
6
 
2.6%
6
 
2.6%
6
 
2.6%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
Other values (126) 179
76.2%
Decimal Number
ValueCountFrequency (%)
0 14
45.2%
8 7
22.6%
1 2
 
6.5%
2 2
 
6.5%
9 1
 
3.2%
3 1
 
3.2%
6 1
 
3.2%
7 1
 
3.2%
5 1
 
3.2%
4 1
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
r 10
27.0%
a 10
27.0%
i 7
18.9%
e 3
 
8.1%
n 2
 
5.4%
p 1
 
2.7%
s 1
 
2.7%
c 1
 
2.7%
d 1
 
2.7%
o 1
 
2.7%
Uppercase Letter
ValueCountFrequency (%)
Z 7
50.0%
T 1
 
7.1%
F 1
 
7.1%
W 1
 
7.1%
O 1
 
7.1%
L 1
 
7.1%
C 1
 
7.1%
B 1
 
7.1%
Other Punctuation
ValueCountFrequency (%)
, 6
75.0%
; 1
 
12.5%
! 1
 
12.5%
Space Separator
ValueCountFrequency (%)
48
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 234
59.1%
Common 110
27.8%
Latin 51
 
12.9%
Han 1
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
5.1%
6
 
2.6%
6
 
2.6%
6
 
2.6%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
Other values (125) 178
76.1%
Common
ValueCountFrequency (%)
48
43.6%
0 14
 
12.7%
- 8
 
7.3%
( 7
 
6.4%
) 7
 
6.4%
8 7
 
6.4%
, 6
 
5.5%
1 2
 
1.8%
2 2
 
1.8%
9 1
 
0.9%
Other values (8) 8
 
7.3%
Latin
ValueCountFrequency (%)
r 10
19.6%
a 10
19.6%
Z 7
13.7%
i 7
13.7%
e 3
 
5.9%
n 2
 
3.9%
p 1
 
2.0%
s 1
 
2.0%
T 1
 
2.0%
c 1
 
2.0%
Other values (8) 8
15.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 234
59.1%
ASCII 161
40.7%
CJK 1
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
48
29.8%
0 14
 
8.7%
r 10
 
6.2%
a 10
 
6.2%
- 8
 
5.0%
Z 7
 
4.3%
( 7
 
4.3%
) 7
 
4.3%
8 7
 
4.3%
i 7
 
4.3%
Other values (26) 36
22.4%
Hangul
ValueCountFrequency (%)
12
 
5.1%
6
 
2.6%
6
 
2.6%
6
 
2.6%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
Other values (125) 178
76.1%
CJK
ValueCountFrequency (%)
1
100.0%

설치연도
Real number (ℝ)

Distinct6
Distinct (%)17.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2008.4
Minimum2007
Maximum2012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size447.0 B
2024-07-14T03:08:56.244559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2007
5-th percentile2007
Q12007
median2007
Q32010
95-th percentile2011.3
Maximum2012
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7354437
Coefficient of variation (CV)0.00086409264
Kurtosis-0.88154533
Mean2008.4
Median Absolute Deviation (MAD)0
Skewness0.8049913
Sum70294
Variance3.0117647
MonotonicityDecreasing
2024-07-14T03:08:56.392857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2007 18
51.4%
2010 5
 
14.3%
2011 4
 
11.4%
2008 4
 
11.4%
2012 2
 
5.7%
2009 2
 
5.7%
ValueCountFrequency (%)
2007 18
51.4%
2008 4
 
11.4%
2009 2
 
5.7%
2010 5
 
14.3%
2011 4
 
11.4%
2012 2
 
5.7%
ValueCountFrequency (%)
2012 2
 
5.7%
2011 4
 
11.4%
2010 5
 
14.3%
2009 2
 
5.7%
2008 4
 
11.4%
2007 18
51.4%

기관1
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
서울시
35 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울시
2nd row서울시
3rd row서울시
4th row서울시
5th row서울시

Common Values

ValueCountFrequency (%)
서울시 35
100.0%

Length

2024-07-14T03:08:56.539410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T03:08:56.647289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울시 35
100.0%

기관2
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
디자인정책관
35 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row디자인정책관
2nd row디자인정책관
3rd row디자인정책관
4th row디자인정책관
5th row디자인정책관

Common Values

ValueCountFrequency (%)
디자인정책관 35
100.0%

Length

2024-07-14T03:08:56.779112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T03:08:56.891860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
디자인정책관 35
100.0%

기관3
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
디자인산업담당관
35 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row디자인산업담당관
2nd row디자인산업담당관
3rd row디자인산업담당관
4th row디자인산업담당관
5th row디자인산업담당관

Common Values

ValueCountFrequency (%)
디자인산업담당관 35
100.0%

Length

2024-07-14T03:08:57.026349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T03:08:57.134777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
디자인산업담당관 35
100.0%
Distinct30
Distinct (%)85.7%
Missing0
Missing (%)0.0%
Memory size412.0 B
2024-07-14T03:08:57.364594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length16.885714
Min length12

Characters and Unicode

Total characters591
Distinct characters68
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)77.1%

Sample

1st row서울시 용산구 용산동6가 11-376
2nd row서울시 용산구 용산동6가 11-375
3rd row서울시 용산구 후암동 406-98
4th row서울시 용산구 후암동 30-84
5th row서울시 용산구 용산동2가 1-1144
ValueCountFrequency (%)
서울시 35
25.2%
종로구 11
 
7.9%
성동구 6
 
4.3%
용산구 6
 
4.3%
중구 4
 
2.9%
서소문동 4
 
2.9%
37-4일대 3
 
2.2%
226 3
 
2.2%
신문로1가 3
 
2.2%
후암동 2
 
1.4%
Other values (56) 62
44.6%
2024-07-14T03:08:57.809285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
104
17.6%
40
 
6.8%
37
 
6.3%
35
 
5.9%
35
 
5.9%
35
 
5.9%
1 32
 
5.4%
- 25
 
4.2%
2 15
 
2.5%
6 15
 
2.5%
Other values (58) 218
36.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 330
55.8%
Decimal Number 132
 
22.3%
Space Separator 104
 
17.6%
Dash Punctuation 25
 
4.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
12.1%
37
 
11.2%
35
 
10.6%
35
 
10.6%
35
 
10.6%
15
 
4.5%
11
 
3.3%
10
 
3.0%
9
 
2.7%
9
 
2.7%
Other values (46) 94
28.5%
Decimal Number
ValueCountFrequency (%)
1 32
24.2%
2 15
11.4%
6 15
11.4%
3 14
10.6%
4 12
 
9.1%
8 12
 
9.1%
5 10
 
7.6%
7 9
 
6.8%
9 7
 
5.3%
0 6
 
4.5%
Space Separator
ValueCountFrequency (%)
104
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 330
55.8%
Common 261
44.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
12.1%
37
 
11.2%
35
 
10.6%
35
 
10.6%
35
 
10.6%
15
 
4.5%
11
 
3.3%
10
 
3.0%
9
 
2.7%
9
 
2.7%
Other values (46) 94
28.5%
Common
ValueCountFrequency (%)
104
39.8%
1 32
 
12.3%
- 25
 
9.6%
2 15
 
5.7%
6 15
 
5.7%
3 14
 
5.4%
4 12
 
4.6%
8 12
 
4.6%
5 10
 
3.8%
7 9
 
3.4%
Other values (2) 13
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 330
55.8%
ASCII 261
44.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
104
39.8%
1 32
 
12.3%
- 25
 
9.6%
2 15
 
5.7%
6 15
 
5.7%
3 14
 
5.4%
4 12
 
4.6%
8 12
 
4.6%
5 10
 
3.8%
7 9
 
3.4%
Other values (2) 13
 
5.0%
Hangul
ValueCountFrequency (%)
40
12.1%
37
 
11.2%
35
 
10.6%
35
 
10.6%
35
 
10.6%
15
 
4.5%
11
 
3.3%
10
 
3.0%
9
 
2.7%
9
 
2.7%
Other values (46) 94
28.5%
Distinct31
Distinct (%)88.6%
Missing0
Missing (%)0.0%
Memory size412.0 B
2024-07-14T03:08:58.171009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length23
Mean length15.857143
Min length4

Characters and Unicode

Total characters555
Distinct characters156
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)80.0%

Sample

1st row국립중앙박물관 입구 건너편 정류장
2nd row국립중앙박물관 입구 정류장
3rd row남산소월길, 후암약수터 버스정류장
4th row남산소월길, 남산도서관 앞 버스정류장
5th row남산소월길, 보성여중고 버스정류장
ValueCountFrequency (%)
8
 
6.6%
버스정류장 6
 
4.9%
4
 
3.3%
3호선 4
 
3.3%
남산소월길 4
 
3.3%
돌담길 3
 
2.5%
입구 3
 
2.5%
덕수궁 3
 
2.5%
옥수역 3
 
2.5%
흥국생명 3
 
2.5%
Other values (69) 81
66.4%
2024-07-14T03:08:58.840703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/