Ozone
·
{mlbench} 패키지에서 제공하는 데이터셋입니다.
·
1976년 로스앤젤레스(Los Angeles) 지역의 오존 오염 데이터
데이터의 기술 통계 정보 확인하기
#
----------------------------------------
# 패키지 설치하기 및 임포트
install.packages("mlbench")
library(mlbench)
# ----------------------------------------
# 데이터의 기술 통계 정보 확인하기
# 데이터를 불러옵니다.
>
data("Ozone")
> data("Ozone", package="mlbench")
|
>
|
#전체 데이터의 타입 확인하기
>
class(Ozone)
[1]
"data.frame"
|
>
|
# 기본으로 6개의 데이터만 출력하여 일부 데이터 확인하기(값, 컬럼 구조등)
>
head(Ozone)
V1 V2
V3 V4 V5 V6 V7 V8 V9
V10 V11 V12 V13
1 1
1 4 3 5480
8 20 NA NA 5000 -15 30.56 200
2 1
2 5 3 5660
6 NA 38 NA NA -14
NA 300
3 1
3 6 3 5710
4 28 40 NA 2693 -25 47.66 250
4 1
4 7 5 5700
3 37 45 NA 590 -24 55.04 100
5 1
5 1 5 5760
3 51 54 45.32 1450 25
57.02 60
6 1
6 2 6 5720
4 69 35 49.64 1568 15
53.78 60
|
>
|
# 데이터 타입 확인
>
str(Ozone)
'data.frame': 366
obs. of 13 variables:
$ V1 : Factor w/ 12 levels
"1","2","3","4",..: 1 1 1 1 1 1 1 1 1
1 ...
$ V2 : Factor w/ 31 levels
"1","2","3","4",..: 1 2 3 4 5 6 7 8 9
10 ...
$ V3 : Factor w/ 7 levels
"1","2","3","4",..: 4 5 6 7 1 2 3 4 5
6 ...
$ V4 : num 3 3 3 5 5 6 4 4 6 7 ...
$ V5 : num 5480 5660 5710 5700 5760 5720 5790 5790
5700 5700 ...
$ V6 : num 8 6 4 3 3 4 6 3 3 3 ...
$ V7 : num 20 NA 28 37 51 69 19 25 73 59 ...
$ V8 : num NA 38 40 45 54 35 45 55 41 44 ...
$ V9 : num NA NA NA NA 45.3 ...
$ V10: num 5000 NA 2693 590 1450 ...
$ V11: num -15 -14 -25 -24 25 15 -33 -28 23 -2 ...
$ V12: num 30.6 NA 47.7 55 57 ...
$ V13: num 200 300 250 100 60 60 100 250 120 120 ...
|
>
|
# 데이터의 컬럼 정보 확인하기
컬럼
|
설명
|
V1
|
월
1 = January, ..., 12 = December
|
V2
|
해당 달의 날짜 (Day of month)
|
V3
|
요일(Day of week)
1 = Monday, ..., 7 = Sunday
|
V4
|
Daily maximum one-hour-average
ozone reading
|
V5
|
500 millibar pressure height
(m) measured at Vandenberg AFB
|
V6
|
Wind speed (mph) at Los Angeles
International Airport (LAX)
|
V7
|
Humidity (%) at LAX
|
V8
|
캘리포니아 Sandburg 에서
측정한 온도
Temperature (degrees F)
measured at Sandburg, CA
|
V9
|
캘리포니아 El Monte에서
측정한 온도
Temperature (degrees F)
measured at El Monte, CA
|
V10
|
Inversion base height (feet) at
LAX
|
V11
|
LAX 에서 캘리포니아 Daggett까지
기압경도(Pressure gradient)
(mm Hg)
|
V12
|
Inversion base temperature
(degrees F) at LAX
|
V13
|
Visibility (miles) measured at
LAX
|
# 데이터에 대한 기초 통계량(요약 정보)를 확인합니다.
>
summary(Ozone)
V1 V2 V3 V4 V5 V6 V7
1
: 31 1 : 12
1:52 Min. : 1.00
Min. :5320 Min.
: 0.000 Min. :19.00
3
: 31 2 : 12
2:52 1st Qu.: 5.00 1st Qu.:5700 1st Qu.: 3.000 1st Qu.:49.00
5
: 31 3 : 12
3:52 Median : 9.00 Median :5770 Median : 5.000 Median :65.00
7
: 31 4 : 12
4:53 Mean :11.53
Mean :5753 Mean
: 4.869 Mean :58.48
8
: 31 5 : 12
5:53 3rd Qu.:16.00 3rd Qu.:5830 3rd Qu.: 6.000 3rd Qu.:73.00
10
: 31 6 : 12
6:52 Max. :38.00
Max. :5950 Max.
:11.000 Max. :93.00
(Other):180 (Other):294 7:52
NA's :5 NA's
:12
NA's :15
V8 V9 V10 V11 V12 V13
Min.
:25.00 Min. :27.68
Min. : 111 Min.
:-69.0 Min. :27.50
Min. : 0.0
1st Qu.:51.00 1st Qu.:49.73 1st Qu.: 890 1st Qu.:-10.0 1st Qu.:51.26 1st Qu.: 70.0
Median :62.00 Median :57.02 Median :2125 Median : 24.0 Median :62.24 Median :110.0
Mean
:61.91 Mean :56.85
Mean :2591 Mean
: 17.8 Mean :60.93
Mean :123.3
3rd Qu.:72.00 3rd Qu.:66.11 3rd Qu.:5000 3rd Qu.: 45.0 3rd Qu.:70.52 3rd Qu.:150.0
Max.
:93.00 Max. :82.58
Max. :5000 Max.
:107.0 Max. :91.76
Max. :500.0
NA's
:2 NA's :139
NA's :15 NA's
:1 NA's :14
|
>
|
# 전체 데이터
> Ozone
V1 V2 V3 V4 V5 V6 V7 V8 V9
V10 V11 V12 V13
1 1
1 4 3 5480
8 20 NA NA 5000 -15 30.56 200
2 1
2 5 3 5660
6 NA 38 NA NA -14
NA 300
3 1
3 6 3 5710
4 28 40 NA 2693 -25 47.66 250
4 1
4 7 5 5700
3 37 45 NA 590 -24 55.04 100
5 1
5 1 5 5760
3 51 54 45.32 1450 25 57.02 60
6 1
6 2 6 5720
4 69 35 49.64 1568 15
53.78 60
7 1
7 3 4 5790
6 19 45 46.40 2631 -33 54.14 100
8 1
8 4 4 5790
3 25 55 52.70 554 -28 64.76 250
9 1
9 5 6 5700
3 73 41 48.02 2083 23 52.52 120
10 1 10 6 7
5700 3 59 44 NA 2654
-2 48.38 120
11 1 11 7 4
5770 8 27 54 NA 5000 -19 48.56 120
12 1 12 1 6
5720 3 44 51 54.32 111
9 63.14 150
13 1 13 2 5
5760 6 33 51 57.56 492 -44 64.58 40
14 1 14 3 4
5780 6 19 54 56.12 5000 -44 56.30 200
15 1 15 4 4
5830 3 19 58 62.24 1249 -53 75.74 250
16 1 16 5 7
5870 2 19 61 64.94 5000 -67 65.48 200
17 1 17 6 5
5840 5 19 64 NA 5000 -40 63.32 200
18 1 18 7 9
5780 4 59 67 NA
639 1 66.02 150
19 1 19 1 4
5680 5 73 52 56.48 393 -68 69.80 10
20 1 20 2 3
5720 4 19 54 NA 5000 -66 54.68 140
21 1 21 3 4
5760 3 19 54 53.60 5000 -58 51.98 250
22 1 22 4 4
5730 4 26 58 52.70 5000 -26 51.98 200
23 1 23 5 5
5700 5 59 69 51.08 3044 18 52.88 150
24 1 24 6 6
5650 5 70 51 NA 3641
23 47.66 140
25 1 25 7 9
5680 3 64 53 NA
111 -10 59.54 50
26 1 26 1 5
5780 3 NA 56 53.60 692 -25 67.10 0
27 1 27 2 6
5820 5 19 59 59.36 597 -52 70.52 70
28 1 28 3 6
5830 4 NA 59 60.08 NA -44
NA 150
29 1 29 4 6
5810 5 19 64 56.66 1791 -15 64.76 150
30 1 30 5 11 5790
3 28 63 57.38 793 -15 65.84 120
31 1 31 6 10 5800
2 32 63 NA 531 -38 75.92 40
32 2
1 7 7 5820
5 19 62 NA 419 -29 75.74 120
33 2
2 1 12 5770 8 76 63 57.20 816
-7 66.20 6
34 2
3 2 9 5670
3 69 54 45.50 3651 62
49.10 30
35 2
4 3 2 5590
3 76 36 37.40 5000 70 37.94 100
36 2
5 4 3 5410
6 64 31 32.18 5000 28 32.36 200
37 2
6 5 3 5350
7 62 30 32.54 1341 18
45.86 60
38 2
7 6 2 5480
9 72 36 NA 5000 0 38.66 350
39 2
8 7 3 5600
7 76 42 NA 3799 -18 45.86 250
40 2
9 1 3 5490 11 72 37 38.48 5000 32 38.12 350
41 2 10 2 4
5560 10 72 41 40.46 5000 -1 37.58 300
42 2 11 3 6
5700 3 32 46 NA 5000 -30 45.86 300
43 2 12 4 8
5680 5 50 51 47.12 5000 -8 45.50 300
44 2 13 5 6
5700 4 86 55 49.28 2398 21 53.78 200
45 2 14 6 4
5650 5 61 41 NA 5000
51 36.32 100
46 2 15 7 3
5610 5 62 41 NA 4281
42 41.36 250
47 2 16 1 7
5730 5 66 49 NA 1161
27 52.88 200
48 2 17 2 11 5770
5 68 45 52.88 2778 2 55.76 200
49 2 18 3 13 5770
3 82 55 55.40 442 26 58.28
40
50 2 19 4 4
5700 5 NA 45 38.12 NA
82 NA 2
51 2 20 5 6
5690 8 21 41 43.88 5000 -30 42.26 300
52 2 21 6 5
5700 3 19 45 NA 5000 -53 43.88 300
53 2 22 7 4
5730 11 19 51 NA 5000 -43 49.10 300
54 2 23 1 4
5690 7 19 53 50.18 5000 7 49.10 300
55 2 24 2 6
5640 5 68 50 37.40 5000 24 42.08 300
56 2 25 3 10 5720
6 63 60 53.06 1341 19 59.18 150
57 2 26
4 15 5740 3 54 54 56.48
1318 2 64.58 150
58 2 27 5 23 5740
3 47 53 58.82 885 -4 67.10
80
59 2 28 6 17 5740
3 56 53 NA 360
3 67.10 40
60 2 29 7 7
5670 7 61 44 NA 3497
73 49.46 40
61 3
1 1 2 5550 10 74 40 38.84 5000 73 40.10
80
62 3
2 2 3 5470
7 46 30 29.66 5000 44 29.30 300
63 3
3 3 3 5320 11 45 25 27.68 5000 39 27.50 200
64 3
4 4 5
NA 8 33 39 30.20 5000 15 30.02 500
65 3
5 5 4 5530
3 43 40 36.14 5000 -12 33.62 140
66 3
6 6 6 5600 3 21 45
NA 5000 -2 39.02 140
67 3
7 7 7 5660
7 57 51 NA 5000 30 42.08 140
68 3
8 1 7 5580
5 42 48 40.64 3608 24 39.38 100
69 3
9 2 6 5510
5 50 45 36.86 5000 38 32.90 140
70 3 10 3 3
5530 5 61 47 33.80 5000 56 35.60 200
71 3 11 4 2
5620 9 61 43 37.04 5000 66 34.34 120
72 3 12 5 8
5690 0 60 49 46.04 613 -27 59.72 300
73 3 13 6 12 5760
4 31 56 NA 334
-9 64.40 300
74 3 14 7 12 5740
3 66 53 NA 567
13 61.88 150
75 3 15 1 16 5780
5 53 61 57.92 488 -20
64.94 2
76 3 16 2 9
5790 2 42 63 57.02 531 -15 71.06 50
[ reached
getOption("max.print") -- omitted 290 rows ]
|
>
|
# 해당 데이터셋의 상세 정보 확인