FORECAST OF SOLAR PROTON EVENTS WITH NOAA SCALES BASED ON SOLAR X-RAY FLARE DATA USING NEURAL NETWORK

Journal of The Korean Astronomical Society.
2014.
Dec,
47(6):
209-214

- Received : May 01, 2014
- Accepted : August 08, 2014
- Published : December 31, 2014

Download

PDF

e-PUB

PubReader

PPT

Export by style

Article

Metrics

Cited by

TagCloud

In this study we develop a set of solar proton event (SPE) forecast models with NOAA scales by Multi Layer Perceptron (MLP), one of neural network methods, using GOES solar X-ray flare data from 1976 to 2011. Our MLP models are the first attempt to forecast the SPE scales by the neural network method. The combinations of X-ray flare class, impulsive time, and location are used for input data. For this study we make a number of trials by changing the number of layers and nodes as well as combinations of the input data. To find the best model, we use the summation of F-scores weighted by SPE scales, where F-score is the harmonic mean of PODy (recall) and precision (positive predictive value), in order to minimize both misses and false alarms. We find that the MLP models are much better than the multiple linear regression model and one layer MLP model gives the best result.
^{−2}
sec
^{−1}
ster
^{−1}
), is one of the strongest solar activities. The SPE is very significant for space weather in that it can seriously affect hardware in satellites and astronauts. It has been well known that SPEs are very intimately associated with solar activities such as solar flares and coronal mass ejections (CMEs). There have been many studies on the relationship between SPE and X-ray flares; e.g., shock acceleration of electrons and ions in solar flares (
Ellison et al. 1985
), proton events and X-ray flares in the last three solar cycles (
Belov et al. 2005
), and a study of solar energetic particle events of 1997-2006 (
Cane et al. 2010
).
Machine learning algorithms such as support vector machine and neural network have been used for the study of solar activities (e.g.,
Colak & Qahwaji 2009
;
Al-Omari et al. 2010
;
Choi et al. 2012
). The algorithms can learn systems from historical data. Therefore, the algorithms make it possible to construct a forecast model even though the relationship between cause and effect is not clearly understood. Neural networks have been broadly used in space weather applications such as the prediction of solar activities (
Wang 2000
;
Gong et al. 2004
; Wang et al. 2008; Qahwaji et al. 2007, 2008;
Henwood et al. 2010
) and geomagnetic activities (
Lundstedt 1992
;
Hernandez et al. 1993
;
Freeman et al. 1993
;
Valach et al. 2009
;
Ji et al. 2013
).
SPE prediction models have been developed by neural networks (
Wang 2000
;
Gong et al. 2004
;
Cui et al. 2012
), which are mostly applied to the prediction of SPE occurrence (Yes or No of SPE events). The models used several kinds of input data such as sunspot properties from white light and magnetogram observations, flare, and radio burst. While two models (
Gong et al. 2004
;
Cui et al. 2012
) can predict “event” or “no event”,
Wang (2000)
suggested a two-pass model using by both a yes/no event prediction and forecast three levels of SPE events.
In this study we make the first attempt to forecast SPE scales by neural network. We develop a set of SPE forecast models that predicts SPE scales, which represents the SPE flux according to NOAA space weather scales (S), by multi linear regression (MLR) and multi layer perceptron (MLP) methods using X-ray flare information. The MLP is a feedforward artificial neural network model, which is to classify complex patterns. It has been used for an automatic solar flare detection (
Qu et al. 2003
), a prediction of interplanetary shock arrival time (
Turchenko et al. 2007
), and a prediction of ozone levels (
Wang et al. 2006
).
This work would be a part of a two stage of SPE forecast models: SPE probability (
Park et al. 2010
) for the first stage and SPE scale, which represents the SPE flux according to NOAA space weather scales (S), for the second stage. The first stage is based on
Park et al. (2010)
who developed a SPE prediction model using X-ray flare information based on the dependence of SPEs on their associated flares. In this study, as the second stage, we develop a forecast model to predict SPE scales using the neural network method.
We use X-ray flare class, location (longitude, latitude), and impulsive time (flare peak time - flare start time) as input data. We use SPE scales (S1-S4), which is provided by NOAA space weather prediction center, as output data. We construct the forecast models depending on the number of layers and nodes of the MLP as well as combinations of input data. For the evaluation of the models, we use the F-score, which is the harmonic mean of PODy defined by recall and precision, to find the best models.
This paper is organized as follows. Section 2 describes data, MLR and MLP methods, and statistical verification method. In Section 3, we evaluate and compare 1 MLR model and 4 MLP models. In Section 4, we give a brief summary and conclusion.
^{−5}
W/m
^{2}
). Then we select 175 SPEs which are associated with flares whose peak/start times and locations are available. From the GOES X-ray flare list
2
provided by NOAA National Geophysical Data Center, we use X-ray flare class, its location, and its impulsive time as input data. The impulsive time is from the flare start to peak time.
Table 1
shows the selected numbers of SPEs for this study from April 1976 to September 2011. We separate the data into two sets for training data from 1976 to 1996 and test data from 1997 to 2011. We preprocess them so that the program can recognize the characteristics of the data easily before we apply the MLR and MLP programs to the data set. The flare locations are given from 0 (limb) to 90 (center of the Sun). We number X-ray class with the following two cases: case 1) 3 for all strengths of M class and 4 for all strengths of X class and case 2) an addition of 10 to the strength of X-class flare. For case 2, M5 is 5 and X7 is 17. As output data, we use SPE scales.
Table 2
shows the input and output data.
Number of SPE events used in data selection
^{a} i : intensity of X-ray flare ^{b} flux: flux in pfu (number of particles cm^{−2} sec^{−1} ster^{−1}) ^{c} No S5 (flux ≥ 10^{5}) event during the period used in this analysis
Here
Y_{i}
is a dependent variable,
X_{ki}
is an independent variable,
A_{k}
is k-dimensional parameter, and
b
is a constant term, where
i
is an event number. In this analysis,
X_{ki}
is the input data, which are X-ray class, flare location, and impulsive time. And
Y_{i}
is the SPE scale. To construct the MLR models, first, we find several models with the combinations of input data from the training data set. Second, we obtain output data from the input of test data set.
Flow chart of the development of SPE forecast models based on MLP
The MLP is a feedforward artificial neural network model. We use OpenCV
4
that provides the MLP functions for train and prediction. We train the preprocessed training data with changing the number of layers and nodes. We use a grid-search from 2 to 20 nodes for each layers. In addition, we use a few kernel functions and propagation methods. We use gaussian and sigmoid kernel functions provided by OpenCV for the models. Once after we train the data set, we predict the SPE scale using the test data set by the models. Then we rescale the output obtained from the MLR and MLP models as follows: S1: 0≤output~~
~~_{min}
+△S, S2: S
_{min}
+△S≤output~~
~~_{min}
+2×△S, S3: S
_{min}
+2×△S ≤output~~
~~_{min}
+3×△S, and S4: S
_{min}
+3×△S ≤output≤ S
_{max}
. Here △S is defined as (S
_{max}
−S
_{min}
)/4. S
_{min}
and S
_{max}
are the minimum and maximum values of output obtained by each model, respectively. Finally, we compare the output data with the observations for the test data set.
a
” is the number of event forecasts that corresponds to the event observations (i.e., the number of hits); the “
b
” is the number of event forecasts that do not correspond to the observed events (i.e., the number of false alarms); the “
c
” is the number of no-event forecasts corresponding to the observed events (i.e., the number of misses). As an example for the S1, the ‘a’ indicates the number of S1 forecasts that corresponds to the S1 observations. ‘b’ indicates the number of S1 forecasts that corresponds to the S2-S4 observations.
a = hit, b = false alarm, c = miss
In
Table 4
, we summarize various statistical parameters based on the contingency table. PODy is the proportions of hits. FAR is the proportion of incorrect prediction. CSI is the proportion of correct predictions that are either predicted or observed. The PODy is a good verification parameter when we focus on the misses. The CSI considers both misses and false alarms. It is hard to say which parameter is more important in space weather forecast. Therefore, we use a verification parameter, a balanced F-score, which is the harmonic mean of precision and recall. The precision is the probability of observation of ‘Yes’ predictions (see
Table 4
) and the recall is the same with the PODy. The balanced F-score is given by
Statistical verification parameters from the contingency table
Since the prediction of stronger SPE scales is much more important than that of weaker ones, we use a parameter, summation of F-scores weighted by SPE scale (SF), to see an overall performance of the model for all SPE scales. The SF is given by
where k is the SPE scale.
^{a}Input: X (X-ray Flare Class), T (Impulsive Time), L (Flare Location) ^{b}X-ray flux : case 1 (All M class: 3, All X class: 4), case 2 (ex. M5 = 5, X7 = 17)
Table 6
shows the statistical verification parameters of each model with SPE scales. The MLR model hardly predicts the stronger SPEs than S1. It may come from that the regression method find a model that is more optimized for weaker SPEs since the number of weaker SPEs are much larger than that of stronger SPEs.
Statistical verification parameters of all five models with SPE scales
We compare the F-scores of all five models with SPE scales in
Figure 2
. The F-scores of MLP models are much larger than those of the MLR model for the events stronger than S1. In the comparison of the MLP models, the F-score values of MLP1L6 (purple) and MLP2L3×3 (blue) for S4 are larger than those of the other models while the F-scores of MLP2L4×12 (red) and MLP2L7×17 model (orange) for S2 are larger than those of the other models. Especially, MLP1L6 model (purple in
Figure 2
) has the highest value of F-score, 0.57 for S4. To compare an overall performance of these models, we use the summation of F-scores (SF, Equation (2)) weighted by SPE scale since stronger events are more important in terms of space weather. As a result, we find that the MLP1L6 model has the highest value of SF, 4.37, which is larger than those of the other MLP models. Main advantage of this model is to well predict S4 events, which are extremely strong ones. The values of PODy and CSI also show similar patterns with the F-score values in
Table 6
. Therefore, we suggest the MLP1L6 model, which has one hidden layer with 6 nodes, for the prediction of SPE scales according to the highest SF.
Comparison of the F scores of all five models with SPE scales
It is not easy to compare our model with previous other models in that other neural network models mostly predict ‘yes’ or ‘no’ of SPE event (
Gong et al. 2004
;
Cui et al. 2012
).
Wang (2000)
used a neural network method for the forecast of three levels of SPE flux. Since his classification are different from our classification and the number of events that he used is too small (14 events), it is difficult to compare his results with ours. On the other hand, several studies to predict SPE flux have been made based on the relationship between SPEs and X-ray flares (
Garcia 1994
;
Balch 2008
;
Park et al. 2010
). We compute the SF value of
Park et al.(2010)
and find that the value (2.33) is much smaller than those of our neural network models and a little smaller than that of the MLR method.
One may raise a question why MLP1L6 model gives best scores than other models for strong solar proton events and MLP2L4 12 and MLP2L7 17 give best results for weak solar proton events. The neural network algorithms learn systems from historical data, which is useful to construct a forecast model even though the physical relationship between cause and effect is not clearly understood. Therefore, it is hard to find any physical understanding from the neural network models. We examine the dependence of F-scores of the models on the number of nodes and layers. We find that the F-score of a model decreases as the number of layers or nodes. This fact can be understood by the fact that the number of strong solar proton events is much smaller than that of weak solar ones.
1 Solar proton events affecting the Earth environment: http://www.swpc.noaa.gov/ftpdir/indices/SPE.txt
2 ftp://ftp.ngdc.noaa.gov/STP/SOLAR_DATA/SOLAR_FLARES/FLARES_XRAY
3 http://commons.apache.org/math
4 http://opencv.org

1. INTRODUCTION

Solar proton event (SPE), which is defined as solar energetic particle events having a flux of >10 MeV protons greater than or equal to 10 particle flux unit (pfu, number of particles cm
2. DATA AND METHODS

- 2.1. Data

We use SPE scales in the solar proton event list
1
provided by Space Wether Prediction Center of National Oceanic and Atmospheric Administration (NOAA). There are 246 SPEs from April 1976 to September 2011. We select 203 SPEs associated with X-ray flares, which are stronger or equal to M class (≥10
Number of SPE events used in data selection

PPT Slide

Lager Image

Input and output data

PPT Slide

Lager Image

- 2.2. MLR and MLP

We apply MLR and MLP algorithms to solar flare data for the prediction of SPE scales. We use Apache Common Math library
3
to preprocess the input data. It is an open library that can be efficiently used in scientific computing. Then we apply the MLR algorithm to the input and output data. The MLR is one of regression analysis, which is the most commonly used techniques in statistics. It can be used to fit a predictive model to an observed data set of values. In this analysis, the MLR equation is expressed as
PPT Slide

Lager Image

PPT Slide

Lager Image

- 2.3. Statistical Verification

We evaluate the performance of the models using statistical parameters. In
Table 3
, the “
Contingency table between predictions and observations

PPT Slide

Lager Image

PPT Slide

Lager Image

Statistical verification parameters from the contingency table

PPT Slide

Lager Image

PPT Slide

Lager Image

3. RESULTS AND DISCUSSION

We show one MLR and 4 MLP models in
Table 5
. The MLR model shown in the
Table 5
has the highest SF among MLR models with the combinations of input data. This model uses X (X-ray flare class) and T (impulsive time) with case 1 as its input data. In the case of MLP models, we find that a gaussian function is the best kernel since it shows higher PODy and lower FAR. With the changes of the number of layers and nodes as well as the combinations of the input data, we select 4 MLP models whose SF values are larger than 2.5 in
Table 5
. The selected MLP models show that XTL combination (X-ray flare class, impulsive time, and location) with case 2 produces higher SF values than other combinations.
Kernel functions and parameters of MLP models

PPT Slide

Lager Image

Statistical verification parameters of all five models with SPE scales

PPT Slide

Lager Image

PPT Slide

Lager Image

4. SUMMARY AND CONCLUSION

We develop the SPE scale forecast models based on solar X-ray flare information (flare class, source location, impulsive time) using the MLR and MLP methods. The models can be used to predict the SPE scale once after a strong flare (M or X class) occurs. For the evaluation of the models, we use several statistical verification parameters such as PODy, CSI, and F-score. To compare an overall performance of the models, we used the SF value, the summation of F-scores weighted by SPE scales.
We find that the statistical verification parameters of the MLP models are much better than the MLR model. A comparison of the SF values shows that the MLP1L6 model of one layer and 6 nodes gives the best result. It is noted that our MLP models are the first attempt to forecast the SPE scales by the neural network method.
As explained in Section 1, we suggest that our model can be a part of a two-stage SPE forecast model: SPE probability forecast (
Park et al. 2010
;
Park et al. 2012
) for the first stage and SPE scale forecast (this study) for the second stage. We expect that our models can be also used for the operation of an automatic SPE forecast.
Several studies have shown that SPEs are closely related with CMEs (
Kahler et al. 1984
;
Park et al. 2012
;
Huang et al. 2012
). We plan to develop a model based on both flare and CME parameters as its input data. We will also consider other input data such as hard X-ray fluxes (
Garcia 2004
), magnetic properties(
Cui et al. 2012
), and other solar activities (
Wang 2000
;
Gong et al. 2004
).
Acknowledgements

This work was supported by the BK21 plus program through the National Research Foundation (NRF) funded by the Ministry of Education of Korea, Basic Science Research Program through the NRF funded by the Ministry of Education (2013R1A1A2058409, NRF-2013R1A1A2012763), and NRF of Korea Grant funded by the Korean Government (NRF-2013M1A3A3A02042232).

Al-Omari M.
,
Qahwaji R.
,
Colak T.
,
Ipson S.
2010
Machine Learning-Based Investigation of the Associations between CMEs and Filaments
Solar Physics
262
511 -
** DOI : 10.1007/s11207-010-9516-5**

Balch C. C.
2008
Updated Verification of the Space Weather Prediction Center's Solar Energetic Particle Prediction Model
Space Weather
6
S01001 -
** DOI : 10.1029/2007SW000337**

Belov A.
,
Garcia H.
,
Kurt V.
,
Mavromichalaki E.
2005
Proton Events and X-Ray Flares in the Last Three Solar Cycles
Cosmic Research
43
165 -
** DOI : 10.1007/s10604-005-0031-7**

Cane H. V.
,
Richardson G.
,
von Rosenvinge T. T.
2010
A Study of Solar Energetic Particle Events of 1997-2006: Their Composition and Associations
JGR
115
A08101 -

Ellison D. C.
,
Ramaty R.
1985
Shock Acceleration of Electrons and Ions in Solar Flares
ApJ
298
400 -
** DOI : 10.1086/163623**

Choi S. H.
,
Moon Y.-J.
,
Vien N. A.
,
Park Y. D.
2012
Application of Support Vector Machine to the Prediction of Geo-Dffective Halo CMEs
JKAS
43
125 -

Colak T.
,
Qahwaji R.
2009
Automated Solar Activity Prediction: A Hybrid Computer Platform Using Machine Learning and Solar Imaging for Automated Prediction of Solar Flares
Space Weather
7
S06001 -

Cui Y.-m.
,
Li R.
,
Liu S.-q.
2012
Short-Term Forecast of Solar Proton Events With Characteristic Physical Quantities of Photospheric Magnetic Fields
Chinese Astronomy and Astrophysics
36
169 -
** DOI : 10.1016/j.chinastron.2012.04.004**

Freeman J.
,
Nagai A.
,
Reiff P.
1993
The Use of Neural Networks to Predict Magnetospheric Parameters for Input to a Magnetospheric Forecast Model
Proceedings of the International Workshop on Artificial Intelligence Applications in Solar-Terrestrial Physics
Lund, Sweden
Sep. 22-24, 1993
167 -

Garcia H. A.
1994
Temperature and Hard X-Ray Signatures for Energetic Proton Events
ApJ
420
422 -
** DOI : 10.1086/173572**

Garcia H. A.
2004
Forecasting Methods for Occurrence and Magnitude of Proton Storms with Solar Hard X Rays
Space Weather
2
S06003 -

Gong J.-c.
,
Xue B.-s.
,
Liu S.-q.
,
Zou Z.-m.
,
Miao J.
,
Wang J.-l.
2004
Short-Term Prediction of Solar Proton Events by Neural Network Method
Chinese Astronomy and Astrophysics
28
174 -
** DOI : 10.1016/S0275-1062(04)90021-5**

Henwood R.
,
Chapman S. C.
,
Willis D. M.
2010
Increasing Lifetime of Recurrent Sunspot Groups Within the Greenwich Photoheliographic Results
Solar Physics
262
299 -
** DOI : 10.1007/s11207-009-9419-5**

Hernandez J. V.
,
Tajima T.
,
Horton W
1993
Neural Network Forecasting Geomagnetic Activity
Geophys. Res. Lett
20
2707 -
** DOI : 10.1029/93GL02848**

Huang X.
,
Wang H.-N.
,
Li L.-P.
2012
Ensemble Prediction Model of Solar Proton Events Associated with Solar Flares and Coronal Mass Ejections
Research in Astron-omy and Astrophysics
12
313 -
** DOI : 10.1088/1674-4527/12/3/007**

Ji E.-Y.
,
Moon Y.-J.
,
Park J.
,
Lee D.-H.
2013
Com-parison of Neural Network and Support Vector Machine Methods for Kp Forecasting
JGR
118
5109 -

Kahler S.W.
,
Sheeley N. R.
,
Howard R. A.
,
Michels D. J.
,
Koomen M. J.
,
McGuire R. E.
,
von Rosenvinge T. T.
,
Reames D. V.
1984
Associations between Coronal Mass Ejections and Solar Energetic Proton Events
JGR
89
9683 -
** DOI : 10.1029/JA089iA11p09683**

Lundstedt H.
1992
Neural Networks and Predictions of Solar-Terrestrial Effects
Planet. Space Sci.
40
457 -
** DOI : 10.1016/0032-0633(92)90164-J**

Park J.
,
Moon Y.-J.
,
Lee D.-H.
,
Youn S.
2010
Depen-dence of Solar Proton Events on Their Associated Activities: Flare Parameters
JGR
115
(A10)
A10105 -
** DOI : 10.1029/2010JA015330**

Park J.
,
Moon Y.-J.
,
Gopalswamy N.
2012
Depen-dence of Solar Proton Events on Their Associated Activities: Coronal Mass Ejection Parameters
JGR
117
(A8)
A08108 -

Qahwaji R.
,
Colak T.
2007
Automated Short-Solar Flare Prediction Using Machine Learning and Sunspot Associations
Solar Physics
241
195 -
** DOI : 10.1007/s11207-006-0272-5**

Qahwaji R.
,
Colak T.
,
Al-Omari M.
,
Ipson S.
2008
Automated Prediction of CMEs Using Machine Learning of CME-Flare Associations
Solar Physics
248
471 -
** DOI : 10.1007/s11207-007-9108-1**

Qu M.
,
Shih F. Y.
,
Jing J.
,
Wang H.
2003
Automated Solar Flare Detection Using MLP, RBF, and SVM
Solar Physics
217
157 -
** DOI : 10.1023/A:1027388729489**

Turchenko V.
,
Ternopil N.
,
Ternopil D. V.
,
Sachenko A.
2007
Interplanetary Shock Arrival Time Prediction Using Multi-Layer Perceptron
Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications
185 -

Valach F.
,
Revallo M.
,
Bochníček J.
,
Hejda P.
2009
Solar Energetic Particle Flux Enhancement as a Predictor of Geomagnetic Activity in a Neural Network-Based model
Space Weather
7
S04004 -

Wang D.
,
Lu W. Z.
2006
Forecasting Ozone Levels and Analyzing Their Dynamics by a Bayesian Multilayer Perceptron Model for Two Air-Monitoring Sites in Hong Kong
Human and Ecological Risk Assecement
12
313 -
** DOI : 10.1080/10807030500533725**

Wang H. N.
,
Cui Y. M.
,
Li L. Y.
,
Zhang H. H.
2008
Solar Flare Forecasting Model Supported with Artificial Neural Network Techniques
Advances in Space Research
42
1464 -
** DOI : 10.1016/j.asr.2007.06.070**

Wang J.-l.
2000
A Two-Pass Neural Network Model for Solar Proton Event Alert
Chinese Astronomy and As-trophysics
24
10 -
** DOI : 10.1016/S0275-1062(00)00020-5**

Citing 'FORECAST OF SOLAR PROTON EVENTS WITH NOAA SCALES BASED ON SOLAR X-RAY FLARE DATA USING NEURAL NETWORK
'

@article{ CMHHBA_2014_v47n6_209}
,title={FORECAST OF SOLAR PROTON EVENTS WITH NOAA SCALES BASED ON SOLAR X-RAY FLARE DATA USING NEURAL NETWORK}
,volume={6}
, url={http://dx.doi.org/10.5303/JKAS.2014.47.6.209}, DOI={10.5303/JKAS.2014.47.6.209}
, number= {6}
, journal={Journal of The Korean Astronomical Society}
, publisher={The Korean Astronomical Society}
, author={Jeong, Eui-Jun
and
Lee, Jin-Yi
and
Moon, Yong-Jae
and
Park, Jongyeop}
, year={2014}
, month={Dec}