This paper presents a hybrid model solution for user motion recognition. The use of a single classifier in motion recognition models does not guarantee a high recognition rate. To enhance the motion recognition rate, a hybrid model consisting of decision trees and artificial neural networks is proposed. We define six user motions commonly performed in an indoor environment. To demonstrate the performance of the proposed model, we conduct a real field test with ten subjects (five males and five females). Experimental results show that the proposed model provides a more accurate recognition rate compared to that of other single classifiers.
A user’s activity or status is essential information for pervasive computing that interacts with users and collects information for improving the quality of human life. The smartphone is a device that can be used to recognize a user’s motion by sensing and collecting data from various kinds of embedded sensors [1]. Recognized motion can be used to improve human-centered services, intelligent buildings, location-based services (LBS), and user-context awareness [2]. In particular, recognized motion is useful to enhance the positioning accuracy of LBS in indoor environments where Global Positioning System signals are blocked. In [3], a Wi-Fi-based positioning system that combines physical motion recognition is presented. Pedestrian dead reckoning combined with motion recognition is presented in [4]–[5]. However, in the case of false motion recognition, positioning performance results are less accurate; thus, an advanced classifier that could give a more accurate recognition rate is required.There have been several researches regarding motion recognition through the use of smartphones [3], [6]. In [6], a decision tree (DT) using simple accelerometer features was implemented. In [3], on the other hand, a support vector machine (SVM)–based classifier for smartphone users was created. Use of the DT in [6] gives a higher performance compared with use of the SVM in [3]. However, in [3], the performance of the SVM-based classifier is better than that of the DT. Usually, each of the algorithms (DT and SVM) is dependent upon defined motion states, selected features, or the position and orientation of the smartphone. Hence, the performance of each algorithm cannot be guaranteed when such factors are likely to vary.To produce favorable results, an advanced classification technique is required, such as that used in an intelligent hybrid classifier. In this work, we present a hybrid model classifier consisting of DTs and an artificial neural network (ANN) ensemble. We define six user motions commonly performed in indoor environments. In addition, we define two motion groups — motions are assigned according to whether a proximity sensor is enacted by a motion. Next the ANN ensemble chooses a performed motion in a selected group. The experimental results show that the proposed classifier gives higher recognition rates than the single classifiers in [3], [6], [7], and [8].
II. System Description
Many researchers understand the limitations of single classifiers and how their performance is dependent on various experimental conditions. Also, in many applications, the process data to be analyzed can be too large for a single classifier to handle [9]. To overcome such limitations, we propose a hybrid model classifier. Here, the basic concept is derived from the notion that a combined opinion is more reliable than an individual one [10]. A diagram of the proposed hybrid model classifier is presented in Fig. 1. The smartphone gives sensor data to the hybrid model classifier at a sampling rate of 50 Hz. We utilize the accelerometer and proximity sensor for the motion recognition. The input of the hybrid model classifier is the combination of the three-axis output from the accelerometer and the output from the proximity sensor. In Fig. 1, the format of the input data is presented. The output of the hybrid model classifier is the estimated motion of the smartphone user. The first DT, using a variance value of the accelerometer output, determines whether the user is moving. If the first DT determines that the user is moving, then the second DT will determine the motion group to which the performed motion belongs by using an output of the proximity sensor. Each ANN ensemble finally recognizes the user’s motion.
We define six motions that are commonly performed by a user carrying a smartphone as they walk in an indoor environment. The defined motions, depicted in Fig. 2, are explained in Table 1.
A DT is a classifier that helps to represent decisions that are made in accordance with a particular feature’s qualities; hence, it is also known as a qualitative classifier. For motion recognition, for which many features are required, a single DT classifier is not sufficient. On the other hand, if features can be clearly divided into certain motion groups, then a single DT in combination with other classifiers could be useful for motion recognition.As shown in Fig. 1, we construct the first and second DTs before operating the ANN ensemble. The first DT, by considering the accelerometer output, is used to determine whether the user is static or moving. Figure 3 presents the variance value (Vt) of the accelerometer output during the experiment. As expected, if the user is static, then the value of Vt is very low. On the other hand, the value of Vt is quite high when the user performs defined motions other than M_1. The first DT determines that the user is static for values of Vt that are lower than the accelerometer variance threshold (that is, TA). In our algorithm, TA is set to 0.3. The value of Vt is calculated from the following:In (1), a_xt, a_yt, and a_zt denote the accelerometer output of their respective axis at time t. The norm value of the accelerometer at time t is denoted by A_Nt. In (2), A_Mt indicates the mean value of A_N during the 0.02 × n second at time t. For example, if n = 50, then A_Mt is the mean value of A_N during the particular second at time t. In (3), Vt denotes the variance of the accelerometer during the 0.02 × m second at time t. In our experiments, n and m are set to 10 and 50, respectively.
Variance of accelerometer output for the six motions.
If the first DT determines that the user is moving, then the second DT selects one of the two predefined motion groups. As shown in Table 1, each motion is assigned to either motion group one (MG_1) or motion group two (MG_2). The motions M_2, M_4, and M_5 belong to MG_1, and the remaining motions, except M_1, belong to MG_2. To assign each motion, we utilize the proximity sensor because it can detect the presence of nearby objects. For the motions depicted in Fig. 2, the proximity sensor would be enacted in the cases of M_3 and M_6. On the other hand, the proximity sensor would not respond in the cases of M_2, M_4, and M_5. The proximity sensor provides a binary near or far measurement. If the proximity sensor detects a nearby object, then it outputs the value 0. Otherwise, it outputs the value 1. To remove the possibility of false detection, we use the average value of the proximity sensor, which is calculated as follows:In (4), Pk denotes the output of the proximity sensor at time k. The mean value of P during 0.02 × j second at time t is denoted by P_Mt. In our experiments, j is set to 5. Thus, P_Mt would be the mean value of the proximity sensor during the first 0.1 seconds. If P_M is lower than the proximity sensor threshold (TP), then the second DT would conclude that the performed motion belongs to MG_2. In our algorithm, TP is set to 0.1. Figure 4 depicts the values of P_M for the various aforementioned motions. As expected, only in the cases of M_3 and M_6 is P_M lower than TP. After a certain motion group is selected, the ANN ensemble estimates the motion of user.
Mean value of proximity sensor output for the six motions.
- 2. ANN Ensemble
The ANN is a classification model inspired by natural neurons. The ANN is comprised of an input layer, a hidden layer, an output layer, and weights connecting each of the nodes. The output of the ANN is dependent upon the aforementioned weights. We extract certain features from the accelerometer and use them to create an input vector. The names and definitions of these features can be found in Table 2. We utilize a backpropagation algorithm [11] for the learning of the ANN classifier. In the learning of the ANN classifier process, final weights are decided and an output is obtained. After the completion of the ANN process, an output vector is calculated. The output vector is expressed as {o_e1, o_ e2, ... , o_eu}, where u denotes the number of candidate motions. Each output element has a value between −1 and 1, and the maximum value among the output elements becomes the recognized motion.
In practice, a user’s sensor data, obtained from the smartphone, varies in accordance with their physical condition and motions. This is a critical factor that can affect the performance of a single ANN classifier. To overcome this issue, we implement an ensemble of ANN classifiers. The ANN ensemble concept is derived from the widely accepted notion that “two heads are better than one.” Each ANN classifier in the ANN ensemble has its own weight since an initial random weight for each was set prior to its learning. We combine the output vectors of each ANN classifier (see Fig. 5). To combine each of the ANN classifier output vectors, we transform the output vectors into probability vectors, as follows:where pu,v denotes the uth element of the probability vector of the vth ANN classifier. In other words, the output vector {o_e1,v, o_e2,v, … , o_eu,v} is transformed to the probability vector {p1,v, p2,v, … , pu,v}. Then we obtain the combined probability vector as follows:where B denotes the combined probability of the ANN ensemble and each s is an element of the combined probability. Each element of the combined probability vector has its own labeled motion. The recognized motion is determined by the element having maximum value in the combined probability vector.
In our proposed system, before the ANN ensemble estimates the motion, the second DT selects a motion group. Thus, the ANN ensemble’s task is made easier since there are now fewer candidate motions to choose from. Nevertheless, the reason for the ANN ensemble is that the input vector of each ANN classifier is highly dependent on the characteristics of the user. In our algorithm, v is set to three throughout the experiments. However, it can be altered according to the experimental environment.
IV. Experimental Assessment
- 1. Data Collection and Test Setup
To verify the proposed system, real field tests were conducted in the L1 building of the Korea Institute of Science and Technology. Ten subjects (five males and five females) of different ages participated in the test. The subjects’ physical characteristics and ages are presented in Table 3. For the test scenario, each subject had to walk along a corridor performing the motions M_2 to M_6 sequentially as they walked. The subject performed each motion for a distance of 50 meters, walking 250 meters in total. An application logging the sensor data was installed on a Samsung Galaxy Note (Android OS). When the subject walked, the sensor data, obtained from the accelerometer and proximity sensor, were saved in the SD card at a sampling rate of 50 Hz. Upon collecting the data and training the classifiers, we compared the performance of the hybrid model classifier with single classifiers, such as DT, ANN, and SVM. Note that the sensor data of subject M1 is only utilized for the training of all classifiers.
In this subsection, the performance of the hybrid model classifier is analyzed on ten subjects. Table 4 presents the experimental results of the hybrid model classifier. For each of the subjects, the respective total recognition rates corresponding to each of the six predefined motions is given. The proposed classifier performs well regardless of the subject. However, in the case of M_4, the recognition rate is lower compared to that for other motions. In the proposed classifier, the M_4 is (walking swinging hands) intermittently misestimated for the M_5 (running). When the subject performs the swing motion, the patterns of the extracted feature are sometimes similar to those associated with the running motion. To solve this problem, a third DT distinguishing between M_4 and M_5 would be required.
The ANN ensemble, consisting of several individual ANN classifiers, is utilized in the proposed model. Figure 6 shows the performance of the hybrid model classifier according to the number of ANN classifiers used. The recognition rate in Fig. 6 is the mean recognition rate for all ten subjects. This mean recognition rate converges to about 98% when the number of ANN classifiers used exceeds two. Thus, we take three ANN classifiers for the ANN ensemble in the hybrid model classifier.
Mean recognition rate of hybrid model classifier according to the number of ANN single classifiers used.
To evaluate the performance of the hybrid model classifier, we compare its recognition rate with those of other classifiers, such as DT [6], ANN [11], and SVM [12], with the same data sets. Table 5 presents the recognition rates of all classifiers. The proposed model provides the highest recognition rate among the classifiers. Figure 7 shows the recognition results of subject F7 for all classifiers. Blue squares and red points denote true label and estimated label, respectively. We can clearly see that false recognitions occurring in the single classifiers are overcome in the proposed model.
Recognition results of subject F7 for all classifiers: (a) DT, (b) ANN, (c) SVM, and (d) hybrid model classifier.
Table 6 presents the computational times of all classifiers. All algorithms were performed on a personal computer with Intel Core i7-2600 CPU. The computational time of the SVM is the longest among the classifiers. A comparison of the computational times indicates that the proposed model has more computational efficiency than that of the SVM.
In this paper, we presented a hybrid model classifier for the motion recognition of users. Single classifiers cannot guarantee high recognition rates for users of varying ages and body characteristics. A hybrid model classifier, comprising of two DTs and an ANN ensemble, was proposed to enhance such a recognition rate. We defined six motions commonly performed in indoor environments. To verify the performance of the proposed classifier, we conducted real field tests with ten subjects. We compared its performance with those of other classifiers, such as a DT, a single ANN, and a single SVM. The experimental results showed that the hybrid model classifier provided the highest results among the classifiers.
This work was supported by the KIST Institutional Program (Project No. 2E24812) and also supported by Institute of Advanced Aerospace Technology at Seoul National University.
BIO
bjshin1984@snu.ac.krBeomju Shin received his BS and MS degrees in information and communication engineering from Sejong University, Seoul, Rep. of Korea, in 2010 and 2012, respectively. From 2012 to 2014, he worked for the Sensor System Research Center at the Korea Institute of Science and Technology, Seoul, Rep. of Korea. He is currently working toward his PhD at the School of Mechanical and Aerospace Engineering, Seoul National University, Seoul, Rep. of Korea. His current interests include pattern recognition, machine learning, and indoor navigation systems.
chulki.kim@kist.re.krChulki Kim received his BS degree in physics from the Korea Advanced Institute of Science and Technology, Daejeon, Rep. of Korea, in 2002 and his PhD degree in physics from the University of Wisconsin-Madison, WI, USA, in 2011. He was a research staff member at the Samsung Advanced Institute of Technology, Yongin, Rep. of Korea in 2012. Since 2012, he became a researcher at the Korea Institute of Science and Technology, Seoul, Rep. of Korea. His research interests include nanomechanical single-electron transistors and sensor applications of NEMS.
jaekim@kist.re.krJae Hun Kim received his BS and MS degrees in electrical and computer engineering from Purdue University, NI, USA, in 1997 and 1999, respectively and his PhD in electrical engineering from Pennsylvania State University, PA, USA, in 2008. His research interests are focused on the development of sensor networks.
slee@kist.re.krSeok Lee received his BS, MS, and PhD degrees in physics from Yonsei University, Seoul, Rep. of Korea in 1985, 1987, and 1994, respectively. Since 1996, he has been with the Sensor System Research Center at the Korea Institute of Science and Technology, Seoul, Rep. of Korea, where he is now a senior researcher. His research interests include integration of bio-sensors and a sensor platform with sensor networks.
kee@snu.ac.krChangdon Kee received his BS and MS degrees in aeronautics engineering from Seoul National University, Seoul, Rep. of Korea, in 1984 and 1986, respectively. He received his PhD degree in aeronautics and astronautics from Stanford University, Standford, CA, USA, in 1994. Since 1996 he has been with the Department of Mechanical and Aerospace Engineering, Seoul National University, Seoul, Rep. of Korea, where he is a professor. He has more than 20 years’ experience of GNSS and flight-control research.
Corresponding Authortaikjin@kist.re.krTaikjin Lee received his BS and PhD degrees in mechanical and aerospace engineering from Seoul National University, Seoul, Rep. of Korea, in 2001 and 2008, respectively. In 2008, he was with the School of Mechanical and Aerospace Engineering, Seoul National University, Rep. of Korea, where he was a postdoctoral fellow. Since 2010, he has been with the Korea Institute of Science and Technology, Seoul, Rep. of Korea, as a senior researcher. His areas of interest are indoor navigation systems, pattern recognition, and sensor networks.
Bedogni L.
,
Felice M.D.
,
Bonori L.
“By Train or by Car? Detecting the User’s Motion Type through Smartphone Sensors Data,”
IEEE/IFIP Int. Conf. Wireless Days
Dublin, Ireland
Nov. 21–23, 2012
1 -
6
DOI : 10.1109/WD.2012.6402818
Chon Y.
,
Cha H.
2011
“LifeMap: A Smartphone-Based Context Provider for Location-Based Services,”
IEEE J. Pervasive Comput.
10
(2)
58 -
67
DOI : 10.1109/MPRV.2011.13
Yang J.
2009
“Toward Physical Activity Diary: Motion Recognition Using Simple Acceleration Features with Mobile Phones,”
Int. Workshop Interactive Multimedia Consum. Electron.
Beijing, China
1 -
10
DOI : 10.1145/1631040.1631042
Khan A.
“Human Activity Recognition via an Accelerometer-Enabled-Smartphone Using Kernel Discriminant Analysis,”
Proc. Int. Conf. Future Inf. Technol.
Busan, Rep. of Korea
May 21–23, 2010
1 -
6
DOI : 10.1109/FUTURETECH.2010.5482729
Bujari A.
“Movement Pattern Recognition through Smartphone’s Accelerometer,”
IEEE. Consum. Commun. Netw.
Las Vegas, NV, USA
Jan. 14–17, 2012
502 -
506
DOI : 10.1109/CCNC.2012.6181029
Ding Y.
,
Song X.
,
Zen Y.
2008
“Forecasting Financial Condition of Chinese Listed Companies Based on Support Vector Machine,”
Elsevier J. Expert Syst. Appl.
34
(4)
3081 -
3089
DOI : 10.1016/j.eswa.2007.06.037
@article{ HJTODO_2014_v36n6_1016}
,title={Hybrid Model-Based Motion Recognition for Smartphone Users}
,volume={6}
, url={http://dx.doi.org/10.4218/etrij.14.0113.1159}, DOI={10.4218/etrij.14.0113.1159}
, number= {6}
, journal={ETRI Journal}
, publisher={Electronics and Telecommunications Research Institute}
, author={Shin, Beomju
and
Kim, Chulki
and
Kim, Jae Hun
and
Lee, Seok
and
Kee, Changdon
and
Lee, Taikjin}
, year={2014}
, month={Dec}
TY - JOUR
T2 - ETRI Journal
AU - Shin, Beomju
AU - Kim, Chulki
AU - Kim, Jae Hun
AU - Lee, Seok
AU - Kee, Changdon
AU - Lee, Taikjin
SN - 1225-6463
TI - Hybrid Model-Based Motion Recognition for Smartphone Users
VL - 36
PB - Electronics and Telecommunications Research Institute
DO - 10.4218/etrij.14.0113.1159
PY - 2014
UR - http://dx.doi.org/10.4218/etrij.14.0113.1159
ER -
Shin, B.
,
Kim, C.
,
Kim, J. H.
,
Lee, S.
,
Kee, C.
,
&
Lee, T.
( 2014).
Hybrid Model-Based Motion Recognition for Smartphone Users.
ETRI Journal,
36
(6)
Electronics and Telecommunications Research Institute.
doi:10.4218/etrij.14.0113.1159
Shin, B
et al.
2014,
Hybrid Model-Based Motion Recognition for Smartphone Users,
ETRI Journal,
vol. 6,
no. 6,
Retrieved from http://dx.doi.org/10.4218/etrij.14.0113.1159
[1]
B Shin
,
C Kim
,
JH Kim
,
S Lee
,
C Kee
,
and
T Lee
,
“Hybrid Model-Based Motion Recognition for Smartphone Users”,
ETRI Journal,
vol. 6,
no. 6,
Dec
2014.
Shin, Beomju
Kim, Chulki
Kim, Jae Hun
et al.
“Hybrid Model-Based Motion Recognition for Smartphone Users”
ETRI Journal,
6.
6
2014:
Shin, B
,
Kim, C
,
Kim, JH
,
Lee, S
,
Kee, C
,
Lee, T
Hybrid Model-Based Motion Recognition for Smartphone Users.
ETRI Journal
[Internet].
2014.
Dec ;
6
(6)
Available from http://dx.doi.org/10.4218/etrij.14.0113.1159
Shin, Beomju
,
Kim, Chulki
,
Kim, Jae Hun
,
Lee, Seok
,
Kee, Changdon
,
and
Lee, Taikjin
,
“Hybrid Model-Based Motion Recognition for Smartphone Users.”
ETRI Journal
6
no.6
()
Dec,
2014):
http://dx.doi.org/10.4218/etrij.14.0113.1159