Full article: Fidan: a predictive service demand model for assisting nursing home health-care robots

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

While population aging has sharply increased the demand for nursing staff, it has also increased the workload of nursing staff. Although some nursing homes use robots to perform part of the work, such robots are the type of robots that perform set tasks. The requirements in actual application scenarios often change, so robots that perform set tasks cannot effectively reduce the workload of nursing staff. In order to provide practical help to nursing staff in nursing homes, we innovatively combine the LightGBM algorithm with the machine learning interpretation framework SHAP (Shapley Additive exPlanations) and use comprehensive data analysis methods to propose a service demand prediction model Fidan (Forecast service demand model). This model analyzes and predicts the demand for elderly services in nursing homes based on relevant health management data (including physiological and sleep data), ward round data, and nursing service data collected by IoT devices. We optimise the model parameters based on Grid Search during the training process. The experimental results show that the Fidan model has an accuracy rate of 86.61% in predicting the demand for elderly services.

KEYWORDS:

1. Introduction

Population aging has become a challenge for every country in the world (Yang et al., Citation2022). Developed countries also face this problem (Johnson, Citation2022). In the context of population aging, the traditional elderly care service model is facing transformation (Zhang, Citation2023). In the elderly care service, the demand for caregivers is also increasing. The increased number of older adults and the decrease in newborns have caused a human resource crisis in the nursing industry (Marć et al., Citation2019). Our research is to solve this crisis better. Researchers are exploring solutions to the current problem of increased caregiver workload due to an aging population. In order to solve the problem of human resources and the increase in the workload of nursing staff, it has become common to use robots to assist nursing staff in their work. However, traditional robotic solutions’ limitations are that the robots currently used in nursing homes perform pre-set tasks. Robots performing pre-set tasks need to reduce the workload of nursing staff effectively.

The workflow of this type of robot that executes the set task type, taking the disinfection robot as an example, the main steps are: first, the disinfection robot will move around the office area where it is working before the first work to complete the work of building the map; then, the staff using the disinfection robot will mark critical locations on the constructed map, such as meeting rooms, administrative offices, front desk, etc.; finally, the staff will set the start time and place of disinfection in the background system, and the disinfection robot will Start working at the appointed time and position, and return to the charging position after completing the disinfection task. Since this type of robot does not have the capabilities of subjective analysis, judgment, decision-making, and operational execution, it adds new workloads in application scenarios where requirements change frequently. Our research results can empower robots to be upgraded from robots performing pre-set tasks to robots actively performing services, providing more direct and practical assistance to nursing staff.

Robot technology has developed rapidly in recent years and has been widely used in medical, logistics, agriculture, service industries, and other fields (Lopez et al., Citation2017). The emergence of edge computing technology has also made the application of artificial intelligence in intelligent elderly care even more powerful. In addition to basic offloading capabilities, edge computing provides the smart elderly care industry with value-added capabilities, application capabilities, and customer self-service capabilities. For example, cash register robots, security patrol robots (Trovato et al., Citation2017), warehousing robots, caddy-following robots, welcome robots (Draghici et al., Citation2022), food delivery robots, disinfection robots, etc., are more common in the service industry (Du et al., Citation2022). Such robots belong to the type of robots that perform set tasks.

After our research, we found that the nursing home provides services for individuals with nearly 20 tasks, such as nurses measuring blood pressure and massage for individual users, cooperating with nurses to conduct ward rounds, etc. Therefore, to realise the intelligent service of nursing home health-care robots, the service needs of individuals in nursing homes must be effectively predicted. The service requirements of individuals in nursing homes are mainly influenced by the behavioural attributes of individuals, so effective prediction of the behavioural attributes of individuals in nursing homes is the core task of intelligent service for nursing home health-care robots. The demand for intelligent nursing home health-care robots is to provide services to individuals after relevant analysis and decision-making.

The intelligent service scenario in Figure has doctors and nurses overseeing and inspecting the services. The daily physiological monitoring, sleep monitoring, air monitoring, service matter, and room check monitoring values of all individuals in the nursing home are uploaded in real time to the health profile, health monitoring, and service management databases. From these databases, we extract relevant data for feature analysis, build models, train the constructed models on relevant features using the Gradient Boosting Decision Tree (GBDT) based machine learning model, and finally, the influence of each independent variable on the dependent variable is analyzed using the SHAP framework. The care server robot proactively provides services to individuals in nursing homes based on the predicted outcomes of the Fidan model.

Figure 1. Illustration of an intelligent service scenario.

This research aims to predict the demand for nursing home services based on the LightGBM (Light Gradient Boosting Machine) algorithm and the machine learning interpretation framework SHAP (SHapley Additive exPlanations) and data collected by IoT devices. This paper's contribution is innovatively applying the algorithm to health-care robots in nursing homes. The application value of this research is that in the application scenario of nursing homes, the Fidan model can predict the service demand of the elderly and improve the efficiency of health-care robots. The industrial significance of this research is that in the application scenario of nursing homes, nursing homes can add analysis and decision-making capabilities to nursing robots by calling the Fidan model. Nursing robots using the Fidan model can actively perform services, reducing the workload of nursing staff more directly and effectively. The main contributions of this paper are as follows:

We propose a feature extraction model for predicting service demand based on the comprehensive analysis of prediction results from data in the health records database, health monitoring database, service management database, and a machine learning interpretable framework.
We propose a model for predicting service demand based on the gradient boosting framework. Based on an exhaustive search, combined with relevant feature variables, we optimised the model parameters and obtained the optimal model for predicting service demand through training.
We performed a complexity and overhead analysis while proposing the model. Based on the analysis results, we propose a human–computer interaction method and a communication method between the model algorithm layer and the robot control layer.

The following is how the rest of the paper is organised. Section 2 is a review of related work. Section 3 describes the design of the intelligent service model. Section 4 describes the implementation of the model. Section 5 describes the analysis of the experimental results. Section 6 presents the conclusions.

2. Related work

In recent years, there has been much research on service demand forecasting. In order to solve the problem that companies participating in the bicycle rental system are facing balanced supply and inconsistent demand, Y. Qin et al. uses the Washington DC shared bicycle system data set (Qin et al., Citation2018), based on principal component analysis and generalised recurrent neural network model, proposes a method for predicting public bicycle demand. This method achieved minimum RMSLE and MAPE values of 0.391 and 0.108, respectively. M. Phankokkruad et al. proposed a method to predict service demand through the number of patients served by medical services based on convolutional neural networks to ensure the availability of resources for the required drugs (Phankokkruad & Wacharawichanant, Citation2019). This method obtained an MAE value of 1.152 and an RMSE value of 1.366 on the daily outpatient data set of the obstetrics and gynecology department of a hospital in Bangkok.

In order to reduce the logistics cost of each process of service companies, R. Ramos-Carrasco et al. proposed a method of predicting tourism demand based on an artificial neural network (Ramos-Carrasco et al., Citation2019). This method's absolute average percentage error (AMPE) is 11.21%, which reduces the logistics cost of the service company by nearly 33%. In order to make more effective use of resources and reduce losses caused by overproduction (Çetinkaya & Erdal, Citation2019), Z. Çetinkaya et al. proposed a method for predicting daily dietary needs in the cafeteria of Kırıkkale University based on the artificial neural network model using student data sets. The average MSE value of the studies conducted was 0.00864. In order to facilitate vehicle scheduling, improve mileage utilisation, and reduce passenger waiting time (Jin et al., Citation2020), Y. Jin et al. proposed a model to predict online car-hailing demand based on stacked ensemble learning methods and large-scale data sets. The MAE and RMSE of this model increase by 6.0% and 5.2%, respectively, within a 30-min interval, demonstrating the proposed model's effectiveness and feasibility. In order to improve the accessibility of urban traffic, reduce passenger waiting time, improve the safety of taxi stations, and reduce passenger transportation costs, D. Faial et al. proposed a prediction method using flow learning (Faial et al., Citation2020). The method was 78 percent accurate in predicting taxi demand.

In order to realise the prediction of the demand for emergency medical services, K. Bandara et al. proposed a global prediction and reasoning framework based on the extended short-term memory network (Bandara et al., Citation2020). When this framework is validated using an Australian alcohol, drug, and self-harm-related emergency medical services dataset, experimental results outperform many state-of-the-art techniques, achieving competitive results in predictive accuracy. L. Munkhdalai et al. proposed a service demand prediction method based on deep learning in response to the sharp increase in demand for postal delivery services during unique festivals (Munkhdalai et al., Citation2020). The MAPE value of this method in the experiment is less than 0.15. In order to improve the refinement of aviation material support and increase the predictability of aviation material support (Niu et al., Citation2020), P. Niu et al. proposed a material demand forecasting method based on LVQ neural network, Elman neural network, and SOM neural network. Examples prove that this method has a good effect. M. A. Zháo et al. proposed a prediction method based on a support vector regression machine (Zháo & Jayadi, Citation2021), using variables such as seasonality, public holidays, and order peak times to realise the prediction of daily restaurant visitors and menu demand. This method achieved a MAPE of 31.2% in the experiments.

There is also a lot of research on nursing robots. Ahamed et al. used infrared sensors and Arduino UNO microcontrollers to make a robot that can remind patients or the elderly to take medicine in time and provide pure drinking water when patients need it. The robot uses infrared and ultrasonic sensors to sense the presence of cups and the level of drinking water (Ahamed et al., Citation2020). In order to reduce the social transmission of coronavirus disease, Manikandan et al. designed a robot that can provide patients with medicines, food, and clothes (Manikandan et al., Citation2021). In order to improve the comfort of the elderly robot for the elderly, Zhang et al. analyzed the vibration characteristics of the human-machine coupling system (Zhang et al., Citation2021). They designed a robot prototype that would not resonate with the elderly during walking. In order to ease the work of nursing staff, Jonatan et al. proposed a robot prototype for nursing homes that can deliver medicines according to a configured schedule (Jonatan et al., Citation2022). The above research mainly studies improving the physical experience during the robot-carrying process.

The nursing robots research above belongs to the type of robots that perform set tasks. Because task-based robots need to set tasks before running, during the actual work of nurses, the content of their work will change with the needs of the elderly in nursing homes. It then arises that while the robot is performing one task, it may be required to complete another. However, a robot can't interrupt an ongoing assignment with a task request (Zieliński et al., Citation2017). This interrupt handling is a complex issue for robot controllers. Some tasks cannot be terminated immediately during the startup phase, and some need to complete a series of operations before they can be suspended and resumed (Dudek et al., Citation2019). Therefore, task-based robots are limited in reducing nurses’ workload due to the need for more anticipation of tasks. The Fidan model proposed in this paper, based on machine learning, allows task-type robots to be transformed into active service-providing robots by predicting the corresponding service needs based on various behavioural attributes of individuals in nursing homes.

3. Model design

In this section, we first analyze the daily behaviours of all individuals in nursing homes and propose a feature extraction model. Secondly, we analyze various data and train the constructed training set based on the ensemble learning algorithm. Then, based on some task scheduling algorithms (Du et al., Citation2022; Zhou et al., Citation2022), the health-care robot application management system sends execution instructions to each health-care robot terminal. According to the business logic of intelligent service, it can be concluded that the core of intelligent service model design is the effective prediction of individual service demand (Wang et al., Citation2022; Zhou et al., Citation2021).

3.1. Feature analysis

According to our research, the service to be performed by the health-care robot in the nursing home is the behavioural needs of all individuals in the nursing home (Fuji et al., Citation2011). The individual demand behaviours in the nursing home are mainly divided into passive execution (Lu et al., Citation2021), active calling, and imperceptible participation (Khosla et al., Citation2013).

Passive execution behaviour means behaviour without consciousness and belongs to passive acceptance. The behavioural attributes include the detection of physiological indicators such as triglycerides, changing urine pads, changing bedding, changing clothes, indoor cleaning, diet, routine ward rounds, night patrols, etc.

The core of the intelligent service of health-care robots is the active service after the effective prediction of individual needs, which are the daily service needs of individuals. According to our research, these daily services can be categorised into five categories: physiological monitoring services, sleep monitoring services, air monitoring services, ward round monitoring services, and daily life services. Therefore, intelligent services need to predict the demand for these five categories of services effectively. Moreover, the feature analysis of these five types of data is the primary task of the intelligent service of health-care robots.

The initial data used in this study are extracted from three databases: health archives, health monitoring, and service management in the health management system. This paper extracts individual information, including gender, age, medical history, nursing level, evaluation level, physiological monitoring, sleep monitoring, ward round monitoring, air monitoring, and service items, which are more than 20 features of 10 categories as independent variables. We count the service workload of the individual, which is then used as the dependent variable, and use the user ID as the unique identifier, ultimately obtaining 13,000 groups of samples. According to the above analysis, we randomly divide the dataset into a test set for prediction evaluation and a training set for model training according to the ratio of 2:8.

3.2. Working principle of lightGBM

LightGBM is a new boosting framework proposed by Microsoft (Ke et al., Citation2017). The basic working principle of LightGBM is similar to that of XGBoost (Chen & Guestrin, Citation2016), both of which use decision trees based on learning algorithms. LightGBM has further optimised the training speed of the model. For example, LightGBM uses a histogram-based decision tree algorithm. This algorithm's basic idea is first to discretize the continuous floating-point eigenvalues into multiple integers and construct a histogram whose width is an integer number. The statistical index accumulates in the histogram when the model traverses the data according to the discretized value. After the model crosses the data once, the histogram collects the required statistics. Finally, according to the discrete value of the histogram, travel to find the optimal segmentation point.

The implementation process of the XGBoost algorithm is as follows: multiple decision tree models are used as the basic unit to form an integrated learner; based on the second-order Taylor expansion, the regular term is added to the loss function to realise the control of complexity and overfitting (Chen & Guestrin, Citation2016).

Formulas (1) (2) express the calculation method of the XGBoost evaluation function. (1) $\begin{aligned} L^{(t)} & = \sum_{i = 1}^{n} l (y_{i}, {\hat{y}}_{i}^{(t - 1)} + f_{t} (x_{i})) + Ω (f_{t}) \end{aligned}$ (1) (2) $\begin{aligned} Ω (f_{t}) & = γ T + \frac{1}{2} λ ∥ ω ∥^{2} \end{aligned}$ (2) In formula (1), $L^{(t)}$ represents the objective function after $t$ iterations, $l (y_{i}, {\hat{y}}_{i}^{(t - 1)} + f_{t} (x_{i}))$ represents the loss function, and $y_{i}$ represents the value of $i$ samples, ${\hat{y}}_{i}^{(t - 1)}$ represents the i-th output after training $(t - 1)$ times, $f_{t} (x_{i})$ represents the state of $x_{i}$ after training $t$ times; in formula (2), $Ω (f_{t})$ represents the output after training $t$ times, $γ$ represents the output parameter, $T$ represents the total dependent variable, $λ$ represents the relevant parameters in the objective function, and ω represents the corresponding degree of influence (Chen & Guestrin, Citation2016).

The LightGBM algorithm is an upgraded version of the XGBoost algorithm. The LightGBM algorithm uses the GOSS algorithm to reduce the number of samples (Ke et al., Citation2017). Suppose there are $i$ data sets ${x_{1}, \dots, x_{i}}$ , each $x_{i}$ is an m-dimensional vector in $X^{m}$ space, and the loss function gradient is marked as ${g_{1}, \dots, g_{i}}$ . The processing method of the GOSS algorithm is as follows: first, arrange the training set in descending order of the absolute value of the gradient, retain the large gradient instance $a \times 100 %$ subset $A$ , and the small gradient instance $(1 - a) \times 100 %$ is the subset $A^{C}$ , and randomly sample $b \times | A^{C} |$ subset $B$ , split the training set according to the variable gain value ${\tilde{V}}_{j} (d)$ calculated on $A \cup B$ (Ke et al., Citation2017). As shown in formula (3). (3) ${\tilde{V}}_{j} (d) = \frac{1}{n} (\frac{{(\sum_{x_{i} \in A_{l}} g_{i} + \frac{1 - a}{n} \sum_{x_{i} \in B_{l}} g_{i})}^{2}}{n_{l}^{j} (d)} + \frac{{(\sum_{x_{i} \in A_{r}} g_{i} + \frac{1 - a}{n} \sum_{x_{i} \in B_{r}} g_{i})}^{2}}{n_{r}^{j} (d)})$ (3) In formula (3), $A_{l} = {x_{i} \in A : x_{i j} \leq d}$ , $A_{r} = {x_{i} \in A : x_{i j} > d}$ , $B_{l} = {x_{i} \in B : x_{i j} \leq d}$ , $B_{r} = {x_{i} \in B : x_{i j} > d}$ .

However, the LightGBM model is composed of N trees. For a specific training sample, we cannot confirm the influence of each feature in the training sample on the prediction result, especially for those samples that misjudged the model itself (Ke et al., Citation2017). The relationship between the results effectively improves the model effect or analyzes abnormal samples. Therefore, the Fidan model proposed in this paper based on the LightGBM algorithm uses the explainable machine learning framework SHAP (Shapley Additive exPlanations) to analyze the relationship between features and prediction results.

3.3. Working principle of SHAP

The principle of the SHAP value is the Shapley value, and the Shapley value is an algorithm derived from cooperative game theory. The Shapley value, created by Shapley in 1953, is an algorithm that assigns payouts to participants based on their contribution to the total payout. Participants cooperate in collective activities and obtain corresponding benefits from this cooperation. Interpreting machine learning predictions regarding Shapley, the Shapley value for each feature is the average marginal contribution of the feature value over all possible combinations of feature values. Where “total expenditure” is the model prediction for a single instance of the dataset, “participant” is the feature value of the sample, and “revenue” is the actual prediction for that instance minus the average forecast for all cases.

In the process of constructing the Fidan model, the calculation of the impact of feature variables on the prediction results is to use the machine learning interpretation framework SHAP developed by Lundberg et al. (Lundberg & Lee, Citation2017), as shown in formula (4). (4) $ϕ_{i} = \sum_{S \subseteq F ∖ {i}} \frac{| S |! (| F | - | S | - 1)!}{| F |!} [f_{S \cup {i}} (x_{S \cup {i}}) - f_{S} (x_{S})]$ (4) In formula (4) (Lundberg & Lee, Citation2017), $ϕ_{i}$ represents the degree of influence of the i-th variable, $s$ represents a subset of variables, $F ∖ {i}$ represents a set of variables, $F$ represents the total input of variables, $f_{S \cup {i}} (x_{S \cup {i}})$ means the model output when the training set only has variable values in $x_{S \cup {i}}$ , and $f_{S} (x_{S})$ means the model output when the training set only has variable values in $x_{S}$ .

3.4. Predictive model construction

The construction process of the Fidan model of the health-care robot task prediction model proposed in this paper mainly includes three steps: feature variable extraction, parameter optimisation, and result evaluation, as shown in Figure .

Figure 2. Forecast service demand model.

Figure shows the implementation idea of the Fidan model, which is described in detail as follows:

In steps 1 and 2, after analyzing the data in the health record database, health monitoring database, and service management database, we propose a feature extraction model for predicting service demand and use this model to construct a training sample set and test sample set.
In steps 3, 4 and 5, we train the feature extraction model based on the LightGBM algorithm. The contribution of prediction results is analyzed based on the SHAP framework. Based on the analysis results, we updated the training and test sample sets.
In step 6, we use LightGBM as the base classifier to train our proposed model for predicting service demand. Based on the prediction results of the model, we use Grid Search to optimise the model parameters.
In steps 7 and 8, we use the test sample set to test the optimal output model and optimise the model again according to the test results.

We loop through steps 6, 7, and 8 until the accuracy reaches a preset threshold. Ultimately, we obtain an optimal model for predicting service demand.

Table

Display Table

In Algorithm 1, the

b a t c h

is the number of samples, the

e p o c h s_N

is the number of model-building exercises, and the

A c c

is the accuracy pre-set threshold. Here the

A c c

is a preset accuracy threshold. As shown in Algorithm 1, we first build training and testing sets using our proposed feature extraction model. In training the Fidan model, we use Grid Search to optimise the model parameters. We iteratively train the model until the accuracy reaches a pre-set threshold.

Parameter optimisation uses a way of controlling model complexity to prevent model overfitting and improve model performance. In this paper, five-fold cross-validation is used to obtain the best parameters of the model. 10,400 training samples are divided into 2,080 sets according to 5 sets as a unit. In training, the model structure parameters, parameters affecting accuracy, and parameters affecting overfitting are processed using the grid search method (GridSearchCV) to find the optimal parameters. Finally, we obtain the optimal parameter prediction model based on the LightGBM algorithm after multiple iterative tests and parameter adjustment.

3.5. Evaluation metrics

The evaluation metrics we use are Mean Square Error (MSE), coefficient of determination (R_Squared), Mean Absolute Percentage Error (MAPE), Mean Absolute Error (MAE), and accuracy. Assuming that the predicted value is: $\hat{y} = {{\hat{y}}_{1}, {\hat{y}}_{2}, {\hat{y}}_{3}, \dots \dots, {\hat{y}}_{n}}$ , the real value is: $y = {y_{1}, y_{2}, y_{3}, \dots \dots, y_{n}}$ , the mean square error (MSE) calculation formula is shown in formula (5). (5) $M S E = \frac{1}{n} \sum_{i = 1}^{n} ({\hat{y}}_{i} - y_{i})^{2}$ (5) The value range $[0, + \infty)$ in formula (5), the larger the value, the larger the error. If the value is equal to 0, the predicted value is completely consistent with the actual value. The formula for calculating the coefficient of determination (R Squared) is shown in formula (6). (6) $R^{2} = 1 - \frac{\sum_{i = 1}^{n} {({\hat{y}}_{i} - y_{i})}^{2}}{\sum_{i = 1}^{n} {({\bar{y}}_{i} - y_{i})}^{2}}$ (6) The numerator in formula (6) is the sum of the square difference between the preset value and the output value. The denominator is the sum of the square difference between the preset value and the output mean value. The value range of $R^{2}$ is [0,1]. The larger the output value, the better the training. The calculation formula of mean absolute percentage error (MAPE) is shown in formula (7). (7) $M A P E = \frac{100 \%}{n} \sum_{i = 1}^{n} | \frac{{\hat{y}}_{i} - y_{i}}{y_{i}} |$ (7) The MAPE value range in formula (7) is $[0, + \infty)$ . The larger the value, the greater the error. The calculation formula of the mean absolute error (MAE) is shown in formula (8). (8) $M A E = \frac{1}{n} \sum_{i = 1}^{n} | {\hat{y}}_{i} - y_{i} |$ (8) The MAE value range in formula (8) is $[0, + \infty)$ . The larger the value, the greater the error. The accuracy calculation formula is shown in formula (9). (9) $Accuracy = \frac{T P + T N}{T P + T N + F P + F N} \times 100 %$ (9) In formula (9), TP represents true positive, TN represents true negative, FP represents false positive, and FN represents false negative.

3.6. Method complexity and cost analysis

The Fidan model we proposed uses the LightGBM algorithm as the base classifier. The LightGBM algorithm uses a histogram algorithm when splitting. The histogram algorithm can reduce the complexity of the base model itself, thus reducing memory usage. When the histogram algorithm handles cache access, all the sample features obtain gradients in the same way so that to achieve continuous access, only the gradients need to be sorted. This mechanism significantly improves the hit rate of the cache. Since the histogram algorithm does not need to store the array of row index to leaf index, it reduces the storage capacity requirement and avoids the cache miss problem.

The LightGBM algorithm uses the unilateral gradient sampling technology GOSS (Gradient-based One-Side Sampling). The starting point of the unilateral gradient sampling technology GOSS algorithm is to reduce the number of samples. In the gradient boosting decision tree GBDT, each sample data has a different gradient value, and the sample with a slight gradient value has a relatively small training error. From this, the model learns the sample data with a slight gradient value well. Therefore, the GOSS algorithm balances sample data volume and prediction accuracy by deleting most of the small gradient samples, retaining all samples with large gradients, randomly sampling samples with small gradients in proportion, and using the remaining samples to calculate information gain. The number of samples increases the speed of model training.

3.7. Data collection

In proposing the Fidan model, we collected data from health records, monitoring, and service management. The main fields collected in health records include gender, height, weight, age, self-care assessment, level of care, living habits, dietary preferences, medical history, family medical history, cancer history, immunisation history, and medications. The equipment that collects health monitoring data mainly includes thermometers, oximeters, blood pressure monitors, blood glucose, uric acid, and cholesterol metres. These IoT devices will upload the collected data through the Internet to the server that interacts with the Fidan model. Service management data records users’ needs when they click on each service content on the APP.

4. Model implementation

In this section, we introduce the implementation of the Fidan model. Based on the lifting tree LightGBM algorithm, this model builds a basic model for predicting individual service demand based on the training data set. This model uses the method of Shapley's additional explanations (SHAP) derived from game theory to analyze the contribution of 9 categories of more than 30 characteristic variables to the output results. The relevant parameters are updated according to the analysis results, and finally, the best model parameters are obtained.

4.1. Introduction to sample data

The data set used in this paper is a non-public private data set when training the model, and the public data set Food Demand Forecasting on Kaggle is used to verify the design ideas of the model further. Some fields of the non-public private data set used in training the model are shown in Table .

Table 1. Description of characteristic independent variables.

Download CSV Display Table

We plan to desensitise the non-public private data set and make it public after obtaining relevant authorisation.

4.2. Individual behaviour attribute collection

According to our research, the task prediction of health care robots is mainly to effectively predict the service needs of all individuals in nursing homes. Individual service needs are mainly affected by individual behavioural attributes. Therefore, collecting individual behaviour attributes and analyzing historical records are essential tasks for medical robot task prediction. This paper's behaviour attribute acquisition modules include a central control module, signal acquisition module, and terminal use module developed by Google Android Studio, including login, health detection, health monitoring, service call, nursing service, and nursing service functions.

4.3. Implementation of health-care robot

After the individual service demand prediction is completed, sending execution instructions to each health-care robot terminal is necessary. The health-care robot application management system in this paper uses the MySQL database for data storage. It uses the Bootstrap framework and VUE framework for front-end management interface development. It also uses HTML5 Canvas for data visualisation development.

The medical robot application management system predicts the processing result according to the individual service demand and sends the movement direction to the navigation system. The individual behaviour attribute collection module's central control module controls the signal collection unit to collect signals through relevant hardware interfaces. After the central control module performs relevant processing and analysis, it finally sends the result to the business workload prediction module through the network communication module. The workload prediction model updates relevant parameters in real-time.

4.4. Human–computer interaction and communication

We integrated speech synthesis, recognition, and voice wake-up functions into the robot control software system.

Considering the interactive experience between health-care robots and the elderly, we have added a human–computer interaction module to the robot control software system. When the nursing robot arrives in front of the elderly, it will actively ask them elderly if they need services. The nursing robot will only start serving when the elderly say they need assistance. When the older adult replies that no service is necessary, the nursing robot will leave the older man and go to other older people. After the nursing robot completes the interaction with the elderly, it will send the received feedback results back to the server to provide data support for further optimisation of the model.

To gain an in-depth understanding of the service experience of the elderly with health-care robots connected to the Fidan model and those not related to the Fidan model, we have added a scoring module to the robot control software system. When the nursing robot completes the service for the elderly, the elderly will terminate the service evaluation through voice or touch screen. Evaluation indicators include:

The timeliness of this service.
The effectiveness of this service.
The quality of this service.

As shown in Figure , to ensure that the nursing robot control layer can communicate with the model algorithm layer, we use two modes of mutual integration, cloud and local, when deploying model applications. We deploy the optimal model obtained through training locally and in the cloud. After the cloud model is updated, the locally deployed model will be updated synchronously. The robot control layer prioritises calling the models deployed in the cloud. If the data communication fails, the robot control layer will switch and call the locally deployed models in real-time.

Figure 3. The robot control layer communicates with the model algorithm layer.

We add a heartbeat communication protocol between the robot control and model algorithm layers. The protocol sets a period, and the robot control layer sends heartbeat data to the model algorithm layer. After receiving the heartbeat data, the model algorithm layer replies to the data according to the protocol. If the robot control layer receives the reply data over time, it will immediately start the model offline alarm.

5. Result analysis

5.1. Prediction results

Standard prediction algorithms are linear regression algorithm, logistic regression algorithm, support vector machine algorithm, model fusion algorithm, random forest algorithm, adaptive boosting tree algorithm, and gradient boosting decision tree algorithm. The Fidan model proposed in this paper is constructed based on a linear regression algorithm, logistic regression algorithm, support vector machine algorithm, model fusion algorithm, random forest algorithm, adaptive boosting tree algorithm, and gradient boosting decision tree algorithm. The evaluation results are compared and verified, as shown in Figure (a).

Figure 4. Comparison of prediction results of various methods. (a). Comparison of the prediction accuracy of GBDT with other algorithms; (b). Comparison of prediction accuracy of LightGBM with other algorithms.

As can be seen from Figure (a), the accuracy of the base models constructed based on the model fusion algorithm, the random forest model (Random Forest), and the gradient boosting decision tree algorithm all exceeded 80%. They all have high predictive ability, among which the basic model based on GBDT has the highest accuracy rate, and the accuracy rate of forecasting personalised service demand reaches 86.36%.

In order to further compare the evaluation results of the base model, the Fidan model proposed in this paper was constructed based on the XGBoost algorithm, Linear Regression, Random Forest, and LightGBM algorithm, respectively, during the construction process, and the evaluation was carried out. The results are compared and verified, as shown in Figure (b).

It can be seen from Figure (b) that the accuracies of the base models constructed based on the XGBoost algorithm, the Random Forest model, and the LightGBM algorithm all exceed 85%, and they all have high predictive ability. Among them, the base model based on the LightGBM algorithm has the highest accuracy, and individual service demand prediction accuracy reaches 86.61%.

The healthcare sector can turn their running nursing robots that perform set tasks into active service nursing robots with analysis and decision-making capabilities by calling the API interface encapsulated by our research results.

5.2. Analysis of the influence degree of characteristics

The individual service demand is mainly influenced by the service content and matters provided by the nursing home. The characteristic dependent variables involved in the study of this paper are the following service components: changing diaper pads, meal inquiry, calling for toilet, calling for water, and calling for massage. The correlation between the independent and dependent variables is shown in Figure .

Figure 5. Correlation between an independent variable and dependent variable.

Figure reflects that individual service needs are strongly influenced by the evaluation level, nursing level, age, gender, medical history, and some physiological monitoring values in the health record.

5.3. Analysis of the contribution of predicted outcomes

In order to explore the factors that affect the prediction results of individual service demand, we use the SHAP algorithm to analyze the contribution of the prediction results in detail from two aspects. These two aspects are global characteristic independent variable SHAP values and analysis of individual sample characteristics.

The influence of the global independent variables on the prediction results is shown in Figure , where each row represents an independent variable, and the horizontal coordinate is the SHAP value. In the Figure, a point represents a sample, a bluer colour represents a smaller value of the characteristic independent variable SHAP, and a redder colour represents a more significant value of the characteristic independent variable SHAP.

Figure 6. Global feature independent variable SHAP value.

In a comprehensive analysis, it can be concluded that the predictive results of the Fidan model proposed in this paper are influenced by gender, age, medical history, level of care, level of evaluation, HDL, LDL, total cholesterol, triglycerides, and uric acid.

5.4. Model tuning and validation

In terms of model tuning, we tune the training samples. In terms of sample tuning, according to the conclusions of the previous section, the characteristic independent variables that directly affect the prediction results of the Fidan model proposed in this paper are gender, age, medical history, nursing level, evaluation level, high-density lipoprotein, low-density lipoprotein, Total cholesterol, triglycerides, uric acid. In this paper, we first fine-tune the sample, extract ten characteristic independent variables and characteristic dependent variables that directly impact the prediction results of the Fidan model from the sample, and then form a new sample.

To compare the influence of the samples before and after tuning on the prediction results of the Fidan model, two evaluation metrics, mean squared error (MSE) and the coefficient of determination (R Squared), were used. Evaluation of the prediction results.

Because the more significant the MSE value, the larger the error, and the larger the R variance value is within the range of values, the better the model-fitting effect will be. Therefore, we examine the following evaluation results. Figure (a) shows the results of evaluating the mean squared error of the samples before and after adjustment. Figure (b) shows the results of the evaluation of the R squared evaluation results of the samples before and after adjustment.

Figure 7. The MSE and R squared evaluation results before and after sample tuning. (a). Mean square error evaluation results before and after sample tuning; (b). R squared evaluation results before and after sample tuning.

We can see that the prediction accuracy of the Fidan model trained with the tuned samples for individual service demands is better than that of the Fidan model trained with the pre-tuned samples.

5.5. Physiological data collection

In this paper, during the actual software operation process, the picture of using IoT devices to collect individual physiological data is shown in Figure .

Figure 8. Physiological data collection

As shown in Figure , while collecting physiological data by IoT devices, the individual blood glucose data collected by the blood glucose monitor are then transmitted to the Android tablet. The Android tablet transmits the data through the Internet to the cloud server where the Fidan model has been deployed.

5.6. Food demand forecasting

The core of the intelligent service of health care robots studied in this paper is the prediction of the daily service demand of individuals. The daily service contains daily living service. Daily living services include changing urine pads, bedding, clothes, house cleaning, and eating and drinking. Therefore, to further validate the accuracy of the Fidan model prediction, we validated the Fidan model using Food Demand Forecasting, a publicly available dataset on Kaggle (Vidhya, Citation2020). The results of the validation are shown in Figure :

Figure 9. Comparison of different models for food demand prediction results.

In this validation, we performed the base model construction based on the linear regression algorithm, logistic regression algorithm, support vector machine algorithm, and gradient boosting decision tree algorithm in the construction process for the Fidan model proposed in this paper. As shown in Figure , the accuracy of our proposed Fidan model for food demand forecasting in the dataset Food Demand Forecasting, the base model constructed based on the gradient boosting decision tree algorithm, reached 82.9%. From this validation result, the Fidan model can provide practical support for the intelligent service of nursing home care robots. Using the model optimisation method described in the previous section, we will further use Food Demand Forecasting to verify the model optimisation ideas. The experimental results before and after optimisation are shown in Table .

Table 2. Comparison of experimental results before and after optimisation.

Download CSV Display Table

From Table , all indicators of the optimised model are better than those before optimisation. We compare the experimental results with the research on service prediction cited in this paper, and the details are shown in Table .

Table 3. Comparison with current research methods.

Download CSV Display Table

As can be seen from Table , in the current research related to this paper, the MAPE value of Y. Qin et al.’s forecast for public bicycle demand is 0.108; the MAPE value of L. Munkhdalai’s forecast for service demand is less than 0.15; M. A. Zháo et al. The predicted MAPE value of the restaurant’s daily visitors and menu demand is 0.312; the comparison in Table shows that the Fidan model proposed by us based on SHAP and LightGBM has the best prediction effect.

This section describes the experimental results of the Fidan model. The experimental environment used in this paper to train the Fidan model includes Ubuntu 18.04.5 LTS (Bionic Beaver), Python 3.7.6, LightGBM 3.3.2, XGBoost 1.6.1, Numpy 1.18.5, Pandas 1.3.5, and sklearn 1.1. In collecting Internet of Things data, we used the web application framework Django of the MTV framework model. This framework is an open-source model powered by Python programming and follows MVC design and BSD copyright. The front end uses RESTful API to access the Django server. RESTful API is a REST-style API. REST is an architectural style with nothing to do with programming languages and platforms. It uses HTTP as the transmission protocol. The system uses MySQL to store health records, health monitoring, and service management data. We evaluate the experimental results using MSE, MAE, R_Squared, MAPE, and Accuracy. The Fidan model achieved an accuracy of 82.9% when verified using the Food Demand Forecasting, a public data set on Kaggle. The experimental results prove that the Fidan model can effectively support the service demand prediction of nursing home care robots.

6. Conclusion

This paper proposes a Fidan model for forecasting demand for nursing home services. The model combines the LightGBM algorithm and the explainable machine-learning SHAP algorithm. After iteration and optimisation, the Fidan model has a prediction accuracy of 86.61% for each service demand. Using our model, the health care department has realised the active service function of providing nursing robots with analysis and decision-making, providing more direct and practical assistance to nursing staff.

This paper completes the forecast of service demand with the day as the window unit. Future research will use hours as the time window unit. In addition, this paper completes the forecast of single service demand, then collects sample data of multiple service demands to study the forecast of multiple service demands. We plan to connect the Fidan model to nursing robots in nursing homes one after another and collect relevant scene data, including user experience data on the nursing robot that calls the Fidan model. Considering the large dimensions of scene data, in future research, we plan to use dimensionality reduction technology and deep neural networks to optimise further the Fidan model proposed in this paper, improve the robustness and generalisation ability of the model, and expand the application scope of the model, and more closely integrate models with real-world applications.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by Shanghai Science and Technology Project under Grant: [grant no No.22510761000]; National Natural Science Foundation of China under Grant: [grant no No.61873309, No.92046024, No.92146002]; National Key Research and Development Program of China: [grant no No.2019YFA0709502].

References

Adler, J. (2020). TUG autonomous mobile robots [EB/OL]. https://aethon.com/products.
Google Scholar
Ahamed, A., Ahmed, R., Hossain Patwary, M. I., Hossain, S., Ul Alam, S., & al Banna, H. (2020). Design and implementation of a nursing robot for Old or paralyzed person. 2020 IEEE Region 10 Symposium (TENSYMP), 2020, pp. 594-597. https://doi.org/10.1109/TENSYMP50017.2020.9230956.
Google Scholar
Bandara, K., Bergmeir, C., Campbell, S., Scott, D., & Lubman, D. (2020). Towards accurate predictions and causal ‘what-if’ analyses for planning and policy-making: A case study in emergency medical services demand. 2020 international joint conference on neural networks (IJCNN), Glasgow, UK, 2020, pp. 1-10. https://doi.org/10.1109/IJCNN48605.2020.9206787.
Google Scholar
Çetinkaya, Z., & Erdal, E. (2019). Daily food demand forecast with artificial neural networks: Kırıkkale university case. 2019 4th International conference on computer science and engineering (UBMK), Samsun, Turkey, 2019, pp. 1-6. https://doi.org/10.1109/UBMK.2019.8907105.
Google Scholar
Chen, T., & Guestrin, C. (2016). Xgboost: A scalable tree boosting system. In proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (KDD ‘16). Association for Computing Machinery, New York, NY, USA, 785–794. https://doi.org/10.1145/2939672.2939785.
Google Scholar
Draghici, B. G., Dobre, A. E., Misaros, M., & Stan, O. P. (2022). Development of a human service robot application using pepper robot as a museum guide. 2022 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR), 2022, pp. 1-5. https://doi.org/10.1109/AQTR55203.2022.9802037.
Google Scholar
Du, X., Liu, Y., Lu, Z., Duan, Q., Feng, J., Wu, J., Chen, B., & Zheng, Q. (2022a). A Low-latency communication design for brain simulations. IEEE Network, 36(2), 8–15. https://doi.org/10.1109/MNET.008.2100447
Web of Science ®Google Scholar
Du, X., Tang, S., Lu, Z., Gai, K., Wu, J., & Hung, P. C. K. (2022b). Scientific workflows in IoT environments: A data placement strategy based on heterogeneous edge-cloud computing. ACM Transactions on Management Information Systems, https://doi.org/10.1145/3531327
Web of Science ®Google Scholar
Dudek, W., Węgierek, M., Karwowski, J., Szynkiewicz, W., & Winiarski, T. (2019). Task harmonisation for a single-task robot controller. 2019 12th international workshop on robot motion and control (RoMoCo), Poznan, Poland, 2019, pp. 86-91. https://doi.org/10.1109/RoMoCo.2019.8787385.
Google Scholar
Faial, D., Bernardini, F., Meza, E. M., Miranda, L., & Viterbo, J. (2020). A methodology for taxi demand prediction using stream learning. 2020 international conference on systems, signals and image processing (IWSSIP), Niteroi, Brazil, 2020, pp. 417-422. https://doi.org/10.1109/IWSSIP48289.2020.9145097.
Google Scholar
Fuji, S., Date, M., Nagai, Y., Yasuhara, Y., Tanioka, T., & Ren, F. (2011). Research on the possibility of humanoid robots to assist in medical activities in nursing homes and convalescent wards. 2011 7th International Conference on Natural Language Processing and Knowledge Engineering, 2011, pp. 459-463. https://doi.org/10.1109/NLPKE.2011.6138243.
Google Scholar
Jin, Y., Ye, X., Ye, Q., Wang, T., Cheng, J., & Yan, X. (2020). Demand forecasting of online Car-hailing With stacking ensemble learning approach and large-scale datasets. IEEE Access, 8, 199513–199522. https://doi.org/10.1109/ACCESS.2020.3034355
Google Scholar
Johnson, J. A. (2022). Designing technology for an aging population. In Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems (CHI EA ‘22). Association for Computing Machinery, New York, NY, USA, Article 130, 1–2. 10.11453491101.3503745.
Google Scholar
Jonatan, E. S.-A., Gonzalez-Hernandez, H. G., Martinez-Chan, N., Bielma-Avendano, C. M., Salinas-Molar, X. A., & Garcia-Garcia, M. O. (2022). Assistant Delivery Robot for Nursing Home using ROS: Robotic prototype for medicine delivery and vital signs registration. In The 2022 5th international conference on electronics, communications and control engineering (ICECC 2022). Association for Computing Machinery, New York, NY, USA, 141–148. https://doi.org/10.1145/3531028.3531051.
Google Scholar
Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., & Liu, T.-Y. (2017). LightGBM: a highly efficient gradient boosting decision tree. In proceedings of the 31st international conference on neural information processing systems (NIPS'17). Curran Associates Inc., Red Hook, NY, USA, 3149–3157. https://dl.acm.org/doi/pdf/10.55553294996.3295074.
Google Scholar
Khosla, R., Chu, M.-T., & Nguyen, K. (2013). Affective Robot Enabled Capacity and Quality Improvement of Nursing Home Aged Care Services in Australia. 2013 IEEE 37th Annual Computer Software and Applications Conference Workshops, 2013, pp. 409-414. https://doi.org/10.1109/COMPSACW.2013.89.
Google Scholar
Lopez, A., Paredes, R., Quiroz, D., Trovato, G., & Cuellar, F. (2017). Robotman: A security robot for human-robot interaction. 2017 18th International Conference on Advanced Robotics (ICAR), 2017, pp. 7-12. https://doi.org/10.1109/ICAR.2017.8023489.
Google Scholar
Lu, D.-R., Chai, C.-W., Su, K.-H., & Lin, Y.-S. (2021). On the study of smart medical care system working in senior citizens housing and nursing homes. 2021 IEEE 3rd Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability (ECBIOS), 2021, pp. 278-283. https://doi.org/10.1109/ECBIOS51820.2021.9510818.
Google Scholar
Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Proceedings of the 31st International Conference on Neural Information Processing Systems. New York, 2017: 4768—4777. https://dl.acm.org/doi/pdf/10.55553295222.3295230.
Google Scholar
Manikandan, P., Ramesh, G., Likith, G., Sreekanth, D., & Durga Prasad, G. (2021). Smart nursing robot for COVID-19 patients. 2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), 2021, pp. 839-842. https://doi.org/10.1109/ICACITE51222.2021.9404698.
Google Scholar
Marć, M., Bartosiewicz, A., Burzyńska, J., Chmiel, Z., & Januszewicz, P. (2019). A nursing shortage – a prospect of global and local policies. International Nursing Review, 66(1), 9–16. https://doi.org/10.1111/inr.12473
PubMed Web of Science ®Google Scholar
Munkhdalai, L., Park, K. H., Batbaatar, E., Theera-Umpon, N., & Ryu, K. H. (2020). Deep learning-based demand forecasting for Korean postal delivery service. IEEE Access, 8, 188135–188145. https://doi.org/10.1109/ACCESS.2020.3030938
Google Scholar
Niu, P., Tang, Y., Wang, Z., & Zhu, Y. (2020). Air material demand forecast based on combined neural network. 2020 3rd international conference on electron device and mechanical engineering (ICEDME), Suzhou, People’s Republic of China, 2020, pp. 600-604. https://doi.org/10.1109/ICEDME50972.2020.00142.
Google Scholar
Phankokkruad, M., & Wacharawichanant, S. (2019). An application of convolutional neural network-long short-term memory model for service demand forecasting. 2019 international conference on information and communications technology (ICOIACT), Yogyakarta, Indonesia, 2019, pp. 614-618. https://doi.org/10.1109/ICOIACT46704.2019.8938553.
Google Scholar
Qin, Y., Zhang, Y., & Chen, W. (2018). Public bicycle demand forecast based on PCA-BA-GRNN model. 2018 ieee 9th international conference on software engineering and service science (ICSESS), Beijing, People’s Republic of China, 2018, pp. 1015-1018. https://doi.org/10.1109/ICSESS.2018.8663761.
Google Scholar
Ramos-Carrasco, R., Galvez-Diaz, S., Nuñez-Ponce, V., & Alvarez-Merino, J. (2019). Artificial neural networks to estimate the forecast of tourism demand in Peru. 2019 ieee sciences and humanities international research conference (SHIRCON), Lima, Peru, 2019, pp. 1-4. https://doi.org/10.1109/SHIRCON48091.2019.9024873.
Google Scholar
Trovato, G., Lopez, A., Paredes, R., & Cuellar, F. (2017). Security and guidance: Two roles for a humanoid robot in an interaction experiment. 2017 26th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), pp. 230-235. https://doi.org/10.1109/ROMAN.2017.8172307.
Google Scholar
Vidhya, A. (2020). Food Demand Forecasting [Data set]. Kaggle. https://doi.org/10.34740/KAGGLE/DSV/1138962.
Google Scholar
Wang, Y., Du, X., Lu, Z., Duan, Q., & Wu, J. (2022). Improved LSTM-based time-series anomaly detection in rail transit operation environments. IEEE Transactions on Industrial Informatics, https://doi.org/10.1109/TII.2022.3164087
PubMed Web of Science ®Google Scholar
Yang, S., Li, A., & Li, G. (2022). Evaluation index system and methodology for actively responding to ageing population. In IEEE/WIC/ACM international conference on Web intelligence and intelligent agent technology (WI-IAT ‘21). Association for Computing Machinery, New York, NY, USA, 497–502. https://doi.org/10.1145/3498851.3499005.
Google Scholar
Zhang, X., Mu, X., Xu, H., Alhassan, A. B., & Kadry, H. K. (2021). Vibration characteristics analysis of human- robot coupled system for walking posture of elderly-assistant robot. IEEE Access, 9, 44217–44235. https://doi.org/10.1109/ACCESS.2021.3066397
Web of Science ®Google Scholar
Zhang, Y. (2023). Application of blockchain technology in community-based home elderly care services under population aging. In proceedings of the 11th international conference on software and information engineering (ICSIE ‘22). Association for Computing Machinery, New York, NY, USA, 86–91. https://doi.org/10.1145/3571513.3571528.
Google Scholar
Zháo, M. A., & Jayadi, R. (2021). Forecasting daily visitors and menu demands in an Indonesian chain restaurant using support vector regression machine. 2021 international conference on artificial intelligence and mechatronics systems (AIMS), Bandung, Indonesia, 2021, pp. 1-6. https://doi.org/10.1109/AIMS52415.2021.9466036.
Google Scholar
Zhou, X., Liang, W., Wang, K., & Yang, L. T. (2021). Deep correlation mining based on hierarchical hybrid networks for heterogeneous Big data recommendations. IEEE Transactions on Computational Social Systems, 8(1), 171–178. https://doi.org/10.1109/TCSS.2020.2987846
Web of Science ®Google Scholar
Zhou, X., Liang, W., Yan, K., Li, W., Wang, K., Ma, J., & Jin, Q. (2022). Edge-Enabled Two-stage scheduling based on deep reinforcement learning for internet of everything. IEEE Internet of Things Journal, https://doi.org/10.1109/JIOT.2022.3179231
Web of Science ®Google Scholar
Zieliński, C., Stefańczyk, M., Kornuta, T., Figat, M., Dudek, W., Szynkiewicz, W., Kasprzak, W., Figat, J., Szlenk, M., Winiarski, T., & Banachowicz, K. (2017). Variable structure robot control systems: The rapp approach. Robotics and Autonomous Systems, 94, 226–244. https://doi.org/10.1016/j.robot.2017.05.002
Web of Science ®Google Scholar

Fidan: a predictive service demand model for assisting nursing home health-care robots

Abstract

1. Introduction

2. Related work