Full article: Spatio-temporal analysis of error-related brain activity in active and passive brain

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

Electroencephalography (EEG)-based brain–computer interface (BCI) systems infer brain signals recorded via EEG without using common neuromuscular pathways. User brain response to BCI error is a contributor to non-stationarity of the EEG signal and poses challenges in developing reliable active BCI control. Many passive BCI implementations, on the other hand, have the detection of error-related brain activity as their primary goal. Therefore, reliable detection of this signal is crucial in both active and passive BCIs. In this work, we propose CREST: a novel covariance-based method that uses Riemannian and Euclidean geometry and combines spatial and temporal aspects of the feedback-related brain activity in response to BCI error. We evaluate our proposed method with two datasets: an active BCI for 1-D cursor control using motor imagery and a passive BCI for 2-D cursor control. We show significant improvement across participants in both datasets compared to existing methods.

KEYWORDS:

1. Introduction

Brain–computer interface (BCI) systems record brain activity directly from the brain using methods such as electroencephalography (EEG) and attempt to infer the user’s intent [Citation1,Citation2]. Active BCIs such as motor imagery (MI) BCIs are among common BCI systems in which the user imagines moving a part of her/his body resulting in a decrease in power (called an event-related desynchronization or ERD) in various frequency bands [Citation3,Citation4]. Movement imagination of different body parts leads to spatially different desynchronization that can be used by the BCI to detect the imagined movement. In practice, the imagined movement classes (such as right/left hand) can be mapped to, for example, a switch, to control the movement of a robotic limb or a wheelchair. This BCI output is referred to as BCI feedback and the brain response to BCI feedback as feedback-related brain activity. One source of non-stationarity in EEG signals is the feedback-related brain activity [Citation5]. Error-related potentials (ErrP) and error-related spectral components are among the components of this signal [Citation5]. If not taken into account, these signals can pose challenges for real-world application of a BCI system [Citation5–Citation7]. In previous work, we have shown that when the feedback-related activity is appropriately modeled, the information can be combined with the motor imagery classification to greatly improve the overall BCI performance [Citation5,Citation6].

Separate work in passive BCIs (pBCI) [Citation9,Citation10] has shown that the user’s intentions or emotional states can be detected through passive cognitive monitoring and that this signal can be used as an (implicit) control source [Citation11,Citation12]. Therefore, in both active and passive BCIs, single-trial classification of the user’s state with respect to the BCI feedback (whether the BCI output is perceived as an error/undesired or not) is a critical component of a reliable BCI.

Previous work on classifying feedback-related brain activity varies by the type of features used and the classifier that is trained on these features [Citation7]. For instance, authors in [Citation8] and [Citation13] focused on temporal features from one or two fronto-central channels, while others such as [Citation11] considered all available EEG channels and used a windowed-means approach as instructed by [Citation14].

Riemannian geometry has been shown to be promising for reliable classification in various BCI paradigms [Citation15–Citation18]. Methods based on Riemannian geometry have also been used for data augmentation to balance classes for error detection in a P300 speller task [Citation19]. However, to our knowledge, there is no work in the literature that attempts to classify error-related or feedback-related brain activity using Riemannian methods.

In this paper, we explore ways to improve the classification of the error-related brain activity so as to further improve its contribution to an overall BCI system. Specifically, we investigate the spatio-temporal aspects of the error-related brain activity using covariance matrices in two different BCI paradigms by looking at both space and time covariances through Euclidean and Riemannian geometry. Our goal is to better distinguish whether the BCI feedback (output) is perceived by the user as an error.

We evaluate our proposed methods through two different datasets: one from our previous study in which participants were actively controlling cursor movements using right/left-hand motor imagery and another dataset shared with us by Zander and Krol et al. [Citation11] in which participants were passively controlling a cursor on a screen in front of them. An earlier version of this work appeared in [Citation20].

2. Data collection and pre-processing

2.1. Dataset I: active cursor control with motor imagery

Data were recorded from 10 participants after the study was approved by the University Institutional Review Board at UC San Diego. All participants signed a consent form prior to participating in the experiment. EEG data were recorded using a 64-channel BrainAmp system (Brain Products GmbH) at 5000 Hz. Channels were located according to the international 10–20 system and were referenced online to FCz.

Participants were instructed to use motor imagery of their right/left hand to control a cursor on a screen in front of them to the right/left toward a target [Citation21]. At the beginning of each trial, the cursor and the target appeared at the center and three steps away from the center at either right or left side of the screen, respectively (see ). The cursor moved one step every second and trials ended when the cursor hit the target location or the other end of the screen. Participants believed that they were in control of the cursor movements; however, the cursor moved based on a pre-determined sequence of movements that was kept the same across participants. This was to have enough cursor movements toward/away from the target (i.e. good/bad movements) for each participant irrespective of the motor imagery performance. The cursor sequence of movement was randomly generated subject to a few constraints, e.g., no more than two consecutive changes in direction were allowed. For more details about the experiment, please refer to [Citation5].

Figure 1. An example of a trial in dataset I. Participants were instructed to use right/left-hand motor imagery to move the cursor (the blue circle) to right/left toward the target (white circle). We considered movements toward/away from the target as good/bad movements perceived by the participants [Citation5].

The overall goal in a motor-imagery BCI is to detect the imagined class. The common method is to train a classifier to distinguish between right-hand and left-hand motor imagery (or whatever imagery classes have been mapped to ‘move cursor right’ and ‘move cursor left’). As shown in our earlier work [Citation5], there is another classifiable aspect in the EEG signal – whether the cursor moved in the desired or non-desired direction. In the analysis for this paper, our goal is to improve this classification of whether the user was satisfied with the last cursor movement or not, i.e. if the cursor had just moved toward (good) or away (bad) from the target. We call this a good/bad (G/B) classifier.

2.2. Dataset II: passive cursor control

This dataset was recorded at the Technische Universität Berlin, Germany, from 19 participants and shared with us by Zander and Krol et al. [Citation11]. All participants signed a consent form accepted by the ethics committee of the Department of Psychology and Ergonomics before taking part in the experiment. Data were recorded using a 64-channel BrainAmp system (Brain Products GmbH) at 500 Hz. Channels were located according to the international 10–20 system and were referenced online to FCz.

This study had multiple parts including offline and online cursor control; however, we considered only the offline data that were used for calibration in the original study. This part of the study consisted of a cursor moving randomly on a 4 × 4 grid. Participants were instructed to observe cursor movements on the grid and evaluate each movement as ‘appropriate’ or ‘inappropriate’ with respect to reaching the target, which was located in one of the corners of the grid. The cursor moved randomly to one of up to eight adjacent nodes until it reached the target, after which another target was selected and the procedure restarted in the next trial. For more details about the experiment, please refer to [Citation11].

Angular deviance from the optimal path was used to describe and categorize the movements. In , the target is in the top right corner and the cursor (red) at the bottom row. A movement upwards (depicted in -) has an angular deviance of 18°, whereas - depicts an angular deviance of 63°. We considered angles that were below 45° as good movements and angles above 130° as bad movements and the EEG data corresponding to these two labels were used to train a good/bad (G/B) classifier. The angles in between were labeled as neutral and not used for classification.

Figure 2. In dataset II, participants were instructed to ‘judge’ each cursor (red full circle) movement (indicated by the arrow in the static figure) as satisfactory or unsatisfactory with respect to its movement toward/away from the target (red empty circle) [Citation11]. Diagram (a) depicts a cursor’s location and diagrams (b) and (c) specify different next cursor movements and how the angle between the cursor direction of movement and the direct line connecting the cursor to the target location is defined. We considered angles smaller than 45° as good movements and larger than 130° as bad movements perceived by the participants.

2.3. Pre-processing

In each dataset, sections that were contaminated with excessive noise were removed. Independent component analysis (ICA) [Citation22] was applied to data from each participant and independent components representing muscle and eye artifacts were removed. Pre-processing was done in MATLAB [Citation23] and EEGLAB [Citation24]. A maximum of 1 and 3 noisy channels were removed from datasets I and II, respectively. The removed channels were interpolated using EEGLAB and all 64 channels were used for feature extraction and classification. Data were re-referenced to the common average, downsampled to 100 Hz, and filtered in 0.5–10, 1–3, 2–5, 4–7, 6–10, 7–12, 10–15, 12–19, 18–25, 19–30, 25–35, 30–40 Hz with a 100th-order FIR filter using a Kaiser window. The first frequency band was used for the windowed-means method only as will be described in more detail later. Next, data were epoched 50–950 ms after each cursor movement and this segment is called a ‘step’ in what follows. Classification results are reported on a single cursor movement, i.e. for every step.

3. Feature extraction and classification

Feature extraction and classification were implemented in Python. We used scikit-learn [Citation25] to implement classifiers and the pyRiemann toolbox [Citation26] to calculate Riemannian distances and means.

In each dataset, classes (good and bad) were balanced by randomly subsampling the larger class. Therefore, we generated 10 instances of train-test combinations which were kept the same across the different tested methods. In each instance, the train-test ratio is about 4:1. On average, in dataset I instances, there are 573 train and 142 test steps. In dataset II instances, the average train and test steps are 197 and 49, respectively.

Covariance matrices are used at the core of several feature extraction methods in BCI applications [Citation15,Citation27,Citation28]. The methods discussed in this work use space and time covariances of the good (toward the target) and bad (away from the target) steps. Moreover, we looked at different frequency bands, namely 1–3, 2–5, 4–7, 6–10, 7–12, 10–15, 12–19, 18–25, 19–30, 25–35, 30–40 Hz covering the low and high theta, mu, and beta frequency bands and to cover for potential individual differences [Citation5,Citation29]. Covariance matrices were calculated in each frequency band separately. Next, we will explain how we estimated covariances to capture spatial and temporal features and how these were used for classification.

3.1. Space and time covariances

Let $p$ represent the number of channels, $t$ the number of time samples and $i \in {1, . . ., N}$ where $N$ is the number of steps available. Let $X_{i} \in R^{p \times t}$ be an EEG epoch (i.e. step) and $C_{s_{i}}$ and $C_{t_{i}}$ be the sample space and time covariances for the $i^{t h}$ step, respectively. Since the number of time samples in our case (i.e. 90 time samples at 100 Hz sampling rate) is larger than the number of EEG channels $p = 64$ , time covariances are not full rank and thus not positive definite. Also, as we removed eye and muscle components through ICA, space covariances are also rank deficient. Therefore, we used regularization to make the covariance matrices full-rank. Space and time covariances were regularized as follows:

(1)

C \leftarrow (1 - α) C + α \frac{t r a c e (C)}{N} I,

(1)

where C is the covariance matrix, $α$ is the regularization parameter, $t r a c e (C)$ is the sum of the diagonal elements of C and I is the identity matrix with the same size as C. We used a data-driven method [Citation30] to estimate the regularization parameter for data from each participant, in each frequency band for space and time covariances separately. We only used train data to estimate the shrinkage parameters ( $α$ ).

3.2. Common spatial patterns (CSP)

The filterbank common spatial patterns (FB-CSP) algorithm was proposed by Ang et al. [Citation32] to detect the imagined movement class in a motor imagery BCI. Inspired by this method, in our previous work, we used a similar approach to classify the error-related brain activity in a motor imagery BCI [Citation5].

Let $C_{s_{i}}^{g}$ and $C_{s_{i}}^{b}$ represent the space covariance of the $i^{t h}$ good and bad steps, respectively, for a specific frequency band. The average of the trace normalized sample covariances for each of the good and bad classes were estimated as:

(2)

Σ_{g} = \frac{1}{N} \sum_{i} C_{s_{i}}^{g} / t r a c e (C_{s_{i}}^{g})

(2)

and

(3)

Σ_{b} = \frac{1}{N} \sum_{i} C_{s_{i}}^{b} / t r a c e (C_{s_{i}}^{b})

(3)

respectively, where N is the number of steps. As mentioned earlier, the number of steps in good and bad classes were balanced. CSP filters for each frequency band were estimated by simultaneous diagonalization of the two covariance matrices:

(4)

W^{T} Σ_{g} W = Λ^{g} W^{T} Σ_{b} W = Λ^{b}

(4)

where $Λ^{g}$ and $Λ^{b}$ are diagonal matrices such that $Λ^{g} + Λ^{b} = I$ [Citation27]. CSP filters, represented by the columns of $W$ , are the solutions of the following generalized eigenvalue problem:

(5)

Σ_{g} W = λ Σ_{b} W .

(5)

Next, 6 filters (eigenvectors) corresponding to the 3 largest and smallest eigenvalues were selected. EEG epochs were filtered through the selected filters in each frequency band. The logarithm of the variance (across time) of the filtered EEG data through each of the 6 selected filters were calculated as features. These 6 features in each of the 11 frequency bands were used for classification using a 66D regularized linear discriminant analysis (r-LDA) classifier [Citation25,Citation27,Citation31].

3.3. Common temporal patterns (CTP)

The common temporal patterns (CTP) algorithm, proposed by Yu et al. [Citation33], is the temporal counterpart of the common spatial patterns in which the sample mean of the good and bad time covariances are considered instead of space covariances. Similar to FB-CSP, we consider a filterbank version of CTP. EEG epochs were filtered through 6 CTP filters (corresponding to the 3 largest and smallest eigenvalues) in each frequency band and the logarithm of the variance (across 64 channels) of the filtered epochs were selected as features (6 features for each band, hence a total of 66 features). A regularized linear discriminant analysis (r-LDA) was trained on the selected features [Citation25,Citation31].

3.4. Common spatial and temporal patterns (CSP-CTP)

To combine spatio-temporal features, we first calculated CSP filters and selected 6 filters (corresponding to the 3 largest and smallest eigenvalues). Then, EEG epochs were filtered through the CSP filters corresponding to the good class. The CTP method was then used to capture temporal features by learning 6 CTP filters (corresponding to the 3 largest and smallest eigenvalues). Another set of 6 CTP filters were trained using the EEG epochs filtered through the CSP filters corresponding to the bad class. describes this method.

Figure 3. Method of CSP-CTP: CSP filters are trained on each frequency band ( $p = 64$ and $t = 90$ ). Six CSP filters (corresponding to the 3 largest and smallest eigenvalues) are selected and EEG epochs are filtered through each. Then, CTP filters are trained on the good (G) and bad (B) CSP-filtered data separately.

The above procedure was done in each frequency band separately to select a total of 36 features and the selected features from all 11 frequency bands were concatenated. Finally, a regularized-LDA classifier was trained [Citation25,Citation31].

3.5. Distance to Riemannian mean of spatial covariances (DRM-S)

Full-rank covariance matrices lie on a Riemannian manifold pertaining to the symmetric positive definite (SPD) matrices [Citation15]. Let $A (n)$ be the set of all $n \times n$ SPD matrices. The Riemannian distance between $A_{1} \in A (n)$ and $A_{2} \in A (n)$ is defined as follows:

(6)

δ_{R} (A_{1}, A_{2}) = | | L o g (A_{1}^{- 1} A_{2}) | |_{F} = [\sum_{i = 1}^{n} l o g^{2} λ_{i}]^{1 / 2}

(6)

where $λ_{i}$ are the eigenvalues of $(A_{1}^{- 1} A_{2})$ . Since $A_{1}$ and $A_{2}$ are both SPD, $λ_{i}$ are real positive (non-zero) values. Also, $| | . | |_{F}$ represents the Frobenius norm and $L o g (.)$ the matrix logarithm.

The mean of the SPD matrices $A_{1}, A_{2}, . . ., A_{l}$ on the Riemannian manifold is defined as follows [Citation15]:

(7)

M (A_{1}, A_{2}, . . ., A_{l}) = a r g m i n_{X \in A (n)} \sum_{i = 1}^{l} δ_{R}^{2} (X, A_{i}) .

(7)

There is no closed form solution for (7); however, it can be solved iteratively [Citation34].

Based on the defined matrix relationships on the manifold, we propose a filter bank generalization of the minimum distance to Riemannian mean (MDM) classifier [Citation26]. First, the Riemannian mean of the good and bad space covariances in each frequency band on the training set were estimated as described earlier. Next, in each frequency band, features were selected as the Riemannian distances to the Riemannian means of the good and bad classes. This resulted in a total of 22 features: 11 frequency bands ×2 good and bad classes. A logistic regression classifier was trained on the selected features [Citation25]. We trained a logistic regression classifier for all Riemannian methods since we found that the distribution of features was far from a multivariate normal distribution.

3.6. Distance to Riemannian mean of temporal covariances (DRM-T)

This method is the temporal counterpart of the DRM-S method described previously. After the Riemannian mean of the good and bad time covariances in each frequency band on the training set were estimated, features were selected as the Riemannian distances to the Riemannian means of the good and bad classes. This resulted in a total of 22 features: 11 frequency bands ×2 good and bad classes. Logistic regression was trained on the selected features [Citation25].

3.7. Distance to Riemannian mean of spatial and temporal covariances (DRM-ST)

This method combines spatial and temporal Riemannian geometry-based features by concatenating DRM-S and DRM-T features described in the previous two subsections. This resulted in a total of 44 features: 11 frequency bands ×2 good and bad classes ×2 time and space covariances. A logistic regression classifier was trained on the selected features [Citation25].

3.8. Covariance-based Riemannian and Euclidean spatio-temporal classifier (CREST)

We combined DRM-ST and CSP-CTP to capture spatio-temporal features using both Riemannian and common spatial and temporal classifiers. We call this method CREST. For each classifier, we first calculated the signed distance of each trial to the decision hyperplane and applied a logistic function to estimate the probability of the trial belonging to the the good (or bad) class as the classifier score. Logistic regression was used to combine DRM-ST and CSP-CTP classifier scores [Citation25].

3.9. Windowed-means (WM)

We compared our proposed methods with the windowed means method which is widely used for single-trial event-related potential (ERP) classification [Citation11,Citation14]. EEG data on each channel were bandpass filtered to 0.5–10 Hz as described earlier and epoched 50–950 ms after each cursor movement. We calculated the mean of the signal on each channel in 9 non-overlapping time windows, i.e. each covering 100 ms. Then, a regularized linear discriminant analysis (r-LDA) classifier was trained on the selected features [Citation14,Citation25,Citation31].

3.10. CREST+WM

Finally, we combined DRM-ST, CSP-CTP and WM as well to compare with the WM classifier to determine whether WM and CREST capture different features. Logistic regression was used to combine DRM-ST, CSP-CTP and WM classifier scores as explained earlier [Citation25].

4. Results and discussion

and plot the event-related potential (ERP), i.e. the average EEG waveform time-locked to the cursor movement, for ‘Good’ and ‘Bad’ classes on channel Fz across participants for datasets I and II, respectively. For the plot, EEG data for each participant were high-pass filtered at 1 Hz and epoched −100 to 1000 ms time locked to each cursor movement. Then, the average waveform for each participant was calculated in each class. The solid lines on each plot represent the average across participants and the shaded color represents the standard error of the mean. Note that the two classes correspond to cursor movements toward/away from the target and our goal in this paper is to train a classifier to reliably distinguish among them after every cursor movement.

Figure 4. ERP in dataset I. The blue curve corresponds to the brain response to ‘good’ cursor movements, i.e. toward the target. The red curve, on the other hand, corresponds to the brain response to ‘bad’ movements, i.e. away from the target.

Figure 5. ERP in dataset II. The blue curve corresponds to the brain response to ‘good’ cursor movements, i.e. toward the target. The red curve, on the other hand, corresponds to the brain response to ‘bad’ movements, i.e. away from the target.

We compared the G/B classification performance in datasets I and II using CSP and DRM-S as well as CTP and DRM-T. and report the average (first number in each entry) and standard error of the mean (second number in each entry) for classification accuracy over 10 instances of train-test for each participant in datasets I and II, respectively. On average, Riemannian methods perform better across participants and this difference is statistically significant for dataset I (paired-sample t-test, $p < 0.03$ ). DRM-T performs significantly better than CTP in dataset II (paired-sample t-test, $p < 0.01$ ). However, in this dataset, the difference between DRM-S and CSP is not statistically significant across participants.

Table 1. Dataset I: G/B classification accuracy using spatial and temporal features separately. Each table entry is the average classification accuracy (first number) together with the standard error of the mean (second number) over 10 instances of train-test for each participant. Riemannian methods outperform their counterparts and this difference is significant across participants (paired-sample t-test, $p < 0.03$ ).

Display Table

Table 2. Dataset II: G/B classification accuracy using spatial and temporal features separately. Each table entry is the average classification accuracy (first number) together with the standard error of the mean (second number) over 10 instances of train-test for each participant. DRM-T outperforms its counterpart and this difference is significant across participants (paired-sample t-test, $p < 0.01$ ). However, the difference between CSP and DRM-S is not statistically significant.

Display Table

and report the classification accuracy of the windowed-means method (WM) and our proposed spatio-temporal methods: DRM-ST and CSP-CTP, CREST and CREST+WM. We used paired-sample t-tests to compare the difference between WM and the other methods across participants for each dataset. CREST and CREST+WM outperform WM in both datasets (paired-sample t-test, $p < 0.006$ , which stays significant at the 0.05 threshold with Bonferroni correction for the number of tests).

Table 3. Dataset I: G/B classification accuracy comparing the proposed spatio-temporal methods and WM. Each table entry is the average classification accuracy (first number) together with the standard error of the mean (second number) over 10 instances of train-test for each participant. Significantly improved results across participants are represented in bold fonts (paired-sample t-test, $p < 0.006$ , which stays significant at the 0.05 threshold with Bonferroni correction for the number of tests, i.e. 4).

Display Table

Table 4. Dataset II: G/B classification accuracy comparing the proposed spatio-temporal methods and WM. Each table entry is the average classification accuracy (first number) together with the standard error of the mean (second number) over 10 instances of train-test for each participant. Significantly improved results across participants are represented in bold fonts (paired-sample t-test, $p < 0.005$ , which stays significant at the 0.05 threshold with Bonferroni correction for the number of tests, i.e. 4).

Display Table

The difference in performance of DRM-ST and WM is not statistically significant for either of the datasets. However, CSP-CTP outperforms WM in dataset II and this difference is statistically significant (paired-sample t-test, $p < 0.005$ , which stays significant at the 0.05 threshold with Bonferroni correction for the number of tests), while the performance of CSP-CTP in dataset I is not statistically different from that of WM.

and show WM, CREST and CREST+WM performance as bar plots for easier visualization.

Figure 6. WM, CREST and CREST+WM in dataset I. The bars and error bars represent the average classification accuracy and the standard error of the mean, respectively, i.e. first and second entries in columns 2, 5 and 6.

Figure 7. WM, CREST and CREST+WM in dataset II. The bars and error bars represent the average classification accuracy and the standard error of the mean, respectively, i.e. the first and second entries in columns 2, 5 and 6.

5. Conclusions

We proposed spatio-temporal methods to classify the error-related brain activity and evaluated our results on two different datasets. The first dataset is from an active motor imagery BCI in which the user’s brain response to the BCI feedback is an implicit piece of information and using this information can improve the overall BCI performance [Citation5]. We also evaluated our proposed methods on a passive BCI dataset in which participants were evaluating the movements of a cursor. In the latter, error-related brain activity is the core information to be classified even though it is not explicitly provided by the user.

We compared DRM-S and DRM-T that use Riemannian distances as features, with CSP and CTP methods, respectively, in their capacity for classifying feedback-related brain activity in response to BCI error. Our results show that on average across participants in both datasets, methods that use features from Riemannian geometry are more powerful when considering spatial or temporal features separately.

We also proposed methods to combine spatial and temporal features that use Riemannian distances (DRM-ST) and Euclidean geometry-based methods of common patterns (CSP-CTP). We also proposed to combine these two methods (CREST) and showed that this combined method outperforms the windowed-means (WM) method and the difference is statistically significant across participants in both datasets.

Abbreviations

Table

Download CSV Display Table

Acknowledgments

We would like to thank Laurens R. Krol and Thorsten O. Zander for sharing their dataset with us.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was supported by the NSF [IIS 1219200, 1817226, 1528214]; DAAD [short-term research grant]; UC San Diego [Chancellor’s Research Innovation Scholarships G2171, G3155 and the Mary Anne Fox dissertation year fellowship]; NIH [5T32MH020002-18].

References

Farwell LA, Donchin E. Talking off the top of your head: toward a mental prosthesis utilizing event- related brain potentials. Electroencephalogr Clin Neurophysiol. 1988;70(6):510–523.
PubMedGoogle Scholar
Wolpaw JR, McFarland DJ, Neat GW, et al. An EEG-based brain-computer interface for cursor control. Electroencephalogr Clin Neurophysiol. 1991;78(3):252–259.
PubMedGoogle Scholar
Pfurtscheller G, Brunner C, Schlögl A, et al. Mu rhythm (de) synchronization and EEG single-trial classification of different motor imagery tasks. NeuroImage. 2006;31(1):153–159.
PubMed Web of Science ®Google Scholar
McFarland DJ, Miner LA, Vaughan TM, et al. Mu and beta rhythm topographies during motor imagery and actual movements. Brain Topogr. 2000;12(3):177–186.
PubMed Web of Science ®Google Scholar
Mousavi M, Koerner AS, Zhang Q, et al. Improving motor imagery BCI with user response to feedback. Brain-Comput Interfaces. 2017;4(1–2):74–86.
Web of Science ®Google Scholar
de Sa, V.R. An interactive control strategy is more robust to non-optimal classification boundaries. ICMI’12 pp 579-586 October 22-26, 2012 Santa Monica, California, USA. 2012. DOI: 10.1145/2388676.2388798.
Google Scholar
Chavarriaga R, Sobolewski A, Millán J. Errare machinale est: the use of error-related potentials in brain-machine interfaces. Front Neurosci. 2014;8:208.
PubMed Web of Science ®Google Scholar
Schalk G, Wolpaw JR, McFarland DJ, et al. EEG-based communication: presence of an error potential. Clin Neurophysiol. 2000;111(12):2138–2144.
PubMed Web of Science ®Google Scholar
Zander TO, Kothe C. Towards passive brain–computer interfaces: applying brain–computer interface technology to human–machine systems in general. J Neural Eng. 2011;8(2):025005.
PubMed Web of Science ®Google Scholar
Krol L, Andreessen L, Zander T. Passive brain-computer interfaces: A perspective on increased interactivity. Brain-Comput Interfaces Handb. 2018;69–86.
Google Scholar
Zander TO, Krol LR, Birbaumer NP, et al. Neuroadaptive technology enables implicit cursor control based on medial prefrontal cortex activity. Proc Nat Acad Sci. 2016;113(52):14898–14903.
PubMed Web of Science ®Google Scholar
Zander TO, Brönstrup J, Lorenz R, et al. Advances in physiological computing. In: Fairclough SH, Gilleade K, editors. Towards BCI-based implicit control in human–computer interaction. London: Springer; 2014. p. 67–90.
Google Scholar
Ferrez PW, Millán JD. Error-related EEG potentials generated during simulated brain–computer interaction. IEEE Trans Biomed Eng. 2008;55(3):923–929.
PubMed Web of Science ®Google Scholar
Blankertz B, Lemm S, Treder M, et al. Single-trial analysis and classification of ERP componentsa tutorial. NeuroImage. 2011;56(2):814–825.
PubMed Web of Science ®Google Scholar
Barachant A, Bonnet S, Congedo M, et al. Riemannian geometry applied to BCI classification. International Conference on Latent Variable Analysis and Signal Separation; Saint Malo, France. 2010. p. 629–636.
Google Scholar
Congedo M, Barachant A, Bhatia R. Riemannian geometry for EEG-based brain-computer interfaces; a primer and a review. Brain-Comput Interfaces. 2017;4(3):155–174.
Web of Science ®Google Scholar
Kalunga EK, Chevallier S, Barthélemy Q, et al. Online SSVEP-based BCI using Riemannian geometry. Neurocomputing. 2016;191:55–68.
Web of Science ®Google Scholar
Barachant A, Congedo M. A plug&play P300 BCI using information geometry. arXiv Preprint. 2014; https://arxiv.org/abs/1409.0107.
Google Scholar
Kalunga EK, Chevallier S, Barthélemy Q. Data augmentation in Riemannian space for brain-computer interfaces. STAMLINS; 2015.
Google Scholar
Mousavi M, de Sa VR. Spatio-temporal analysis of feedback-related brain activity in brain-computer interfaces. NeurIPS 2018 Workshop on modeling and decision-making in the spatiotemporal domain; Montreal, Canada; 2018.
Google Scholar
Mousavi M, de Sa VR. Towards elaborated feedback for training motor imagery brain computer interfaces. Proceedings of the 7th Graz Brain-Computer Interface Conference 2017; Graz, Austria; 2017. p. 332–337.
Google Scholar
Makeig S, Bell AJ, Jung TP, et al. Independent component analysis of electroencephalographic data. Advances in neural information processing systems; Denver, Colorado; 1996. p. 145–151.
Google Scholar
MATLAB. Natick, Massachusetts, United States: The MathWorks Inc.; 2018b.
Google Scholar
Delorme A, Makeig S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods. 2004;134(1):9–21. Available from: http://www.sciencedirect.com/science/article/pii/S0165027003003479
PubMed Web of Science ®Google Scholar
Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–2830.
Web of Science ®Google Scholar
Barachant A, Bonnet S, Congedo M, et al. Multiclass brain–computer interface classification by Riemannian geometry. IEEE Trans Biomed Eng. 2012;59(4):920–928.
PubMed Web of Science ®Google Scholar
Blankertz B, Tomioka R, Lemm S, et al. Optimizing spatial filters for robust EEG single-trial analysis. IEEE Signal Process Mag. 2008;25(1):41–56.
Web of Science ®Google Scholar
Lotte F, Bougrain L, Cichocki A, et al. A review of classification algorithms for EEG-based brain– computer interfaces: a 10 year update. J Neural Eng. 2018;15(3):031005.
PubMed Web of Science ®Google Scholar
Noh E, Herzmann G, Curran T, et al. Using single-trial EEG to predict and analyze subsequent memory. NeuroImage. 2014;84:712–723.
PubMed Web of Science ®Google Scholar
Schäfer J, Strimmer K. A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics. Stat Appl Genet Mol Biol. 2005;4(1):2194–6302.
PubMedGoogle Scholar
Ledoit O, Wolf M. Improved estimation of the covariance matrix of stock returns with an application to portfolio selection. J Empirical Finance. 2003;10(5):603–621.
Google Scholar
Ang KK, Chin ZY, Wang C, et al. Filter bank common spatial pattern algorithm on BCI competition IV datasets 2a and 2b. Front Neurosci. 2012;6:39.
PubMed Web of Science ®Google Scholar
Yu K, Shen K, Shao S, et al. Common spatio-temporal pattern for single-trial detection of event- related potential in rapid serial visual presentation triage. IEEE Trans Biomed Eng. 2011;58(9):2513–2520.
PubMed Web of Science ®Google Scholar
Moakher M. A differential geometric approach to the geometric mean of symmetric positive-definite matrices. SIAM Journal on Matrix Analysis and Applications. 2005;26(3):735–747.
Google Scholar

Spatio-temporal analysis of error-related brain activity in active and passive brain–computer interfaces

ABSTRACT

1. Introduction

2. Data collection and pre-processing