Research Article

Using Skeleton Correction to Improve Flash Lidar-based Gait Recognition

Article: 2043525 | Received 11 Jun 2021, Accepted 14 Feb 2022, Published online: 14 Mar 2022

Figures & data

Figure 1. Sample frames of lidar data. The top and bottom rows show depth (range) and intensity data, respectively.

Figure 2. Examples of noisy segmented silhouettes from flash lidar data.

Figure 3. Pipeline for gait recognition using the joint correction criterion of GlidarPoly. Equation (4),

$$L_j^i = \frac{2}{N_{\text{pixels}}} \tan\!\left(\frac{\theta_{\text{aov}}}{2}\right) L_{p_j}^i \, D_{\text{camera}}^i, \tag{4}$$

describes how depth data are combined with the output of a 2D skeleton detector (skeleton joints in the 2D image frame of reference) to create the 3D location of the joints in the real-world frame of reference.
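
For illustration, a minimal sketch of how Equation (4) could be applied, assuming a square flash-lidar detector and the joint's range read directly from the depth image; the function and variable names (pixel_length_to_metric, theta_aov_rad, joint_2d_to_3d) are ours, not the authors' implementation.

```python
import numpy as np

def pixel_length_to_metric(length_px, depth_m, n_pixels, theta_aov_rad):
    # Equation (4): L = (2 / N_pixels) * tan(theta_aov / 2) * L_px * D
    return (2.0 / n_pixels) * np.tan(theta_aov_rad / 2.0) * length_px * depth_m

def joint_2d_to_3d(u, v, depth_img, theta_aov_rad):
    """Map pixel coordinates (u, v) of a detected joint to a 3D point."""
    h, w = depth_img.shape                 # square detector assumed: h == w
    d = float(depth_img[int(v), int(u)])   # lidar range at the joint
    x = pixel_length_to_metric(u - w / 2.0, d, w, theta_aov_rad)
    y = pixel_length_to_metric(v - h / 2.0, d, h, theta_aov_rad)
    return np.array([x, y, d])
```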

Figure 4. Pipeline for outlier removal. Inputs to “3D Joint location estimator” remain the same as in Figure 3.
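
The exact outlier criterion is specified in the paper; as a purely hypothetical stand-in, a sliding-window median absolute deviation (MAD) rule over a joint-coordinate sequence could look like this:

```python
import numpy as np

def flag_outliers(seq, window=7, thresh=3.0):
    """Flag frames whose value deviates strongly from the local median
    (sliding-window MAD rule). A hypothetical stand-in, not the paper's
    actual outlier criterion."""
    n = len(seq)
    flags = np.zeros(n, dtype=bool)
    for i in range(n):
        lo, hi = max(0, i - window // 2), min(n, i + window // 2 + 1)
        win = seq[lo:hi]
        med = np.median(win)
        mad = np.median(np.abs(win - med)) + 1e-9   # avoid division by zero
        flags[i] = abs(seq[i] - med) / mad > thresh
    return flags
```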

Figure 5. Top row: sample frames with correctly detected skeletons; bottom row: frames with faulty skeletons.

Figure 6. The skeleton model that we use in this work. Left: index of each joint in the skeleton model. Right: skeleton model in a sample frame.

Figure 7. Illustration of length-based feature vectors. Left: description of each feature (L and R refer to the left and right joints, respectively). Right: illustration of the features.

Figure 8. Illustration of vector-based feature vectors. Left: description of each feature (L and R refer to the left and right joints, respectively). Right: illustration of the features.
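
To make the two feature types concrete, here is a minimal sketch assuming a (J, 3) array of corrected 3D joint positions per frame; the joint indices and pairs below are placeholders, since the exact feature lists are given in Figures 7 and 8.

```python
import numpy as np

# Placeholder joint pairs; the actual feature definitions are listed in
# Figures 7 and 8 (indices follow the skeleton model of Figure 6).
PAIRS = [(5, 9), (6, 10), (5, 6), (9, 10)]

def length_based_features(joints):
    """Scalar distances between selected joint pairs (Figure 7 style)."""
    return np.array([np.linalg.norm(joints[a] - joints[b]) for a, b in PAIRS])

def vector_based_features(joints):
    """3D displacement vectors between the same pairs (Figure 8 style)."""
    return np.concatenate([joints[a] - joints[b] for a, b in PAIRS])
```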

Figure 9. Effect of joint location sequence filtering. From top: sample joint location sequences before (first row) and after (second row) joint location sequence filtering (each sequence corresponds to one coordinate (x, y, or z) of one joint through time). Note the abundance of missing values in the first row, visible as gaps in the plotted signals, which are recovered by the joint correction (second row). The last two rows show samples of faulty and missing skeleton joints before (third row) and after (bottom row) joint location sequence filtering.
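
One plausible realization of such sequence filtering, assuming the correction fits a low-order polynomial to each joint coordinate over time (as the name GlidarPoly suggests) and fills the gaps from the fit; the paper defines the actual criterion.

```python
import numpy as np

def fill_joint_sequence(seq, degree=3):
    """Fill missing values (NaN) in one joint-coordinate sequence by a
    least-squares polynomial fit over the valid frames. A sketch only;
    the actual GlidarPoly criterion is defined in the paper."""
    t = np.arange(len(seq))
    valid = ~np.isnan(seq)
    if valid.sum() <= degree:              # not enough points to fit
        return seq
    coeffs = np.polyfit(t[valid], seq[valid], degree)
    out = seq.copy()
    out[~valid] = np.polyval(coeffs, t[~valid])   # fill gaps from the fit
    return out
```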

Figure 10. Failure examples of the joint location correction filtering. Sample frames of skeleton joints, before (top) and after (bottom) the joint location correction.

Figure 11. Two examples of the ankle-to-ankle distance sequence of flash lidar data after joint correction. While the graph on the left shows a clear periodic pattern, the sequence on the right lacks such a pattern.
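
As an illustration of how such a sequence can be computed and segmented into gait cycles, a sketch assuming a (T, J, 3) joint sequence and peak detection on the ankle-to-ankle distance; the ankle indices and the peak-spacing parameter are assumptions.

```python
import numpy as np
from scipy.signal import find_peaks

def ankle_distance(joints_seq, l_ankle=12, r_ankle=13):
    """Per-frame ankle-to-ankle distance from a (T, J, 3) joint sequence.
    Ankle indices are illustrative; see the skeleton model of Figure 6."""
    return np.linalg.norm(joints_seq[:, l_ankle] - joints_seq[:, r_ankle], axis=1)

def step_segments(dist_seq, min_frames=10):
    """Peaks of the distance sequence mark successive steps; consecutive
    peak pairs give step segments (two steps ~ one gait cycle)."""
    peaks, _ = find_peaks(dist_seq, distance=min_frames)
    return list(zip(peaks[:-1], peaks[1:]))
```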

Figure 12. Illustration of two types of walking paths: walking forward and backward (dashed line) and diamond walking (solid line).

Figure 13. Sample frames of diamond walking that capture a range of different poses.

Table 1. Correct identification scores for the proposed features (**) and the other methods. LB and VB stand for the length-based and vector-based feature vectors, respectively. Results are shown for the original skeletons (without joint correction) and after applying GlidarPoly. We also include results for the proposed features after outlier removal.

Table 2. Correct identification scores with statistics of features computed over the gait cycle. LB and VB stand for the length-based and vector-based feature vectors, respectively. The 3-statistic case computes only the mean, maximum, and standard deviation of each feature over every gait cycle. The 6-statistic scenario adds the median and the lower and upper quartiles to the initial three statistics.
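
A minimal sketch of the two statistic sets named in Table 2, computed over a (T, F) feature sequence covering one gait cycle:

```python
import numpy as np

def cycle_statistics(features, six=True):
    """Summarize a (T, F) per-cycle feature sequence.
    3-statistic case: mean, maximum, standard deviation per feature;
    the 6-statistic case adds median and lower/upper quartiles."""
    stats = [features.mean(axis=0), features.max(axis=0), features.std(axis=0)]
    if six:
        stats += [np.median(features, axis=0),
                  np.percentile(features, 25, axis=0),
                  np.percentile(features, 75, axis=0)]
    return np.concatenate(stats)
```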

Table 3. Correct identification scores for each subject class for the single-shot scenario with vector-based features. The lowest and second-lowest accuracy and F-score values are underlined.

Figure 14. t-SNE visualization of the length-based features before (left) and after (right) applying joint correction. There is a high level of inter-class intersection before joint correction (left) that is mostly resolved after correcting the joint locations, creating clusters that are more distinctive (right).
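
A visualization like Figures 14 and 15 can be reproduced with an off-the-shelf t-SNE implementation; the sketch below uses scikit-learn with assumed hyperparameters (perplexity, random seed), not necessarily those used by the authors.

```python
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

def plot_tsne(X, y):
    """2D t-SNE embedding of an (N, F) feature matrix X, colored by
    subject labels y. Hyperparameters here are assumptions."""
    emb = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(X)
    plt.scatter(emb[:, 0], emb[:, 1], c=y, cmap="tab10", s=10)
    plt.title("t-SNE of gait features")
    plt.show()
```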

Figure 15. t-SNE visualization of the vector-based features before (left) and after (right) applying joint correction. Before joint correction, high inter-class intersection and intra-class separation are observed (left). Joint correction transforms the features into well-separated clusters (right).

Figure 16. Comparison of classification accuracy for vector-based features based on the number of missing joints in the original skeletons, before and after applying GlidarPoly for joint correction. The samples with no missing joints also include noisy samples. All cases show improvement after applying the joint location correction.

Table 4. Correct identification scores for each subject class for the statistics of vector-based features over the gait cycle. The lowest and second-lowest accuracy and F-score values are underlined.

Figure 17. Average classification accuracy for different training set sizes given multiple numbers of test examples, for the single-shot (left) and statistics-over-the-gait-cycle (right) scenarios. Both plots are obtained with vector-based features.

Table 5. Single-shot identification: Rank-1 identification accuracy for the proposed features and several RGB-based (features extracted from RGB images) and depth-based (features extracted using depth data, e.g. skeleton-based features) features on the IAS-Lab RGBD-ID “TestingA” (different outfits) and “TestingB” (different rooms, various illuminations) sets. For (Rao et al. 2021), we report only the best result, achieved by the Reverse Reconstruction method. For our features, we show only the best results, achieved by NN (Nearest Neighbors) for the length-based features and SVM (Support Vector Machine) for the vector-based features.
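
Rank-1 identification here amounts to top-1 classification accuracy over subject identities. A minimal sketch with the two classifier families named in the caption, using scikit-learn; the hyperparameters below are assumptions, not the paper's settings.

```python
from sklearn.metrics import accuracy_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

def rank1_accuracy(clf, X_train, y_train, X_test, y_test):
    """Top-1 (rank-1) identification accuracy for a fitted classifier."""
    clf.fit(X_train, y_train)
    return accuracy_score(y_test, clf.predict(X_test))

# Illustrative pairing from the caption: NN for length-based features,
# SVM for vector-based features (classifier settings are assumptions).
# acc_lb = rank1_accuracy(KNeighborsClassifier(n_neighbors=1), Xlb_tr, y_tr, Xlb_te, y_te)
# acc_vb = rank1_accuracy(SVC(kernel="rbf"), Xvb_tr, y_tr, Xvb_te, y_te)
```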

Table 6. Single-shot identification: Rank-1 identification accuracy for the proposed features on IAS-Lab RGBD-ID “TestingA” (different outfits) and “TestingB” (different rooms, various illuminations) before (with added noise and removed joints) and after applying GlidarPoly for correction. We show only the best results, achieved by NN (Nearest Neighbors) and SVM (Support Vector Machine) for the length-based and vector-based features, respectively.

Table 7. Rank-1 identification accuracy using the six statistics of the proposed features on IAS-Lab “TestingA” (different outfits) and “TestingB” (different rooms, various illuminations) after joint location correction. We show only the best results on average, achieved by NN (Nearest Neighbors) and SVM (Support Vector Machine) for the length-based and vector-based features, respectively.

Figure 18. Comparison of the performance of the {mean, maximum, standard deviation} set, the {lower quartile, upper quartile, median} set, and the set of all six statistics in capturing the dynamics of the motion after joint location correction. The comparison is performed for the lidar data and the “TestingA” (different outfits) and “TestingB” (different rooms, various illuminations) IAS-Lab sets, with both types of features and with SVM (Support Vector Machine) and NN (Nearest Neighbors) as classifiers. LB and VB stand for the length-based and vector-based features, respectively. In the majority of cases, the {lower quartile, upper quartile, median} set outperforms the {mean, maximum, standard deviation} set.