1,229

Views

CrossRef citations to date

Altmetric

Articles

Few shot object detection for headdresses and seats in Thangka Yidam based on ResNet and deformable convolution

Hu Wenjina Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou, People’s Republic of China;b School of Mathematics and Computer Science, Northwest Minzu Univsersity, Lanzhou, People’s Republic of ChinaView further author information

Xue Panpana Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou, People’s Republic of China;b School of Mathematics and Computer Science, Northwest Minzu Univsersity, Lanzhou, People’s Republic of ChinaCorrespondence[email protected]
View further author information

He Guoyuana Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou, People’s Republic of China;c National Languages Information Technology, Boulder, Northwest Minzu University, Lanzhou, People’s Republic of ChinaView further author information

Tang Huiyuana Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou, People’s Republic of China;c National Languages Information Technology, Boulder, Northwest Minzu University, Lanzhou, People’s Republic of ChinaView further author information

Song Huafeia Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou, People’s Republic of China;c National Languages Information Technology, Boulder, Northwest Minzu University, Lanzhou, People’s Republic of ChinaView further author information

Yue Chaoyanga Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou, People’s Republic of China;c National Languages Information Technology, Boulder, Northwest Minzu University, Lanzhou, People’s Republic of ChinaView further author information

Few shot object detection for headdresses and seats in Thangka Yidam based on ResNet and deformable convolution

Abstract

1. Introduction

2. Related work

2.1. Few shot object detection method

2.2. FSOD method

3. Proposed method

3.1. Few shot object detection for headdresses and seats in Thangka Yidam based on ResNet and deformable convolution

3.2. Adjustment and optimisation of convolution layer structure

3.3. Introduction of deformable convolution

3.4. Double threshold non-maximum suppression

Table 1. Comparison of AP (%) under different threshold combinations.

4. Experiment

4.1. Experimental setting

4.2. Datasets

4.2.1. Thangka dataset

Table 2. The number distribution of categories before and after Thangka image augmentation.

4.2.2. MS COCO dataset

4.3. Main results

4.3.1. Dataset split and results analysis

Table 3. Experimental instructions for the Thangka data.

Table 4. The evaluation results of Thangka image detection.

Table 5. The comparison results on the MS COCO dataset for 20-way 10-shot.

4.3.2. Ablation experimental analysis

Table 6. The ablation experiment results for key components on the Thangka dataset and COCO dataset.

5. Conclusion

Disclosure statement

References

Information for

Open access

Opportunities

Help and information

Few shot object detection for headdresses and seats in Thangka Yidam based on ResNet and deformable convolution

Abstract

1. Introduction

2. Related work

2.1. Few shot object detection method

2.2. FSOD method

3. Proposed method

3.1. Few shot object detection for headdresses and seats in Thangka Yidam based on ResNet and deformable convolution

3.2. Adjustment and optimisation of convolution layer structure

3.3. Introduction of deformable convolution

3.4. Double threshold non-maximum suppression

Table 1. Comparison of AP (%) under different threshold combinations.

4. Experiment

4.1. Experimental setting

4.2. Datasets

4.2.1. Thangka dataset

Table 2. The number distribution of categories before and after Thangka image augmentation.

4.2.2. MS COCO dataset

4.3. Main results

4.3.1. Dataset split and results analysis

Table 3. Experimental instructions for the Thangka data.

Table 4. The evaluation results of Thangka image detection.

Table 5. The comparison results on the MS COCO dataset for 20-way 10-shot.

4.3.2. Ablation experimental analysis

Table 6. The ablation experiment results for key components on the Thangka dataset and COCO dataset.

5. Conclusion

Disclosure statement

Additional information

Funding

Notes

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date