1,265
Views
1
CrossRef citations to date
0
Altmetric
Articles

Satellite and instrument entity recognition using a pre-trained language model with distant supervision

, , &
Pages 1290-1304 | Received 14 Mar 2022, Accepted 23 Jul 2022, Published online: 08 Aug 2022

References

  • Chen, Fang, and Zhongchang Sun. 2022. “Big Earth Data for Achieving the Sustainable Development Goals in the Belt and Road Region.” Big Earth Data 6 (1). Taylor & Francis, 1–2. doi:10.1080/20964471.2022.2033424.
  • Collobert, Ronan, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. “Natural Language Processing (Almost) from Scratch.” The Journal of Machine Learning Research 12 (null): 2493–2537.
  • Craglia, Max, Jiri Hradec, Stefano Nativi, and Mattia Santoro. 2017. “Exploring the Depths of the Global Earth Observation System of Systems.” Big Earth Data 1 (1–2): Taylor & Francis, 21–46. doi:10.1080/20964471.2017.1401284.
  • Devlin, Jacob, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding.” In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 4171–4186. Minneapolis, Minnesota: Association for Computational Linguistics. doi:10.18653/v1/N19-1423.
  • Duan, X., J. Zhang, R. Ramachandran, P. Gatlin, M. Maskey, J. J. Miller, K. Bugbee, and T. J. Lee. 2018. A Neural Network-Powered Cognitive Method of Identifying Semantic Entities in Earth Science Papers.” In 2018 IEEE International Conference on Cognitive Computing (ICCC), 9–16. doi:10.1109/ICCC.2018.00009.
  • Fries, Jason A., Sen Wu, Alexander Ratner, and Christopher Ré. 2017. SwellShark: A Generative Model for Biomedical Named Entity Recognition without Labeled Data.” CoRR abs/1704.06360. http://arxiv.org/abs/1704.06360.
  • Giannakopoulos, Athanasios, Claudiu Musat, Andreea Hossmann, and Michael Baeriswyl. 2017. “Unsupervised Aspect Term Extraction with B-LSTM & CRF Using Automatically Labelled Datasets.” In Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 180–188. Copenhagen, Denmark: Association for Computational Linguistics. doi:10.18653/v1/w17-5224.
  • Guo, Huadong. 2020. “Big Earth Data Facilitates Sustainable Development Goals.” Big Earth Data 4 (1): Taylor & Francis, 1–2. doi:10.1080/20964471.2020.1730568.
  • Guo, Huadong, Dong Liang, Fang Chen, and Zeeshan Shirazi. 2021. “Innovative Approaches to the Sustainable Development Goals Using Big Earth Data.” Big Earth Data 5 (3): Taylor & Francis, 263–276. doi:10.1080/20964471.2021.1939989.
  • Guo, Huadong, Stefano Nativi, Dong Liang, Max Craglia, Lizhe Wang, Sven Schade, Christina Corban, et al. 2020. “Big Earth Data Science: An Information Framework for a Sustainable Planet.” International Journal of Digital Earth 13 (7): Taylor & Francis, 743–767. doi:10.1080/17538947.2020.1743785.
  • Han, Xu, Zhengyan Zhang, Ning Ding, Yuxian Gu, Xiao Liu, Yuqi Huo, Jiezhong Qiu, et al. 2021. Pre-Trained Models: Past, Present and Future.” AI Open, August. doi:10.1016/j.aiopen.2021.08.002.
  • Huang, Zhiheng, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF Models for Sequence Tagging.” ArXiv:1508.01991 [Cs], August. http://arxiv.org/abs/1508.01991.
  • Jafari, Omid, Parth Nagarkar, Bhagwan Thatte, and Carl Ingram. 2020. SatelliteNER: An Effective Named Entity Recognition Model for the Satellite Domain.” In Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2020, Volume 3: KMIS, Budapest, Hungary, November 2-4, 2020., 100–107. doi:10.5220/0010147401000107.
  • Kavvada, Argyro, Graciela Metternicht, Flora Kerblat, Naledzani Mudau, Marie Haldorson, Sharthi Laldaparsad, Lawrence Friedl, Alex Held, and Emilio Chuvieco. 2020. “Towards Delivering on the Sustainable Development Goals Using Earth Observations.” Remote Sensing of Environment 247 (September): 111930. doi:10.1016/j.rse.2020.111930.
  • Lample, Guillaume, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, and Chris Dyer. 2016. Neural Architectures for Named Entity Recognition.” In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 260–270. San Diego, California: Association for Computational Linguistics. doi:10.18653/v1/N16-1030.
  • Liang, Chen, Yue Yu, Haoming Jiang, Siawpeng Er, Ruijia Wang, Tuo Zhao, and Chao Zhang. 2020. BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision.” In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 1054–1064. KDD ‘20. New York, NY, USA: Association for Computing Machinery. doi:10.1145/3394486.3403149.
  • Lin, Bill Y., Frank Xu, Zhiyi Luo, and Kenny Zhu. 2017. “Multi-Channel BiLSTM-CRF Model for Emerging Named Entity Recognition in Social Media.” In Proceedings of the 3rd Workshop on Noisy User-Generated Text, 160–165. Copenhagen, Denmark: Association for Computational Linguistics. doi:10.18653/v1/W17-4421.
  • Liu, Yinhan, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. “RoBERTa: A Robustly Optimized BERT Pretraining Approach.” ArXiv:1907.11692 [Cs], July. http://arxiv.org/abs/1907.11692.
  • Mao, Huina, Gautam Thakur, Kevin Sparks, Jibonananda Sanyal, and Budhendra Bhaduri. 2019. “Mapping Near-Real-Time Power Outages from Social Media.” International Journal of Digital Earth 12 (11): 1285–1299. Taylor & Francis, doi:10.1080/17538947.2018.1535000.
  • Mikolov, Tomas, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space.” ArXiv:1301.3781 [Cs], September. http://arxiv.org/abs/1301.3781.
  • Mikolov, Tomas, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Distributed Representations of Words and Phrases and Their Compositionality.” In Proceedings of the 26th International Conference on Neural Information Processing Systems - Volume 2, 3111–3119. NIPS’13. Red Hook, NY, USA: Curran Associates Inc.
  • Nativi, Stefano, Mattia Santoro, Gregory Giuliani, and Paolo Mazzetti. 2020. “Towards a Knowledge Base to Support Global Change Policy Goals.” International Journal of Digital Earth 13 (2): Taylor & Francis, 188–216. doi:10.1080/17538947.2018.1559367.
  • Reimers, Nils, and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings Using Siamese BERT-Networks.” In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 3982–3992. Hong Kong, China: Association for Computational Linguistics. doi:10.18653/v1/D19-1410.
  • Sanh, Victor, Lysandre Debut, Julien Chaumond, and Thomas Wolf. 2020. DistilBERT, a Distilled Version of BERT: Smaller, Faster, Cheaper and Lighter.” ArXiv:1910.01108 [Cs], February. http://arxiv.org/abs/1910.01108.
  • Shang, Jingbo, Liyuan Liu, Xiaotao Gu, Xiang Ren, Teng Ren, and Jiawei Han. 2018. Learning Named Entity Tagger Using Domain-Specific Dictionary.” In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2054–2064. Brussels, Belgium: Association for Computational Linguistics. doi:10.18653/v1/D18-1230.
  • Strubell, Emma, Patrick Verga, David Belanger, and Andrew McCallum. 2017. Fast and Accurate Entity Recognition with Iterated Dilated Convolutions.” In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2670–2680. Copenhagen, Denmark: Association for Computational Linguistics. doi:10.18653/v1/D17-1283.
  • Sudmanns, Martin, Dirk Tiede, Stefan Lang, Helena Bergstedt, Georg Trost, Hannah Augustin, Andrea Baraldi, and Thomas Blaschke. 2020. “Big Earth Data: Disruptive Changes in Earth Observation Data Management and Analysis?” International Journal of Digital Earth 13 (7): Taylor & Francis, 832–850. doi:10.1080/17538947.2019.1585976.
  • Sun, Kai, Yunqiang Zhu, Peng Pan, Zhiwei Hou, Dongxu Wang, Weirong Li, and Jia Song. 2019. “Geospatial Data Ontology: The Semantic Foundation of Geospatial Data Integration and Sharing.” Big Earth Data 3 (3): Taylor & Francis, 269–296. doi:10.1080/20964471.2019.1661662.
  • Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention Is All You Need.” In Proceedings of the 31st International Conference on Neural Information Processing Systems, 6000–6010. NIPS’17. Red Hook, NY, USA: Curran Associates Inc.
  • Wang, Lizhe, and Jining Yan. 2020. “Stewardship and Analysis of Big Earth Observation Data.” Big Earth Data 4 (4): Taylor & Francis, 349–352. doi:10.1080/20964471.2020.1857055.
  • Wu, Yonghui, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, et al. 2016. Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation.” ArXiv:1609.08144 [Cs], October. http://arxiv.org/abs/1609.08144.
  • Yang, Zhilin, Ruslan Salakhutdinov, and William Cohen. 2016. “Multi-Task Cross-Lingual Sequence Tagging from Scratch.” ArXiv:1603.06270 [Cs], August. http://arxiv.org/abs/1603.06270.
  • Zhao, Tianjie, Michael H. Cosh, Alexandre Roy, Xihan Mu, Yubao Qiu, and Jiancheng Shi. 2021. “Remote Sensing Experiments for Earth System Science.” International Journal of Digital Earth 14 (10): Taylor & Francis, 1237–1242. doi:10.1080/17538947.2021.1977473.
  • Zhu, Yunqiang. 2019. “Geospatial Semantics, Ontology and Knowledge Graphs for Big Earth Data.” Big Earth Data 3 (3): Taylor & Francis, 187–190. doi:10.1080/20964471.2019.1652003.
  • Žukov-Gregorič, Andrej, Yoram Bachrach, and Sam Coope. 2018. Named Entity Recognition With Parallel Recurrent Neural Networks.” In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 69–74. Melbourne, Australia: Association for Computational Linguistics. doi:10.18653/v1/P18-2012.