References
- Devlin J, Chang M-W, Lee K, Toutanova K. 2018. BERT: pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805 [cs].
- Hugging Face. 2021. bert-large-uncased-whole-word-masking. Hugging Face [accessed 2021 Mar 22]. https://huggingface.co/bert-large-uncased-whole-word-masking.
- Lococo KH, Staplin L, Martell CA, Sifrit KJ. 2012. Pedal application errors. Washington (DC): National Highway Traffic Safety Administration. Report No.: DOT HS 811 597.
- Rajpurkar P, Zhang J, Lopyrev K, Liang P. 2016. SQuAD: 100,000+ questions for machine comprehension of text. arXiv:1606.05250 [cs].
- Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I. 2017. Attention is all you need. Paper presented at: 31st Conference on Neural Information Processing Systems (NIPS 2017); Long Beach, CA.
- Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, Cistac P, Rault T, Louf R, Funtowicz M, et al. 2020. Transformers: state-of-the-art natural language processing. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. Association for Computational Linguistics. p. 38–45. doi:10.18653/v1/2020.emnlp-demos.6.