Research Article

Building Better Machine Learning Models for Rhetorical Analyses: The Use of Rhetorical Feature Sets for Training Artificial Neural Network Models

Pages 63-78 | Published online: 23 May 2022
ABSTRACT

In this paper, we investigate two approaches to building artificial neural network models to compare their effectiveness for accurately classifying rhetorical structures across multiple (non-binary) classes in small textual datasets. We find that the most accurate type of model can be designed by using a custom rhetorical feature list coupled with general-language word vector representations, which outperforms models with more computing-intensive architectures.

Acknowledgment

The authors would like to thank the reviewers and the editor for valuable critique, guidance, and encouragement, the managing editor for the helpful and insightful edit of our manuscript, and the Center for Computationally Assisted Science and Technology (CCAST) at North Dakota State University for providing computing resources and support.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

1. Many resources exist for learning more about word embeddings. Latysheva (Citation2019) provides a brief introduction to the topic, while Karani (Citation2018) and Sarwan (Citation2017) provide more detailed ones.
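The core idea behind word embeddings can be illustrated in a few lines. The sketch below uses toy four-dimensional vectors with made-up values (real models such as GloVe or word2vec learn vectors of 100 or more dimensions from co-occurrence statistics); it shows only the basic property that semantically related words sit closer together in the vector space.

```python
import numpy as np

# Toy 4-dimensional word embeddings (illustrative values, not trained vectors).
embeddings = {
    "ethos":  np.array([0.9, 0.1, 0.3, 0.0]),
    "logos":  np.array([0.8, 0.2, 0.4, 0.1]),
    "banana": np.array([0.0, 0.9, 0.1, 0.8]),
}

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: values near 1.0 mean similar direction."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Related words should score higher than unrelated ones.
sim_related = cosine_similarity(embeddings["ethos"], embeddings["logos"])
sim_unrelated = cosine_similarity(embeddings["ethos"], embeddings["banana"])
print(sim_related > sim_unrelated)  # → True
```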

2. Rhetorical features can be understood as any language structure that exerts a rhetorical effect within a given rhetorical ecosystem. Broadly, such features can be determined through rhetorical analysis based on close reading of select texts, or quantitatively through tools like DocuScope’s generic dictionary of rhetorical features (Kaufer, Ishizaki, Butler, & Collins, Citation2004) or DICTION’s semantic feature/sub-feature sets (refer to Hart, Citation2001). Later in this article, we offer one approach to compiling a rhetorical feature list, using a set of specific steps.
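One simple quantitative way to operationalize a rhetorical feature list is to count occurrences of each feature phrase in a text. The sketch below assumes a hypothetical mini list of hedging markers; a real list (compiled through close reading, or drawn from DocuScope/DICTION-style categories) would be far larger and organized by rhetorical function.

```python
import re
from collections import Counter

# Hypothetical mini feature list for illustration only; a real rhetorical
# feature list would contain many more entries grouped by rhetorical effect.
hedging_features = ["might", "perhaps", "suggests", "appears to"]

def count_features(text, features):
    """Count case-insensitive occurrences of each feature phrase in the text."""
    text = text.lower()
    return Counter({f: len(re.findall(re.escape(f), text)) for f in features})

sample = "The data suggests warming might accelerate; perhaps models underestimate it."
counts = count_features(sample, hedging_features)
print(counts["might"], counts["perhaps"], counts["suggests"])  # → 1 1 1
```

Counts like these can then serve as input features alongside, or instead of, raw text for a classifier.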

3. In concept, if not necessarily in practice. It is best to think of ANNs as inspired by neuroscientific knowledge of the brain but not trying to perfectly mimic the functions of actual brains. Refer also to the discussion on backpropagation in Ananthaswamy (Citation2021).

4. Unsupervised ML algorithms, like topic models or one-shot learning algorithms, do not use pre-labeled datasets; this study focuses on supervised models.

5. A detailed description of each category is included in the dataset’s repository, specifically, under the coding manual (https://kilthub.cmu.edu/articles/dataset/E-thos_Project_Climate_Change/12964481/1?file=24696185).

6. For more detail on nuances such as the activation functions or output dimensions we used, our code can be accessed at https://osf.io/6sbcq/. Parts of the BERT code were built from https://www.kdnuggets.com/2020/02/intent-recognition-bert-keras-tensorflow.html; the non-BERT code benefited from the documentation at https://keras.io/examples/nlp/pretrained_word_embeddings/.

7. For two excellent illustrations of how BERT works, refer to Alammar (Citation2018) and Vig (Citation2019).

8. Our columnar distinctions between categories merely serve to make the list of seed words more human-readable; from the ANN’s perspective, they appear as one long list without the superimposed class distinctions. We used our training in rhetorical analysis to identify what we judged to be rhetorical signals for these classes of expertise appeal, but we cannot know whether the ANN algorithms used these features in the same ways, or which patterns emerging from them were most predictive of the class distinctions made by the ANNs.

9. We also used BERT in Set 3 with the custom feature list; predictably, those models performed poorly and occasionally could not differentiate between different classes at all.

10. Their feature selection differed significantly from ours, however, as they used n-grams rather than custom feature sets. Their data – technical manuals – targeted a different, more regulated and more structured type of TPC than our data, likely favoring a syntactically driven feature selection process over our semantically and rhetorically driven one. Running our model with an approach similar to theirs produced average results.

11. We concatenated our 100-dimensional GloVe vector representations with the syntactic features, effectively adding syntactic dimensions to the word vector representation.
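The concatenation described in this note can be sketched directly in NumPy. The values below are stand-ins (random numbers for the 100-dimensional GloVe vector, hypothetical flags for the syntactic features); the point is only the resulting shape of the combined representation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for a pretrained 100-dimensional GloVe word vector.
glove_vector = rng.random(100)

# Hypothetical syntactic feature vector (e.g., POS-tag indicator flags).
syntactic_features = np.array([1.0, 0.0, 0.0, 1.0])

# Concatenation appends the syntactic dimensions to the word vector,
# yielding one combined per-token representation for the model input.
combined = np.concatenate([glove_vector, syntactic_features])
print(combined.shape)  # → (104,)
```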

Additional information

Funding

The Center for Computationally Assisted Science and Technology (CCAST) resources at North Dakota State University were made possible in part by NSF MRI Award No. 2019077.

Notes on contributors

Zoltan P. Majdik

Zoltan P. Majdik is an associate professor in the Department of Communication at North Dakota State University in Fargo, ND.

James Wynn

James Wynn is an associate professor of English at Carnegie Mellon University in Pittsburgh, PA.
