ABSTRACT
The goal of this research is to make progress toward using supervised machine learning for automated content analysis involving complex interpretations of text. In Step 1, two humans coded a sub-sample of online forum posts for relational uncertainty. In Step 2, we evaluated reliability: we trained three different classifiers to learn from those subjective human interpretations. Reliability was established when two different metrics of inter-coder reliability could not distinguish whether a human or a machine had coded the text on a separate hold-out set. Finally, in Step 3 we assessed validity. To accomplish this, we administered a survey in which participants described their own relational uncertainty/certainty in text and completed a questionnaire. After classifying the text, the machine’s classifications of the participants’ responses positively correlated with the participants’ own self-reported relational uncertainty and relational satisfaction. We discuss our results in relation to computational communication science, content analysis, and interpersonal communication.
Disclosure statement
No potential conflict of interest was reported by the authors.
Notes
1. Given the popularity of trace data, especially in communication research (Choi, Citation2018), it is important to determine whether a website prohibits the use of crawling agents to collect data. The terms of service for both websites were carefully reviewed. Neither website made explicit statements regarding robots.txt or web-scraping policies. As such, we conclude that collecting data from these two sites did not violate their terms of service.
2. IDF for any term (t) is defined by IDF(t) = log(N / DF(t)), where N is the number of documents and DF(t) is the number of documents that contain the term (t). The transformation process is called TF-IDF weighting: w(t, d) = TF(t, d) × IDF(t), which assigns a higher weight to a term (t) in a document (d) when it occurs often, but only in a small number of documents. On the other hand, lower weights are assigned to terms that occur often, but in a high number of documents.
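As a minimal sketch of the weighting described above (using a natural-log IDF with toy documents invented for illustration; the authors' actual preprocessing pipeline is not specified here):

```python
import math

def tf_idf(term, doc, corpus):
    """TF-IDF weight of `term` in `doc`, given a corpus of tokenized documents."""
    tf = doc.count(term)                          # term frequency in this document
    df = sum(1 for d in corpus if term in d)      # number of documents containing the term
    idf = math.log(len(corpus) / df)              # IDF(t) = log(N / DF(t))
    return tf * idf                               # w(t, d) = TF(t, d) * IDF(t)

# Hypothetical toy corpus: "uncertain" appears in 2 of 3 documents
docs = [
    ["uncertain", "about", "us"],
    ["certain", "about", "us"],
    ["uncertain", "again"],
]
weight = tf_idf("uncertain", docs[0], docs)  # 1 * log(3/2)
```

A term appearing once in every document would receive a weight of log(N/N) = 0, which is exactly the down-weighting of common terms the note describes.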
3. Precision is defined by Precision = TP / (TP + FP). Recall is defined by Recall = TP / (TP + FN). The F-Measure is defined by F = 2 × (Precision × Recall) / (Precision + Recall), where TP, FP, and FN are the counts of true positives, false positives, and false negatives, respectively.
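These three metrics can be computed directly from paired label lists; a small sketch (the label vectors below are invented for illustration, not the study's data):

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Precision, recall, and F-measure for one positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    precision = tp / (tp + fp)                      # TP / (TP + FP)
    recall = tp / (tp + fn)                         # TP / (TP + FN)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

# Hypothetical human codes vs. machine classifications
y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0]
p, r, f = precision_recall_f1(y_true, y_pred)
```

Here the classifier catches 2 of 3 true positives and makes 1 false positive, so precision, recall, and F all equal 2/3.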