Machine Learning Methodologies and Large Data Text Corpora

Views: 210

Title: Machine Learning Methodologies and Large Data Text Corpora
Author(s): Luke Barnesmoore, Jeffery Huang
Publisher: Common Ground Research Networks
Collection: Common Ground Research Networks
Series: New Directions in the Humanities
Journal Title: The International Journal of Communication and Linguistic Studies
Keywords: Machine Learning Methodologies, History of Assemblage Model (HoAM), Statnews.org
Volume: 14
Issue: 1
Date: October 31, 2015
ISSN: 2327-7882 (Print)
ISSN: 2327-8617 (Online)
DOI: https://doi.org/10.18848/2327-7882/CGP/v14i01/43661
Citation: Barnesmoore, Luke , and Jeffery Huang. 2015. "Machine Learning Methodologies and Large Data Text Corpora." The International Journal of Communication and Linguistic Studies 14 (1): 1-15. doi:10.18848/2327-7882/CGP/v14i01/43661.
Extent: 15 pages

Abstract

With the rise of social media as a focal point for interaction in both global and local social communities, “big data” has become a key feature of social science research in the 21st century. As the size of corpora on sites like Facebook and Twitter have grown, a need has risen for more and more sophisticated computer science tools to collect data and both identify and visualize statistical trends therein. After describing our integration of Netvizz scraping software with our Statnews.org language analysis software we provide a case study of our tool’s application through analyzing Libertarian Facebook data, in particular a discussion about “milk rights” in the US, within the lens of Barnesmoore’s History of Assemblage Model (HoAM) and proceed (through use of thought experiment) to draw conclusions as to the ways in which the ontological regime(s) implicit in the analyzed data are likely to structure potential norms of thought, behavior, and being within publics socialized within the regime.

Common Ground Research Networks

Common Ground Journals and Books

Series (28)

Advanced Search

Search by open access?

Search by subscribed content?