{"id":287,"date":"2023-05-02T15:59:28","date_gmt":"2023-05-02T15:59:28","guid":{"rendered":"https:\/\/iris.siue.edu\/stldigitalhumanities\/?p=287"},"modified":"2025-11-21T17:02:22","modified_gmt":"2025-11-21T17:02:22","slug":"text-analysis","status":"publish","type":"post","link":"https:\/\/iris.siue.edu\/stldigitalhumanities\/2023\/05\/02\/text-analysis\/","title":{"rendered":"Text Analysis"},"content":{"rendered":"\n<p><strong>Contributed by Geremy Carnes, Lindenwood University<\/strong><br><em>Written for the Cleveland Teaching Collaborative<\/em><\/p>\n\n\n\n<p>Text analysis is one of the oldest forms of humanistic practice, but new digital tools can perform text analyses on a scale that would be impossible for a human being to achieve. These computationally-enabled forms of text analysis cannot replace traditional forms of practice, but they can complement them, revealing patterns across a single text or across vast corpora of texts that lead to new insights.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Spotlight Tool: NGram<\/strong><strong><\/strong><\/h3>\n\n\n\n<p>One of the simplest yet most powerful ways to get started with computationally-enabled text analysis is with the <a href=\"https:\/\/books.google.com\/ngrams\"><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-vivid-red-color\"><strong>Google NGram Viewer<\/strong><\/mark><\/a>. The NGram Viewer is a free tool that allows you to plot the usage of particular words over time, across over 8 million books and 5 centuries. The sheer size of its corpus makes the NGram Viewer the best way to quickly explore language change over time in the classroom.&nbsp;<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-content\/uploads\/sites\/71\/2022\/09\/Ngram-1024x560.png\" alt=\"\" class=\"wp-image-136\" width=\"692\" height=\"378\" srcset=\"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-content\/uploads\/sites\/71\/2022\/09\/Ngram-1024x560.png 1024w, https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-content\/uploads\/sites\/71\/2022\/09\/Ngram-300x164.png 300w, https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-content\/uploads\/sites\/71\/2022\/09\/Ngram-768x420.png 768w, https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-content\/uploads\/sites\/71\/2022\/09\/Ngram-1536x840.png 1536w, https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-content\/uploads\/sites\/71\/2022\/09\/Ngram.png 1600w\" sizes=\"auto, (max-width: 692px) 100vw, 692px\" \/><figcaption class=\"wp-element-caption\">Google NGram Viewer<\/figcaption><\/figure>\n<\/div>\n\n\n<h4 class=\"wp-block-heading\"><strong>Learning Outcomes<\/strong><\/h4>\n\n\n\n<p>Using the Google NGram Viewer can be a component of many traditional text analysis assignments. It can also be used by itself in low-stakes exploratory assignments. Assignments involving the NGram Viewer support outcomes for many types of courses, especially literature and history courses:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Students reflect on the processes of language change through consideration of statistically-based visual evidence.<\/li>\n\n\n\n<li>Students examine the relationship between language change and historical developments\/events.<\/li>\n\n\n\n<li>Students support their analysis of a text with evidence about historical word usage frequency.<\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Resources<\/strong><\/h4>\n\n\n\n<p>To get started with the Google NGram Viewer, simply go to <strong><a href=\"https:\/\/books.google.com\/ngrams\"><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-vivid-red-color\">https:\/\/books.google.com\/ngrams<\/mark><\/a> <\/strong>and start searching. Enter the words you want to graph (separated by commas) and the date range you want to examine, and the NGram Viewer will graph those words\u2019 frequency within Google\u2019s corpus over that time period. If you spend a half hour reading the <a href=\"https:\/\/books.google.com\/ngrams\/info\">help documentation<\/a>, more advance searches and graphs become possible, such as searching for words used as a particular part of speech or adding the results for multiple words together in a single graph line.<\/p>\n\n\n\n<p>The NGram Viewer does have some limitations you should be mindful of. The OCR (Optical Character Recognition) used to build Google\u2019s corpus isn\u2019t perfect, especially with texts from earlier centuries. (When running searches prior to 1800, be mindful of the fact that many long S\u2019s have been recorded as F\u2019s.) There are also valid concerns about the representativeness of the corpus\u2019s contents. However, while these limitations are important to keep in mind when using the NGram Viewer to make scholarly arguments, they are unlikely to present a problem for classroom usage; indeed, discussing these limitations can help students think more critically about the nature of text analysis corpora.<\/p>\n\n\n\n<p>For an example of the kind of creative assignment to which Google NGram lends itself well, check out how Katherine D. Harris had her students <a href=\"https:\/\/triproftri.wordpress.com\/2013\/10\/03\/geresearch\/\"><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-vivid-red-color\"><strong>use the tool in an exploratory manner when reading <em>A Clockwork Orange<\/em><\/strong><\/mark><\/a>.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Other Free and Accessible Text Analysis Tools:<\/strong>&nbsp;&nbsp;<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li><em>Voyant<\/em>, <a href=\"https:\/\/voyant-tools.org\/\"><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-vivid-red-color\"><strong>https:\/\/voyant-tools.org\/<\/strong><\/mark><\/a>: identify and analyze themes, topics, and patterns internal to a text or corpus<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Contributed by Geremy Carnes, Lindenwood UniversityWritten for the Cleveland Teaching Collaborative Text analysis is one of the oldest forms of [&hellip;]<\/p>\n","protected":false},"author":160,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_sb_is_suggestion_mode":false,"_sb_show_suggestion_boards":false,"_sb_show_comment_boards":false,"_sb_suggestion_history":"","_sb_update_block_changes":"","footnotes":""},"categories":[24],"tags":[27,25,26],"class_list":["post-287","post","type-post","status-publish","format-standard","hentry","category-intros-and-overviews","tag-ngram","tag-text-analysis","tag-voyant"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-json\/wp\/v2\/posts\/287","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-json\/wp\/v2\/users\/160"}],"replies":[{"embeddable":true,"href":"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-json\/wp\/v2\/comments?post=287"}],"version-history":[{"count":2,"href":"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-json\/wp\/v2\/posts\/287\/revisions"}],"predecessor-version":[{"id":294,"href":"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-json\/wp\/v2\/posts\/287\/revisions\/294"}],"wp:attachment":[{"href":"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-json\/wp\/v2\/media?parent=287"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-json\/wp\/v2\/categories?post=287"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/iris.siue.edu\/stldigitalhumanities\/wp-json\/wp\/v2\/tags?post=287"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}