The Historical TV Guide

This video explores how to digitize texts and build a database for a distant reading study of 1950s American television. It explains how to use digital humanities (DH) methods such as web scraping, OCR, SQL, database building, and data cleaning both to sort through vast quantities of information (in this case, hundreds of thousands of hours of television programs) and to work around problematic sources (recordings that were never preserved or survive only in very poor quality), with the aim of challenging conventional understandings of American culture.
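As a rough illustration of how data cleaning and SQL can fit together in a study like this, here is a minimal sketch in Python. It is not the project's actual code: the line format (time, channel, title), the table schema, the function names, and the sample rows are all hypothetical placeholders invented for this example. It tidies OCR-style listing lines, drops lines that cannot be parsed, loads the rest into an in-memory SQLite database, and counts airings per title.

```python
import re
import sqlite3

# Hypothetical OCR output: one listing per line as "time  channel  title".
# These rows are invented placeholders, not data from the project.
RAW_LINES = [
    "7:30  4  EXAMPLE  DRAMA HOUR ",
    "8:00  4  Example Variety Show",
    "7:30  11 example drama hour",
    "garbled ### line",
]

# Assumed line shape: H:MM or HH:MM, a channel number, then the title.
LISTING = re.compile(r"^(\d{1,2}:\d{2})\s+(\d{1,2})\s+(.+)$")


def clean_line(line):
    """Return (time, channel, title), or None if the line does not parse."""
    collapsed = " ".join(line.split())  # collapse runs of whitespace
    match = LISTING.match(collapsed)
    if match is None:
        return None
    time, channel, title = match.groups()
    return time, int(channel), title.title()  # normalize title casing


def build_database(lines):
    """Load every parseable line into an in-memory SQLite table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE listings (time TEXT, channel INTEGER, title TEXT)"
    )
    rows = [row for row in map(clean_line, lines) if row is not None]
    conn.executemany("INSERT INTO listings VALUES (?, ?, ?)", rows)
    conn.commit()
    return conn


def airings_per_title(conn):
    """Count listings per title, most frequent first."""
    query = (
        "SELECT title, COUNT(*) FROM listings "
        "GROUP BY title ORDER BY COUNT(*) DESC, title"
    )
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    conn = build_database(RAW_LINES)
    for title, count in airings_per_title(conn):
        print(title, count)
```

Normalizing whitespace and casing before insertion matters here because a `GROUP BY` treats differently spelled variants of the same title as separate programs. A real corpus would likely need more cleaning rules than this sketch shows.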

Further Reading and Resources

Check out the Project’s Repository on GitHub

Scholarly Resources

Technical Resources

Distant Reading, Corpus Linguistics, Digitization, Text Mining and Analytics

Posted by

Kathy M. Newman is Associate Professor of English/Literary and Cultural Studies at Carnegie Mellon University (CMU). Her book in progress is titled How the Fifties Worked: Mass Culture and the Decade the Unions Made. Newman’s areas of expertise include American literature, media studies, and the relationship between class, politics, and cultural forms.

Steven Gotzler is a PhD candidate in Literary and Cultural Studies at Carnegie Mellon University. His research explores the intersections of intellectual culture, labor, and literature during the 20th century. He has published in The Los Angeles Review of Books on Richard Hoggart and the politics of working-class studies, and he serves as a governing board member for the Cultural Studies Association (CSA).

Similar Projects by Discipline

English

How to grow data forests with XML trees

Elisa Beshero-Bondar

eXtensible Markup Language (XML).

Stylometry and Authorship Analysis

Patrick Juola

Machine learning to identify authors.

DocuScope

David Kaufer

Computer Support for Close Reading and Textual Analysis in DH.

Logistic Regression

Matthew J. Lavin

Machine learning for literary analysis.

Metadata Heatmaps for Distant Reading

Benjamin Miller

Distant reading of a textual corpus.

Data Visualization: Tableau

Emma Slayton

Data visualization with Tableau.

Shakespeare-VR

Stephen Wittek

Building immersive VR projects.

LCS

Stylometry and Authorship Analysis

Patrick Juola

Machine learning to identify authors.

Shakespeare-VR

Stephen Wittek

Building immersive VR projects.

Similar Projects by Topics

Distant Reading

Marriage & Divorce of Capitalism & Democracy

Simon DeDeo

DH methods for interdisciplinary studies and results.

Metadata Heatmaps for Distant Reading

Benjamin Miller

Distant reading of a textual corpus.

Corpus Linguistics

Marriage & Divorce of Capitalism & Democracy

Simon DeDeo

DH methods for interdisciplinary studies and results.

Metadata Heatmaps for Distant Reading

Benjamin Miller

Distant reading of a textual corpus.

Building your own data set

AmyJo Brown

A journalist's approach.

Structure-based Network Analysis

S.E. Hackney

Structure-based network analysis.

Stylometry and Authorship Analysis

Patrick Juola

Machine learning to identify authors.

DocuScope

David Kaufer

Computer Support for Close Reading and Textual Analysis in DH.

Logistic Regression

Matthew J. Lavin

Machine learning for literary analysis.

Topic Modeling Subreddits

Chloe Perry

Computational techniques to topic model subreddits.

Digitization

Shakespeare-VR

Stephen Wittek

Building immersive VR projects.

Text Mining and Analytics

Marriage & Divorce of Capitalism & Democracy

Simon DeDeo

DH methods for interdisciplinary studies and results.

Metadata Heatmaps for Distant Reading

Benjamin Miller

Distant reading of a textual corpus.

Building your own data set

AmyJo Brown

A journalist's approach.

Structure-based Network Analysis

S.E. Hackney

Structure-based network analysis.

DocuScope

David Kaufer

Computer Support for Close Reading and Textual Analysis in DH.

Logistic Regression

Matthew J. Lavin

Machine learning for literary analysis.

Topic Modeling Subreddits

Chloe Perry

Computational techniques to topic model subreddits.

Last updated: August 29, 2019
https://github.com/cmu-lib/dhlg/blob/master/_projects/newmangotzler.md