Logistic Regression

This video introduces logistic regression as an entry point to machine learning for text and literary analysis. It walks through how to train a simple model in a Google Drive spreadsheet, then how to run more complex logistic regressions with Python in a Jupyter Notebook. It also explores the kinds of results these models can produce for literary analysis.
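
To give a feel for what training a logistic regression on text involves, here is a minimal sketch in plain Python. It is not the code used in the video: the toy sentences, the labels (1 for "gothic", 0 for "romance"), the learning rate, and the function names are all assumptions made up for illustration. It counts words in each text, then uses gradient descent to learn one weight per word.

```python
import math
from collections import Counter


def sigmoid(z):
    """Map any real number to a probability between 0 and 1."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)  # avoids overflow for large negative z
    return ez / (1.0 + ez)


def featurize(text, vocab):
    """Turn a text into a list of word counts, one per vocabulary word."""
    counts = Counter(text.lower().split())
    return [counts[word] for word in vocab]


def train(X, y, lr=0.1, epochs=500):
    """Fit weights and a bias with stochastic gradient descent."""
    weights = [0.0] * len(X[0])
    bias = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            p = sigmoid(bias + sum(w * x for w, x in zip(weights, xi)))
            err = p - yi  # gradient of the log loss with respect to z
            weights = [w - lr * err * x for w, x in zip(weights, xi)]
            bias -= lr * err
    return weights, bias


def predict_proba(text, vocab, weights, bias):
    """Estimated probability that a text belongs to class 1."""
    x = featurize(text, vocab)
    return sigmoid(bias + sum(w * xj for w, xj in zip(weights, x)))


# Toy corpus invented for this sketch: 1 = "gothic", 0 = "romance".
docs = [
    ("the ghost haunted the dark castle", 1),
    ("a shadow crept through the ruined abbey", 1),
    ("the dark crypt hid a ghost", 1),
    ("she danced at the bright spring ball", 0),
    ("the wedding was a bright and merry affair", 0),
    ("a merry ball followed the wedding", 0),
]
vocab = sorted({word for text, _ in docs for word in text.lower().split()})
X = [featurize(text, vocab) for text, _ in docs]
y = [label for _, label in docs]
weights, bias = train(X, y)

print(predict_proba("the ghost haunted the dark ruined crypt", vocab, weights, bias))
print(predict_proba("a merry ball at the bright wedding", vocab, weights, bias))
```

Words that appear only in class 1 texts (such as "ghost") end up with positive weights, and words that appear only in class 0 texts (such as "merry") end up with negative weights. Inspecting these weights is one way such a model can feed back into interpretation. In practice a library such as scikit-learn would likely replace the hand-written training loop.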

Further Reading and Resources

Machine Learning, Corpus Linguistics, Computational Linguistics, Text Mining and Analytics

Posted by

Matt Lavin is a Clinical Assistant Professor of English at the University of Pittsburgh, and Director of the department’s Digital Media Lab. His scholarship has appeared in Studies in the Novel, Literary and Linguistic Computing, Auto|Biography Studies, and The Programming Historian.

Similar Projects by Discipline

Literature

How to grow data forests with XML trees

Elisa Beshero-Bondar

eXtensible Markup Language (XML).

Beyond the Ant Brotherhood

Tatyana Gershkovich

Dynamic digital archives of writings and timelines.

The Latin American Comics Archive (LACA)

Felipe Gómez

Online archives in comic book markup language.

Stylometry and Authorship Analysis

Patrick Juola

Machine learning to identify authors.

DocuScope

David Kaufer

Computer Support for Close Reading and Textual Analysis in DH.

Metadata Heatmaps for Distant Reading

Benjamin Miller

Distant reading of a textual corpus.

Shakespeare-VR

Stephen Wittek

Building immersive VR projects.

English

How to grow data forests with XML trees

Elisa Beshero-Bondar

eXtensible Markup Language (XML).

Stylometry and Authorship Analysis

Patrick Juola

Machine learning to identify authors.

DocuScope

David Kaufer

Computer Support for Close Reading and Textual Analysis in DH.

Metadata Heatmaps for Distant Reading

Benjamin Miller

Distant reading of a textual corpus.

Shakespeare-VR

Stephen Wittek

Building immersive VR projects.

The Historical TV Guide

Kathy M. Newman, Steven Gotzler

Using digitized text to study television history.

Data Visualization: Tableau

Emma Slayton

Data visualization with Tableau.

Similar Projects by Topics

Machine Learning

Stylometry and Authorship Analysis

Patrick Juola

Machine learning to identify authors.

Corpus Linguistics

Stylometry and Authorship Analysis

Patrick Juola

Machine learning to identify authors.

Building your own data set

AmyJo Brown

A journalist's approach.

Marriage & Divorce of Capitalism & Democracy

Simon DeDeo

DH methods for interdisciplinary studies and results.

Structure-based Network Analysis

S.E. Hackney

Structure-based network analysis.

DocuScope

David Kaufer

Computer Support for Close Reading and Textual Analysis in DH.

Metadata Heatmaps for Distant Reading

Benjamin Miller

Distant reading of a textual corpus.

The Historical TV Guide

Kathy M. Newman, Steven Gotzler

Using digitized text to study television history.

Topic Modeling Subreddits

Chloe Perry

Computational techniques to topic model subreddits.

Computational Linguistics

Stylometry and Authorship Analysis

Patrick Juola

Machine learning to identify authors.

Structure-based Network Analysis

S.E. Hackney

Structure-based network analysis.

Metadata Heatmaps for Distant Reading

Benjamin Miller

Distant reading of a textual corpus.

Topic Modeling Subreddits

Chloe Perry

Computational techniques to topic model subreddits.

Text Mining and Analytics

Building your own data set

AmyJo Brown

A journalist's approach.

Marriage & Divorce of Capitalism & Democracy

Simon DeDeo

DH methods for interdisciplinary studies and results.

Structure-based Network Analysis

S.E. Hackney

Structure-based network analysis.

DocuScope

David Kaufer

Computer Support for Close Reading and Textual Analysis in DH.

Metadata Heatmaps for Distant Reading

Benjamin Miller

Distant reading of a textual corpus.

The Historical TV Guide

Kathy M. Newman, Steven Gotzler

Using digitized text to study television history.

Topic Modeling Subreddits

Chloe Perry

Computational techniques to topic model subreddits.

Last updated: August 29, 2019
https://github.com/cmu-lib/dhlg/blob/master/_projects/lavin.md