Maps with CartoDB and Tableau

Our unit in Intro DH right now is on mapping.  In class we’ll be working on creating maps with Palladio.  We also had a preliminary introduction to data, tables, and maps by experimenting with Google Fusion Tables.  In preparation for class, I imported a data set consisting of a list of images from the Cushman Archive […] Read More

Digital pedagogy and student knowledge production

The past two weeks in my Introduction to Digital Humanities course, students have been using the open-source content management system Omeka to create online exhibits related to the early Christian text, the Martyrdom of Perpetua and Felicitas. I was astounded by their accomplishments.  The students raised thoughtful questions about the text, found items online related to […] Read More

A Martyr Is a Witness

Sinclair, Stéfan and Geoffrey Rockwell. “Voyant Tools: Reveal Your Texts.” Voyant. 31 Aug. 2015. In my Introduction to Digital Humanities course, my students are conducting very basic text analysis using Voyant and AntConc.  One of the datasets we are using is a set of martyr texts taken from the now public domain Ante-Nicene Fathers […] Read More

First foray into topic modeling

I spent two weeks at DHSI this year.  Week 2 I took Liz Losh’s and Jacque Wernimont’s Feminist DH, which was incredible and which I highly recommend to everyone.  (Check out the #femdh stream on Twitter for details.) During week 3 of DHSI this year, I took Neal Audenaert’s Topic Modeling, in which we were introduced […] Read More

My digital future

This fall, as I have been trying to finish up my book project, Monks and Their Children, I have been asked more than once:  What’s your next project?   When I start describing, I frequently get the reply:  no, I mean your real project, your next book.  My internal response was always twofold:  the snarky, “What, bringing […] Read More

Fall 2014 release: more texts, more standardization

We’ve got a new release of material at New stuff: Additional texts (more Sayings from the Coptic Apophthegmata Patrum, chapters of 1 Corinthians, additional chapters of the Gospel of Mark); updated and corrected annotations (part of speech, language of origin) in previous corpora; standardized field names for annotations and metadata across the corpora; Linguistic […] Read More

Updates to Automated Annotation Tools

We’ve updated our tokenizer (which breaks Coptic bound groups into their constituent morphemes) and our normalizer (which normalizes spelling and orthography to facilitate further automatic annotations). Version 2.0.1 of the tokenizer includes more patterns to deal with a broader variety of bound groups.  It also includes a parameter (-l) to accommodate bound groups that are […] Read More

The so-called Rule of Pcol

The writings of Shenoute form the largest corpus of early monastic texts by a single author from one of the cradles of Christian monasticism, Egypt. These manuscripts open an important window onto the development of monasticism in Egypt, shedding light on the inner workings of an early monastery outside of the more famous Pachomian system. […] Read More

June 2014 release notes

We’ve updated our tokenizer and part-of-speech tagger.  The tokenizer breaks a Coptic bound group into its constituent words and/or morphemes.  The tagger consists of a set of fine-grained and a set of coarse-grained Coptic models for use with the open source natural language processing program TreeTagger.  They use a Sahidic Coptic lexicon based in part […] Read More