Digital pedagogy and student knowledge production

The past two weeks in my Introduction to Digital Humanities course, students have been using the open-source content management system Omeka to create online exhibits related to the early Christian text, the Martyrdom of Perpetua and Felicitas. I was astounded by their accomplishments.  The students raised thoughtful questions about the text, found items online related to […]

A Martyr Is a Witness

Sinclair, Stéfan and Geoffrey Rockwell. “Voyant Tools: Reveal Your Texts.” Voyant. 31 Aug. 2015 <> In my Introduction to Digital Humanities course, my students are conducting very basic text analysis using Voyant and AntConc.  One of the datasets we are using is a set of martyr texts taken from the now public domain Ante-Nicene Fathers […]

My digital future

This fall, as I have been trying to finish up my book project, Monks and Their Children, I have been asked more than once:  What’s your next project?   When I start describing, I frequently get the reply:  no, I mean your real project, your next book.  My internal response was always twofold:  the snarky, “What, bringing […]

Fall 2014 release: more texts, more standardization

We’ve got a new release of material at New stuff: Additional texts: more Sayings from the Coptic Apophthegmata Patrum chapters of 1 Corinthians additional chapters of the Gospel of Mark Updated and corrected annotations (part of speech, language of origin) in previous corpora Standardized field names for annotations and metadata across the corpora Linguistic […]

Updates to Automated Annotation Tools

We’ve updated our tokenizer (which breaks Coptic bound groups into their constituent morphemes) and our normalizer (which normalizes spelling and orthography to faclitate further automatic annotations). Version 2.0.1 of the tokenizer includes more patterns to deal with a broader variety of bound groups.  It also includes a parameter (-l) to accommodate bound groups that are […]

June 2014 release notes

We’ve updated our tokenizer and part-of-speech tagger.  The tokenizer breaks a Coptic bound group into its constituent words and/or morphemes.  The tagger consists of a set of fine-grained and a set of course-grained Coptic models for use with the open source natural language processing program TreeTagger.  They use a Sahidic Coptic lexicon based in part […]