Methodology
Daggerobelus combines archival research skills and textual close reading with data science workflows to create its primary methodology.
We apply techniques like
- NLP (Natural Language Parsing) of primary documents from historical archives into machine-readable formats like JSON
- Data analysis using python and pandas to perform techniques like network analyis, statistical modeling, and other advanced techniques
- Charting using D3 and modern web-based visualizations to create readable figures that organize dense historical data
A large part of this methodology includes ensuring that the workflow can be extrapolated to new archives by creating a clear flow from 1) the original archive to 2)machine-readable formats and 3) visual outputs in the form of figures. Since each archive presents its own idiosyncracies, this method can transfer and adapt to a new historical corpus depending on the schema we identify as salient. This method works best with large—and potentially unweildy—archives.