The paper illustrates how the combination of frequency analysis and diachronic word embeddings can help to explore and understand the development of long-running discourses in large sets of historical data.
@jdighist
The JDH serves as a forum for critical debate and discussion in the field of digital history by offering an innovative publication platform and promoting a new form of data-driven scholarship and of transmedia storytelling in the historical sciences.
The paper illustrates how the combination of frequency analysis and diachronic word embeddings can help to explore and understand the development of long-running discourses in large sets of historical data.
The paper shows how combining frequency measures with diachronic word embeddings helps trace long-running political discourses in large historical corpora, revealing how meanings and associations shift over decades.
To assess the semantic changes in how German politicians spoke about the Wehrmacht, we use overlapping diachronic word embeddings. An additional analysis of term frequency measures the level of interest in the Wehrmacht over time.
The Bundestag - (West) Germanyโs federal legislature - has published its proceedings since 1949. Digitized editions of these proceedings contain over 900,000 speeches, questions, and interjections, forming a valuable but underutilized corpus of sources on post-war German politics.
How have German politicians dealt with the burdensome legacy of militarism after the Second World War? This paper explores the connotations of the Wehrmacht in political speech through word embeddings trained on parliamentary proceedings.
Many thanks for the team at @jdighist.bsky.social for everything to bring this article to fruition! Check it out to learn more the utility and limits of LLMs as digital tools for historical research - like using LLMs for OCR, which is a hot topic of late.
How did early modern people imagine the futureโand who shaped it? Analyzing 4,900 English letters (1410โ1695), Sara Budts shows how phrases like โGod willingโ slowly faded, revealing a shift from Providence toward human agency and a more uncertain, self-shaped future.
What can 7,500 divine references in early modern letters tell us about how people imagined the future? Using the CEEC, Sara Budts maps when and how writers invoked God, revealing a patchy decline: routine phrases faded, but appeals persisted in illness, crisis, and war.
From โGod willingโ to โI trust in God,โ early modern letters show how people used divine appeals to navigate uncertainty. Through corpus analysis, Sara Budts traces Englandโs uneven secularisation: routine invocations faded, but personal faith deepened, especially in times of crisis.
Early modern letters are full of phrases like โGod willingโ or โBy Godโs grace.โ Sara Budtsโ analysis of 5,000 letters shows these werenโt clichรฉs but ways to navigate faith, agency, and uncertainty. From 1450โ1700, people balanced divine will with human action in shifting ways.
From tools to sources: By examining LLMs through source criticism, historians can evaluate generative AI's uneven potential while contributing to broader debates about these technologies' societal impact.
Mapping AI's historical knowledge: Benchmarking LLM performance on historical facts, oral history transcription accuracy, and OCR correction tasks reveals uneven capabilities. Results show whose history has been digitized and made computationally legible. #oralhistory #OCR #benchmarks
Examining LLMs as historical sources: This study applies source criticism to AI training data, revealing how patterns of digitization shape what these tools encode. Case studies map this 'jagged frontier' of capabilities across historical tasks, languages & time periods. #DigitalHistory #LLMs
History-focused: What happened to Swedish public service radio when commercial broadcasting arrived? Analyzing 1,600 hours of P1 & P3, this article uncovers shifts in sonic patterns & content variation, offering new insights into the sound history of broadcasting
Methodology-focused: How can historians work with thousands of hours of sound? This article introduces segmentation & dimensionality reduction methods for large-scale radio archives, showing how computational analysis reveals hidden structures in public service broadcasting
The research shows how digital tools like Aventura.js can mediate archival inquiry. Through an interactive atlas and the Carguero case study, it offers a new approach to reading visual history. journalofdigitalhist...
Inspired by Warburgโs principles, the article transforms static images into interlinked, interactive panels. Its case study of the 19thโcentury Carguero deepens our understanding of visual symbolism.
By merging digital techniques with human interpretation, the article demonstrates how interactive panels and data visualizations offer fresh insights into visual archives via the Carguero case.
The study repurposes Aventura.jsโa bilingual, openโsource toolโto build digital atlases that trace symbolic connections in historical images, exemplified by Colombiaโs Carguero.
A forthcoming article by Rodrรญguez Gรณmez and Urueรฑa presents a method for exploring visual collections using interactive panels, inspired by Warburgโs Mnemosyne Atlas, to analyze the 19thโcentury Carguero case. journalofdigitalhist...
But these sources raise questions. Can we ethically use archived posts from forums & blogs without consent? How do we avoid overinterpreting what's preservedโand what's lost? The paper ends with a call for cautious, critical digital history.
From lean employment pressures to restrictive benefits, the 2000s marked a turning point. Web archives reveal raw, everyday reflectionsโfrom sick workers to HR managersโoffering insight into how people understood these changes, not just endured them.
How did British citizens experience sickness & disability policy under New Labour? This article uses blogs, message boards & the Internet Archive to explore how workers, managers & families navigated welfare reformโand how they remembered the welfare state. journalofdigitalhist...
GaLiLeO is unfinished & its corpus incomplete. But thatโs the point. It shows how partial, purpose-built tools can still yield insightโespecially when they trace relationships that archives and metadata might miss. A third space between archive and algorithm.
Unlike most text mining tools, GaLiLeO doesnโt aim for sweeping models. Instead, it helps researchers identify meaningful links across Galileoโs libraryโprioritizing interpretive, humanistic inquiry over scale. Itโs built for discovery, not datafication.
What if digital tools didnโt summarize a whole corpusโbut helped historians find meaning in just a few documents? ๐ GaLiLeO (Galileoโs Library and Letters Online) is a prototype digital lab that builds interpretive paths through Galileoโs letters, books & notes. journalofdigitalhist...
We worked to recover and make accessible 10,000+ defaced websites, captured between 1995โ2010. These archived hacks aren't just noise - they reveal political messages, digital protest, and forgotten web aesthetics. Here's how they help us understand web history: journalofdigitalhist...
What can hacked websites tell us about the history of the web?
From online activism to digital vandalism, political web defacements were part of shaping the early internet. We just published a new article in the JDH diving into this fascinating and underexplored archive. journalofdigitalhist...
Excited to introduce Arvest - the platform which will be at the core of the ERC 'From Stage to Dataโ project. Designed to process and analyse digital traces of performing arts, it's transforming how we study theatrical heritage. Explore arvest.app #PerformingArts #DigitalArchives"
Discover Arvest, the cloud-based platform for IIIF image & video annotation. No server installation needed - just create an account and start annotating your digital resources immediately. Try it today at arvest.app #IIIF #DigitalHumanities #Annotation journalofdigitalhist...