Glintstone unifies the open cuneiform record — CDLI, ORACC, eBL, and a dozen more — into one platform where scholars can see exactly what's understood, what isn't, and where effort matters most.
Glintstone aims to compound what CDLI, ORACC, and the broader scholarly ecosystem have built — making each resource more queryable, more connected, and more meaningful.
New views across existing data.
With all open data connected, Glintstone allows you to explore and understand it in new ways: artifacts, composites, signs, readings, lemmas, and citations.
Search or use the API across all open data.
See the forest from the trees.
Connected data allows aggregation and analysis in new ways, linked to the ability to explore.
Automated and custom curation methods are baked in, across tablets, signs, lemmas, and meanings.
Catalyze your impact.
Future
Glintstone aims to unlock a new scale of scholarly achievement, and make sure you're recognized for your expertise and hard work.
We'll need lots of input from experts to make this happen, so we can build this system together.
Every artifact tracked through five stages — from raw image to translated text. See what's complete, what's missing, and where scholarly effort has the highest impact.
The artifact exists as a digital photograph
ML-assisted sign detection on the tablet surface
The text read into structured notation
Every word linked to its dictionary lemma
Meaning rendered in modern language
Each artifact surfaces what scholars have written about its era, genre, and place of origin — drawn dynamically from trusted open sources. Administrative tablets sit alongside the political and economic world they recorded. Literary texts surface their reception history. Lexical lists connect to the scribal schools that produced them.
Open data makes this possible, and there's over a century of expertise baked into that data.
Credit and source attribution are structural requirements.
Artifact catalog (353k records) with excavation contexts, seal descriptions, provenance, and collection data. Tablet photographs. Physical measurements.
Geographic coordinates for ancient places; provenience resolution for archaeological sites via ORACC GeoJSON data
ATF transliterations (135k texts, ~4M tokens), translations (43.7k), composite text links, fragment joins
Cuneiform sign inventory (3,367 signs, ~15k reading values, Unicode mappings, MZL/ABZ concordance)
Sumerian lexicon (~21k entries), sign-level glossary data, lemma forms
Cuneiform lexical tradition — sign lists, vocabularies, scribal training texts
21-volume Akkadian lexicon completed 2011 after 90 years; now being digitized by ISAC's Data Research Center
Sign bounding-box annotations (81 tablets, 8.1k annotations); MZL-to-OGSL sign identity mapping
Sign detection training data; MZL sign concordance file used for cross-system identification
Computer vision pipeline for cuneiform sign detection and localization; primarily Persepolis Fortification Archive
Neural POS tagger and lemmatizer for Akkadian; trained on ORACC annotated corpora
Publications bibliography (16.7k pubs), artifact-publication links, edition histories
Fragment-level citation data; publication-to-artifact linkage via live API
Publication DOI enrichment; scholar ORCID identifiers for Assyriology-adjacent works
Citation and reference graph data keyed to DOIs from OpenAlex enrichment
Comprehensive Assyriology bibliography; publication records with series designations and author data
Scholar name registry for cuneiform studies; ATF editor and author attribution
List of Assyriologists for scholar name resolution; Wikidata QIDs as authority identifiers for disambiguation
Open source and built for collaboration.
Browse the schema, review the data model, file issues, or contribute code. The entire project is open.
View RepositorySign up to get involved with early testing.
Glintstone is in the exploratory stage and looking for ambitious collaborators.
The core infrastructure is built to be language-agnostic. Trust tracking, provenance chains, and annotation ecosystems apply to any extinct or undeciphered writing system. Cuneiform is the proving ground.