
Cobanoglu, Yunus ORCID: 0009-0003-5715-9979; Laasonen, Jussi ORCID: 0000-0003-3496-0715; Simonjetz, Fabian ORCID: 0009-0005-2509-7317; Khait, Ilya ORCID: 0000-0002-0520-7014; Cohen, Sophie ORCID: 0009-0009-2375-8555; Földi, Zsombor ORCID: 0000-0001-9976-899X; Hätinen, Aino ORCID: 0000-0002-7039-5496; Heinrich, Adrian ORCID: 0000-0003-4357-3266; Mitto, Tonio ORCID: 0000-0002-8526-4683; Rozzi, Geraldina ORCID: 0000-0002-1072-2512; Sáenz, Luis ORCID: 0000-0002-9107-6574; Jiménez, Enrique ORCID: 0000-0003-0093-528X (2024): Transliterated Cuneiform Tablets of the Electronic Babylonian Library Platform. Journal of Open Humanities Data, 10 (1). ISSN 2059-481X

Published version: 65ce03d0e876f.pdf

The publication is available under the Creative Commons Attribution (CC BY) license.


Abstract

This work presents a corpus of transliterated cuneiform tablets from the Electronic Babylonian Library (eBL) platform, together with a public API endpoint for downloading the latest version of the data and a Python library for parsing the transliterations in ATF format. At the time of writing, the constantly growing dataset contains around 25,000 tablets with over 350,000 lines of transliterated text. This dataset is a sizeable addition to open-source cuneiform data and a major milestone for research within the fields of cuneiform studies and NLP.
