Cobanoglu, Yunus ORCID: 0009-0003-5715-9979; Laasonen, Jussi ORCID: 0000-0003-3496-0715; Simonjetz, Fabian ORCID: 0009-0005-2509-7317; Khait, Ilya ORCID: 0000-0002-0520-7014; Cohen, Sophie ORCID: 0009-0009-2375-8555; Földi, Zsombor ORCID: 0000-0001-9976-899X; Hätinen, Aino ORCID: 0000-0002-7039-5496; Heinrich, Adrian ORCID: 0000-0003-4357-3266; Mitto, Tonio ORCID: 0000-0002-8526-4683; Rozzi, Geraldina ORCID: 0000-0002-1072-2512; Sáenz, Luis ORCID: 0000-0002-9107-6574; Jiménez, Enrique ORCID: 0000-0003-0093-528X (2024): Transliterated Cuneiform Tablets of the Electronic Babylonian Library Platform. Journal of Open Humanities Data, 10 (1). ISSN 2059-481X
The publication is available under the license Creative Commons Attribution.
Abstract
This work presents a corpus of transliterated cuneiform tablets from the Electronic Babylonian Library (eBL) platform, including a public API endpoint to download the latest version of the data, and a Python library to parse the transliterations in ATF format. As of the time of writing, the constantly growing dataset contains around 25,000 tablets with over 350,000 lines of transliterated text. This dataset is a sizeable addition to open-source cuneiform data and a major milestone for research within the fields of cuneiform studies and NLP.
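To give a rough sense of what parsing transliterations in ATF format involves, the following is a minimal sketch that does not use the eBL Python library or its API. It assumes a simplified, ORACC-style ATF convention in which text lines begin with a label ending in a period (for example `1.` or `2'.`) and lines beginning with `&`, `@`, `$`, or `#` carry identifiers, structure, state, and annotations respectively. The names `AtfLine`, `classify_line`, and `count_text_lines` are hypothetical, and the actual eBL-ATF grammar is richer than this.

```python
import re
from typing import NamedTuple, Optional


class AtfLine(NamedTuple):
    """One classified line of a (simplified) ATF document."""
    kind: str             # "text", "id", "structure", "state", "annotation", or "other"
    label: Optional[str]  # line label such as "1" or "2'" (text lines only)
    content: str          # remainder of the line after the prefix or label


# Assumption: a text line is "<label>. <content>", where the label has no spaces or periods.
_TEXT_LINE = re.compile(r"^(?P<label>[^\s.]+)\.\s+(?P<content>.*)$")

# Assumption: single-character prefixes mark non-text lines.
_PREFIX_KINDS = {"&": "id", "@": "structure", "$": "state", "#": "annotation"}


def classify_line(raw: str) -> AtfLine:
    """Classify a single line by its prefix or by its leading line label."""
    line = raw.strip()
    if not line:
        return AtfLine("other", None, "")
    kind = _PREFIX_KINDS.get(line[0])
    if kind is not None:
        return AtfLine(kind, None, line[1:].strip())
    match = _TEXT_LINE.match(line)
    if match:
        return AtfLine("text", match["label"], match["content"])
    return AtfLine("other", None, line)


def count_text_lines(atf: str) -> int:
    """Count the lines that carry transliterated text."""
    return sum(1 for raw in atf.splitlines() if classify_line(raw).kind == "text")


if __name__ == "__main__":
    # Invented sample for illustration only; not taken from the corpus.
    sample = "@obverse\n1. a-na {d}UTU\n2'. [x x] LUGAL\n$ rest broken\n#note: sample"
    for raw in sample.splitlines():
        print(classify_line(raw))
    print("text lines:", count_text_lines(sample))
```

A line count of this kind is how a figure such as "lines of transliterated text" could be tallied over a corpus, although the published numbers were presumably produced with the project's own tooling.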
| Field | Value |
|---|---|
| Doc-Type | Article (LMU) |
| Organisational unit (Faculties) | 12 Cultural Studies > Department of Ancient and Modern Cultures > Assyriology and Hethitology |
| DFG subject classification of scientific disciplines | Humanities and social sciences |
| Date Deposited | 27. May 2024 11:11 |
| Last Modified | 07. Aug 2024 08:34 |
| URI | https://oa-fund.ub.uni-muenchen.de/id/eprint/1179 |
| DFG | Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) - 491502892 |