Datasets license & citation
This page covers machine-readable datasets and endpoints published on this site (dataset bytes, manifests/schemas, and AI-oriented JSON/YAML/TXT endpoints).
License
Datasets are licensed under Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0):
- https://creativecommons.org/licenses/by-nc/4.0/
That means:
- ✅ You can share and adapt the datasets for non-commercial use
- ✅ Attribution is required
- ✅ Attribution must include a source link to the canonical dataset URL
- ❌ You cannot use the datasets for commercial purposes without permission
How to cite
Minimum attribution should include:
- Author: Dzmitryi Kharlanau
- Source: a clickable link to the canonical dataset URL (preferred) or https://dkharlanau.github.io/datasets/
- DOI: 10.5281/zenodo.18862098
- License: CC BY-NC 4.0
Example:
Dzmitryi Kharlanau. “<Title>” (dataset bytes). CC BY-NC 4.0. DOI: 10.5281/zenodo.18862098. <Canonical URL>
Zenodo:
- Concept DOI: 10.5281/zenodo.18862098
- Version DOI for
v1.0.0: 10.5281/zenodo.18862097 - Dataset repository: https://github.com/dkharlanau/dkharlanau-datasets
Commercial licensing
Commercial licensing inquiries:
- https://www.linkedin.com/in/dkharlanau
Site materials
This license page applies to datasets only. Site code/text/design/media may be licensed differently (see repository LICENSE).