Changelog#

All notable changes to chronocratic-datasets will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning. Entries are managed with towncrier; see changelog.d/ for unreleased changes.

v0.1.0a3 (2026-06-11)#

Changed#

  • Renamed WeatherModule to WeatherDataModule and ElectricityLoadModule to ElectricityLoadDataModule for naming consistency. (#34)

  • Auto-create towncrier fragment on feature→dev PRs. (#36)

Fixed#

  • Use copy=True in DataFrame.to_numpy() to avoid a non-writable tensor warning from PyTorch. (#34)

v0.1.0a1 (2026-06-10) — First Alpha Release#

The first pre-release of chronocratic-datasets. This alpha introduces the complete set of time series datasets, a clean and type-safe API, and full PyTorch Lightning integration.

Expect breaking changes before the 1.0 release. Feedback is welcome.

Added#

  • Forecasting datasets: ETT, Weather, Electricity with LightningDataModule integration

  • Classification datasets: UCR (univariate) and UEA (multivariate) benchmarks

  • ForecastingLoaderMode enum: RAW_SERIES, INPUT_TARGET, INPUT_ONLY

  • ClassificationLoaderMode enum: SAMPLE_ONLY, SAMPLE_LABEL

  • ForecastingMode enum: UNIVARIATE, MULTIVARIATE

  • Data caching: Automatic NPZ caching for downloaded and preprocessed data

  • Data scaling: Configurable normalization via scikit-learn scalers

  • DDP compliance: All data modules work with distributed training strategies

  • Utility functions: Cache management, feature extraction, ARFF parsing, collation

  • Package structure: Full __init__.py with 49 re-exported public symbols

  • BSD 3-Clause license

  • Sphinx documentation with autodoc-generated API reference

Notes#

  • Namespace is chronocratic.datasets (installed via chronocratic-datasets on PyPI).

  • Requires Python 3.12+.

  • Uses PyTorch Lightning as the primary training framework integration.