euroeval.model_cache

ModelCache class for caching model outputs.

Classes

Functions

split_dataset_into_cached_and_non_cached — Split a dataset into a cached and non-cached part.
load_cached_model_outputs — Load the cached model outputs.

source class ModelCache(model_cache_dir: Path, cache_name: str, max_generated_tokens: int)

A cache for model outputs.

Initialise the model output cache.

Attributes

model_cache_dir — The directory to store the cache in.
cache_path — The path to the cache file.
cache — The model output cache.
max_generated_tokens — The maximum number of tokens to generate for each example.

Parameters

model_cache_dir : Path — The directory to store the cache in.
cache_name : str — The name of the cache file.
max_generated_tokens : int — The maximum number of tokens to generate for each example.

Methods

Load the model output cache.

Save the model output cache to disk.

Remove the cache from memory and delete it from disk.

source method ModelCache.add_to_cache(model_inputs: dict, model_output: GenerativeModelOutput) → None

Add the model input/output to the cache.

Parameters

source split_dataset_into_cached_and_non_cached(dataset: Dataset, cache: ModelCache) → tuple['Dataset', 'Dataset']

Split a dataset into a cached and non-cached part.

Parameters

Returns

Load the cached model outputs.

Parameters

Returns