summarize_manifest

Workflow for summarizing files in the manifest.

(name)
└── YYYY-MM-DD
    └── (name).SUMMARY.txt

For each search location, the flow attempts to find all files matching the specified series name. After the include and exclude filters are applied, the manifest is updated (when update_manifest is True) and a summary of the files, grouped by extension, is printed and saved to a dated directory.
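The grouping and dated output path described above can be sketched as follows. This is a minimal illustration, not the flow's actual implementation: the helper names summarize_by_extension and summary_key are hypothetical, and the assumption that a file's extension is its final suffix is ours.

```python
from collections import Counter
from datetime import date
from pathlib import PurePosixPath


def summarize_by_extension(file_keys):
    """Count files per extension (hypothetical helper, not part of the flow)."""
    counts = Counter()
    for key in file_keys:
        # Assumption: the extension is the final suffix only;
        # files without a suffix are grouped under the empty string.
        counts[PurePosixPath(key).suffix.lstrip(".")] += 1
    return dict(counts)


def summary_key(name, today=None):
    """Build the dated summary path: (name)/YYYY-MM-DD/(name).SUMMARY.txt."""
    today = today or date.today()
    return f"{name}/{today:%Y-%m-%d}/{name}.SUMMARY.txt"
```

Passing the date explicitly keeps the path reproducible in tests; omitting it falls back to the current date.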

Flows

run_flow

Main summarize manifest flow.

run_flow(context: ContextConfig, series: SeriesConfig, parameters: ParametersConfig) -> None

Main summarize manifest flow.

Configs

ContextConfig

Context configuration for summarize manifest flow.

ParametersConfig

Parameter configuration for summarize manifest flow.

SeriesConfig

Series configuration for summarize manifest flow.

class ContextConfig

Context configuration for summarize manifest flow.

working_location: str

Location for input and output files (local path or S3 bucket).

manifest_location: str

Location of manifest file (local path or S3 bucket).

class ParametersConfig

Parameter configuration for summarize manifest flow.

update_manifest: bool = True

True if the manifest file should be updated, False otherwise.

search_locations: list[str]

List of locations to search for files (local path or S3 bucket).

include_filters: list[str]

List of Unix filename patterns for files to include in summary.

exclude_filters: list[str]

List of Unix filename patterns for files to exclude from summary.
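As an illustration of how Unix filename patterns behave, the sketch below filters a list of file keys with Python's standard fnmatch module. The helper apply_filters is hypothetical, and the semantics are assumptions rather than documented behavior: a file is kept when it matches at least one include pattern (or the include list is empty) and matches no exclude pattern, with patterns tested against the whole key.

```python
from fnmatch import fnmatchcase


def apply_filters(file_keys, include_filters, exclude_filters):
    """Keep keys matching any include pattern and no exclude pattern.

    Hypothetical helper; the flow's real filter semantics may differ.
    """
    kept = []
    for key in file_keys:
        # Assumption: an empty include list means "include everything".
        included = not include_filters or any(
            fnmatchcase(key, pattern) for pattern in include_filters
        )
        excluded = any(fnmatchcase(key, pattern) for pattern in exclude_filters)
        if included and not excluded:
            kept.append(key)
    return kept
```

fnmatchcase is used instead of fnmatch so that matching is case-sensitive on every platform.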

class SeriesConfig

Series configuration for summarize manifest flow.

name: str

Name of the simulation series.

manifest_key: str

Key for manifest file.

seeds: list[int]

List of series random seeds.

conditions: list[dict]

List of series condition dictionaries. Each dictionary must include a unique condition “key”.
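The uniqueness requirement on condition keys can be checked before a flow is run. The sketch below is a hypothetical validation helper, not part of the documented API; the example condition fields other than "key" are made up for illustration.

```python
def check_condition_keys(conditions):
    """Return the condition keys, raising ValueError if any is missing or repeated.

    Hypothetical helper; the flow itself may validate conditions differently.
    """
    keys = []
    for index, condition in enumerate(conditions):
        if "key" not in condition:
            raise ValueError(f"condition at index {index} has no 'key'")
        keys.append(condition["key"])

    duplicates = sorted({key for key in keys if keys.count(key) > 1})
    if duplicates:
        raise ValueError(f"duplicate condition keys: {duplicates}")
    return keys


# Example conditions; "rate" is an illustrative field, not a documented one.
example_conditions = [
    {"key": "control", "rate": 0.0},
    {"key": "treated", "rate": 0.5},
]
```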