summarize_manifest#
Workflow for summarizing files in the manifest.
(name)
└── YYYY-MM-DD
└── (name).SUMMARY.txt
For each search location, flow will attempt to find all files matching the specified series name. After applying include and exclude filters, the manifest is updated and a summary of files, grouped by extension, is printed and saved to a dated directory.
Flows
Main summarize manifest flow. |
- run_flow(context: ContextConfig, series: SeriesConfig, parameters: ParametersConfig) None [source]#
Main summarize manifest flow.
Configs
Context configuration for summarize manifest flow. |
|
Parameter configuration for summarize manifest flow. |
|
Series configuration for summarize manifest flow. |
- class ContextConfig[source]#
Context configuration for summarize manifest flow.
- working_location: str#
Location for input and output files (local path or S3 bucket).
- manifest_location: str#
Location of manifest file (local path or S3 bucket).
- class ParametersConfig[source]#
Parameter configuration for summarize manifest flow.
- update_manifest: bool = True#
True if the manifest file should be updated, False otherwise.
- search_locations: list[str]#
List of locations to search for files (local path or S3 bucket).
- include_filters: list[str]#
List of Unix filename patterns for files to include in summary.
- exclude_filters: list[str]#
List of Unix filename patterns for files to exclude from summary.
- class SeriesConfig[source]#
Series configuration for summarize manifest flow.
- name: str#
Name of the simulation series.
- manifest_key: str#
Key for manifest file.
- seeds: list[int]#
List of series random seeds.
- conditions: list[dict]#
List of series condition dictionaries (must include unique condition “key”).