Workflow#
- class hpcflow.api.hpcflow.Workflow(path)#
Bases:
Workflow
- Parameters:
path (PathLike) –
- add_loop(loop, parent_loop_indices=None)#
Add a loop to a subset of workflow tasks.
- Parameters:
loop (Loop) –
parent_loop_indices (Dict) –
- Return type:
None
- add_submission(JS_parallelism=None)#
- Parameters:
JS_parallelism (Optional[bool]) –
- Return type:
- add_task(task, new_index=None)#
- Parameters:
task (Task) –
new_index (int | None) –
- Return type:
None
- add_task_after(new_task, task_ref=None)#
Adds the given new_task after the task specified in task_ref.
- add_task_before(new_task, task_ref=None)#
Adds the given new_task before the task specified in task_ref.
- app = BaseApp(name=hpcflow, version=0.2.0a39)#
- property artifacts_path#
- batch_update(is_workflow_creation=False)#
A context manager that batches up structural changes to the workflow and commits them to disk all together when the context manager exits.
- Parameters:
is_workflow_creation (bool) –
- Return type:
Iterator[None]
- check_parameters_exist(indices)#
- Parameters:
indices (int | List[int]) –
- Return type:
bool | List[bool]
- copy(path=None)#
Copy the workflow to a new path and return the copied workflow.
- Return type:
- property creation_info#
- delete()#
- classmethod from_JSON_file(JSON_path, path=None, name=None, overwrite=False, store='zarr', ts_fmt=None, ts_name_fmt=None)#
Generate from a JSON file.
- Parameters:
JSON_path (PathLike) – The path to a workflow template in the JSON file format.
path (str | None) – The directory in which the workflow will be generated. The current directory if not specified.
name (str | None) – The name of the workflow. If specified, the workflow directory will be path joined with name. If not specified the WorkflowTemplate name will be used, in combination with a date-timestamp.
overwrite (bool | None) – If True and the workflow directory (path + name) already exists, the existing directory will be overwritten.
store (str | None) – The persistent store to use for this workflow.
ts_fmt (str | None) – The datetime format to use for storing datetimes. Datetimes are always stored in UTC (because Numpy does not store time zone info), so this should not include a time zone name.
ts_name_fmt (str | None) – The datetime format to use when generating the workflow name, where it includes a timestamp.
- Return type:
- classmethod from_JSON_string(JSON_str, path=None, name=None, overwrite=False, store='zarr', ts_fmt=None, ts_name_fmt=None)#
Generate from a JSON string.
- Parameters:
JSON_str (PathLike) – The JSON string containing a workflow template parametrisation.
path (str | None) – The directory in which the workflow will be generated. The current directory if not specified.
name (str | None) – The name of the workflow. If specified, the workflow directory will be path joined with name. If not specified the WorkflowTemplate name will be used, in combination with a date-timestamp.
overwrite (bool | None) – If True and the workflow directory (path + name) already exists, the existing directory will be overwritten.
store (str | None) – The persistent store to use for this workflow.
ts_fmt (str | None) – The datetime format to use for storing datetimes. Datetimes are always stored in UTC (because Numpy does not store time zone info), so this should not include a time zone name.
ts_name_fmt (str | None) – The datetime format to use when generating the workflow name, where it includes a timestamp.
- Return type:
- classmethod from_YAML_file(YAML_path, path=None, name=None, overwrite=False, store='zarr', ts_fmt=None, ts_name_fmt=None)#
Generate from a YAML file.
- Parameters:
YAML_path (PathLike) – The path to a workflow template in the YAML file format.
path (str | None) – The directory in which the workflow will be generated. The current directory if not specified.
name (str | None) – The name of the workflow. If specified, the workflow directory will be path joined with name. If not specified the WorkflowTemplate name will be used, in combination with a date-timestamp.
overwrite (bool | None) – If True and the workflow directory (path + name) already exists, the existing directory will be overwritten.
store (str | None) – The persistent store to use for this workflow.
ts_fmt (str | None) – The datetime format to use for storing datetimes. Datetimes are always stored in UTC (because Numpy does not store time zone info), so this should not include a time zone name.
ts_name_fmt (str | None) – The datetime format to use when generating the workflow name, where it includes a timestamp.
- Return type:
- classmethod from_YAML_string(YAML_str, path=None, name=None, overwrite=False, store='zarr', ts_fmt=None, ts_name_fmt=None)#
Generate from a YAML string.
- Parameters:
YAML_str (PathLike) – The YAML string containing a workflow template parametrisation.
path (str | None) – The directory in which the workflow will be generated. The current directory if not specified.
name (str | None) – The name of the workflow. If specified, the workflow directory will be path joined with name. If not specified the WorkflowTemplate name will be used, in combination with a date-timestamp.
overwrite (bool | None) – If True and the workflow directory (path + name) already exists, the existing directory will be overwritten.
store (str | None) – The persistent store to use for this workflow.
ts_fmt (str | None) – The datetime format to use for storing datetimes. Datetimes are always stored in UTC (because Numpy does not store time zone info), so this should not include a time zone name.
ts_name_fmt (str | None) – The datetime format to use when generating the workflow name, where it includes a timestamp.
- Return type:
- classmethod from_file(template_path, template_format=None, path=None, name=None, overwrite=False, store='zarr', ts_fmt=None, ts_name_fmt=None)#
Generate from either a YAML or JSON file, depending on the file extension.
- Parameters:
template_path (PathLike) – The path to a template file in YAML or JSON format, and with a “.yml”, “.yaml”, or “.json” extension.
template_format (str | None) – If specified, one of “json” or “yaml”. This forces parsing from a particular format regardless of the file extension.
path (str | None) – The directory in which the workflow will be generated. The current directory if not specified.
name (str | None) – The name of the workflow. If specified, the workflow directory will be path joined with name. If not specified the WorkflowTemplate name will be used, in combination with a date-timestamp.
overwrite (bool | None) – If True and the workflow directory (path + name) already exists, the existing directory will be overwritten.
store (str | None) – The persistent store to use for this workflow.
ts_fmt (str | None) – The datetime format to use for storing datetimes. Datetimes are always stored in UTC (because Numpy does not store time zone info), so this should not include a time zone name.
ts_name_fmt (str | None) – The datetime format to use when generating the workflow name, where it includes a timestamp.
- Return type:
- classmethod from_template(template, path=None, name=None, overwrite=False, store='zarr', ts_fmt=None, ts_name_fmt=None)#
Generate from a WorkflowTemplate object.
- Parameters:
template (WorkflowTemplate) – The WorkflowTemplate object to make persistent.
path (PathLike | None) – The directory in which the workflow will be generated. The current directory if not specified.
name (str | None) – The name of the workflow. If specified, the workflow directory will be path joined with name. If not specified the WorkflowTemplate name will be used, in combination with a date-timestamp.
overwrite (bool | None) – If True and the workflow directory (path + name) already exists, the existing directory will be overwritten.
store (str | None) – The persistent store to use for this workflow.
ts_fmt (str | None) – The datetime format to use for storing datetimes. Datetimes are always stored in UTC (because Numpy does not store time zone info), so this should not include a time zone name.
ts_name_fmt (str | None) – The datetime format to use when generating the workflow name, where it includes a timestamp.
- Return type:
- classmethod from_template_data(template_name, tasks=Field(name=None, type=None, default=<dataclasses._MISSING_TYPE object>, default_factory=<function Workflow.<lambda>>, init=True, repr=True, hash=None, compare=True, metadata=mappingproxy({}), kw_only=<dataclasses._MISSING_TYPE object>, _field_type=None), loops=Field(name=None, type=None, default=<dataclasses._MISSING_TYPE object>, default_factory=<function Workflow.<lambda>>, init=True, repr=True, hash=None, compare=True, metadata=mappingproxy({}), kw_only=<dataclasses._MISSING_TYPE object>, _field_type=None), resources=None, path=None, workflow_name=None, overwrite=False, store='zarr', ts_fmt=None, ts_name_fmt=None)#
Generate from the data associated with a WorkflowTemplate object.
- Parameters:
template_name (str) – Name of the new workflow template, from which the new workflow will be generated.
tasks (Optional[List[Task]]) – List of Task objects to add to the new workflow.
loops (Optional[List[Loop]]) – List of Loop objects to add to the new workflow.
resources (Optional[Dict[str, Dict]]) – Mapping of action scopes to resource requirements, to be applied to all element sets in the workflow. resources specified in an element set take precedence of those defined here for the whole workflow.
path (Optional[PathLike]) – The directory in which the workflow will be generated. The current directory if not specified.
workflow_name (Optional[str]) – The name of the workflow. If specified, the workflow directory will be path joined with name. If not specified template_name will be used, in combination with a date-timestamp.
overwrite (Optional[bool]) – If True and the workflow directory (path + name) already exists, the existing directory will be overwritten.
store (Optional[str]) – The persistent store to use for this workflow.
ts_fmt (Optional[str]) – The datetime format to use for storing datetimes. Datetimes are always stored in UTC (because Numpy does not store time zone info), so this should not include a time zone name.
ts_name_fmt (Optional[str]) – The datetime format to use when generating the workflow name, where it includes a timestamp.
- Return type:
- get_EARs_from_IDs(indices)#
Return element action run objects from a list of five-tuples, representing the task insert ID, element index, iteration index, action index, and run index, respectively.
- Parameters:
indices (List[EAR_ID]) –
- Return type:
List[ElementActionRun]
- get_all_parameter_data()#
- Return type:
Dict[int, Any]
- get_element_iterations_from_IDs(indices)#
Return element iteration objects from a list of three-tuples, representing the task insert ID, element index, and iteration index, respectively.
- Parameters:
indices (List[IterationID]) –
- Return type:
List[ElementIteration]
- get_elements_from_IDs(indices)#
Return element objects from a list of two-tuples, representing the task insert ID, and element index, respectively.
- get_iteration_task_pathway()#
- get_parameter_data(index)#
- Parameters:
index (int) –
- Return type:
Tuple[bool, Any]
- get_parameter_source(index)#
- Parameters:
index (int) –
- Return type:
Dict
- get_task_elements(task, selection)#
- get_task_elements_islice(task, selection)#
- get_task_unique_names(map_to_insert_ID=False)#
Return the unique names of all workflow tasks.
- Parameters:
map_to_insert_ID (bool, optional) – If True, return a dict whose values are task insert IDs, otherwise return a list.
- Return type:
List[str] | Dict[str, int]
- is_parameter_set(index)#
- Parameters:
index (int) –
- Return type:
bool
- property loops: WorkflowLoopList#
- property name#
The workflow name may be different from the template name, as it includes the creation date-timestamp if generated.
- property num_added_tasks: int#
- property num_element_iterations: int#
- property num_elements: int#
- property num_loops: int#
- property num_submissions: int#
- property num_tasks: int#
- rename(new_name)#
- Parameters:
new_name (str) –
- save_parameter(name, value, submission_idx, jobscript_idx, JS_element_idx, JS_action_idx)#
- Parameters:
submission_idx (int) –
jobscript_idx (int) –
JS_element_idx (int) –
JS_action_idx (int) –
- save_parameters(values, submission_idx, jobscript_idx, JS_element_idx, JS_action_idx)#
Save multiple parameters to a given EAR.
- Parameters:
values (Dict) –
submission_idx (int) –
jobscript_idx (int) –
JS_element_idx (int) –
JS_action_idx (int) –
- set_EAR_end(submission_idx, jobscript_idx, JS_element_idx, JS_action_idx)#
Set the end time on an EAR.
- Parameters:
submission_idx (int) –
jobscript_idx (int) –
JS_element_idx (int) –
JS_action_idx (int) –
- Return type:
None
- set_EAR_start(submission_idx, jobscript_idx, JS_element_idx, JS_action_idx)#
Set the start time on an EAR.
- Parameters:
submission_idx (int) –
jobscript_idx (int) –
JS_element_idx (int) –
JS_action_idx (int) –
- Return type:
None
- set_EAR_submission_indices(sub_idx, EAR_indices)#
Set the submission index on an EAR.
- Parameters:
sub_idx (int) –
EAR_indices (Tuple[int, int, int, int]) –
- Return type:
None
- show_all_EAR_statuses()#
- property store_format#
- property submissions: List[Submission]#
- property submissions_path#
- submit(ignore_errors=False, JS_parallelism=None, print_stdout=False)#
- Parameters:
ignore_errors (bool | None) –
JS_parallelism (bool | None) –
print_stdout (bool | None) –
- Return type:
None
- property task_artifacts_path#
- property tasks: WorkflowTaskList#
- property template: WorkflowTemplate#
- property template_components: Dict#
- property ts_fmt#
- property ts_name_fmt#
- write_commands(submission_idx, jobscript_idx, JS_element_idx, JS_action_idx)#
Write run-time commands for a given EAR.
- Parameters:
submission_idx (int) –
jobscript_idx (int) –
JS_element_idx (int) –
JS_action_idx (int) –
- Return type:
None