The suite definition describes the static structure, it's not until the definition is loaded in the server, that we see its dynamic behaviour.
With ecflow python API, the dynamic behaviour of the suite can be simulated, ( i.e. in the same manner as the server).
Simulation has the following benefits:
- Exercise the suite definition. There is no need for '.ecf' files
- Allows for very easy experimentation.
- Can be done on the client-side, no need for server
- Can help in detecting deadlock's
- Will simulate with both 'real' and 'hybrid' clocks
- A year's simulation can be done in a few seconds
- Can be added as a unit test, to prevent regressions
The simulation relies on you adding simple verification attributes. (This is similar to c/c++/python asserts). These can be added to a task, family, and suite nodes. (see below for an example)
There are however restrictions. If the definition has large loops due to, crons or Repeat attributes, which run indefinitely, then in this case the simulation will never complete, and will timeout after a year's worth of run time.
This can be compensated for by adding start and end clock. If no start/end clock is specified, the simulator makes the following assumption about the simulation period.
- No time dependencies - simulate for 24 hours.
- day attributes - simulate for 1 week
- date attributes - simulate for 1 month
- cron attributes - simulate for 1 year
- repeat attributes - simulate for 1 year
Additionally if time base attributes like, time, today,cron has no minutes, then the simulator will use 1-hour resolution.
Here is an example of a text-based suite definition that uses a verify attribute, for which we want to check our assumption about the dynamic behaviour.
suite year # use real clock otherwise the date wont change clock real 1.1.2017 # define a start date for deterministic simulation endclock 1.1.2018 # When to finish. A endclock is *ONLY* for use with the simulator. family cronFamily task t cron -d 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31 -m 1,2,3,4,5,6,7,8,9,10,11,12 10:00 # run every day at 10am for a year verify complete:365 # verify that this task completes 365 times endfamily endsuite suite leap_year # use real clock otherwise the date wont change clock real 1.1.2016 # define a start date for deterministic simulation endclock 1.1.2017 # When to finish. A endclock is *ONLY* for use with the simulator. family cronFamily task t cron -d 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31 -m 1,2,3,4,5,6,7,8,9,10,11,12 10:00 # run every day at 10am for a year verify complete:366 # verify that this task completes 366 times in a leap year endfamily endsuite
This python segment shows how to load a text-based suite definition(cron.def) and simulate it in python.
import ecflow defs = Defs("cron.def") result = defs.simulate() assert len(result) == 0, "Expected simulation to return without any errors, but found:\n" + result
If the simulation does not complete it will produce two files, which will help in the analysis:
- defs.depth: This file shows a depth-first view, of why simulation did not complete.
- defs.flat: This shows a simple flat view, of why simulation did not complete
Both files will show which nodes are holding, and include the state of the holding trigger expressions.
Deadlock
This simulation is expected to fail since we have a deadlock/ race condition
suite dead_lock family family task t1 trigger t2 == complete task t2 trigger t1 == complete endfamily endsuite | simulate a deadlock. Create definition in python from ecflow import Defs,Suite,Family,Task,Trigger defs = Defs( Suite("dead_lock", Family('family', Task('t1', Trigger("t2 == complete")), Task('t2', Trigger("t1 == complete"))))) theResult = defs.simulate(); # simulate the definition assert len(theResult) != 0, "Expected simulation to return errors" print theResult |