Outcome
An outcome diagram should:
- Show the "physical" items that exist / yet-to-exist / to-be-cleaned.
- Show which steps are linked to it, e.g. clicking on a file shows the steps that write to / read from the file.
Determining Information for Rendering
To render the outcome diagram, we need to deduce the physical things from Item
s, and determine:
- Source: where data comes from, whether completely from parameters, or whether parameters are a reference to the data.
- Destination: where data moves to or the system work is done to.
- Whether the source or destination are declared in parameters.
As of 2024-02-18, Item::Params
is a single type, which would take in both source and destination parameters, so we cannot (realistically) determine the source/destination/host/cloud from the Item::Params
type.
Source and Destination
An Item
is actually a step that encapsulates:
- Source location.
- Destination location.
This can be further broken down into:
/// Can be source / destination
enum ItemLocation {
File {
host: Host,
path: PathBuf,
},
Cloud {
cloud_provider: CloudProvider,
global_or_region: GlobalOrRegion,
subnet: Option<Subnet>,
url: URL,
name: String,
},
}
Instead of using a flat enum, we may want to use a fixed set of data structures for simplicity, or a trait
for extensible implementations.
Data structure option
enum Realm {
Host {
address: String,
port: Option<u16>,
},
Cloud {
cloud_provider: CloudProvider,
global_or_region: GlobalOrRegion,
subnet: Option<Subnet>,
url: URL,
name: String,
},
}
struct ItemLocation {
/// Host or Cloud provider
realm: Realm,
/// Not necessarily a file path, but segments denoting hierarchy
path: Option<String>,
}
Trait option
trait Name {
fn realm(&self) -> Realm;
}
Note that we want to support nesting, e.g. an application server is "just a host", which may be nested within a network, or a cloud provider, or nothing.
How should we do the nesting:
-
Strictly defined levels, implementor selects:
- Cloud
- Availability Zone
- Network
- Subnet
- Host
- Path
-
Arbitrary levels, levels are not necessarily defined by network-hierarchy.
Note: This doesn't allow consistency across item implementations, or, it requires the tool developer to pass the level into the item.
Actually, what we can do is:
- Allow the item implementor to specify the list of parents that this item resides in.
- Each parent may be an arbitrary layer around the current item.
- The peace framework will have a
struct Layer(IndexMap<LayerId, Layer>);
-- this is essentiallyNodeHierarchy
fromdot_ix
. - When the
LayerId
s from different items match up, those are inserted into this map. - When we render, we render layer by layer as the hierarchy.
What we want
- Tool developer passes in parameters for the item.
- The correct realm for the item is automatically deduced.
e.g.
Tool Developer
let file_download_params_spec = FileDownloadParams::new(..).into();
Item Implementor
File Download
impl Item for FileDownload {
fn location_from(
params_partial: &<Self::Params<'_> as Params>::Partial,
data: Self::Data<'_>,
) -> Vec<ItemLocation> {
let host = params_partial
.src()
.map(Url::host)
.map(Host::to_owned);
let port = params_partial
.src()
.map(Url::port_or_known_default);
let url = params_partial.src();
let mut item_locations = Vec::new();
if let Some(host) = host {
item_locations.push(ItemLocation::Host { host, port });
if let Some(url) = url {
// May be rendered using the last segment of the URL as the node name.
// The full URL may be used as the tooltip.
item_locations.push(ItemLocation::Url(url));
}
}
item_locations
}
fn location_to(params: &<Self::Params<'_> as Params>::Partial) -> Vec<ItemLocation> {
let path = params_partial
.dest()
.map(Path::to_string_lossy)
.map(Cow::to_owned);
vec![
ItemLocation::Localhost,
ItemLocation::Path(path),
]
}
}
S3Bucket
impl Item for S3Bucket {
fn location_from(
params_partial: &<Self::Params<'_> as Params>::Partial,
data: Self::Data<'_>,
) -> Vec<ItemLocation> {
vec![ItemLocation::Localhost]
}
fn location_to(
params: &<Self::Params<'_> as Params>::Partial,
data: Self::Data<'_>,
) -> Vec<ItemLocation> {
let region = data.region();
let availability_zone = params.availability_zone();
let name = params.name();
let mut item_locations = Vec::with_capacity(10);
item_locations.push(ItemLocation::CloudProvider(CloudProvider::Aws));
item_locations.push(ItemLocation::Group(region.to_string()));
if let Some(availability_zone) = availability_zone {
item_locations.push(ItemLocation::Group(availability_zone.to_string()));
if let Some(name) = name {
item_locations.push(ItemLocation::Name(name.to_string()));
}
}
}
}
Framework Maintainer
Go through all items' Item::location_*
and add to outcome diagram -- see flow_spec_info.rs
.
Problems
A: How do we get items to be consistent with the item locations they publish?
Cloud provider name, region, availability zone, etc.
Options:
-
🔴 We could provide a crate with common enums.
This requires:
- The framework to have knowledge of many many types, or be inaccurate.
- Item implementors to always be up to date.
It's a lot of maintenance, so probably not a good idea.
-
🟡 Item provides default
ItemLocation
s, developer can pass in names.This isn't too bad, as it allows tool developers control over names of the items.
It still requires item implementors to use the same
ItemLocation
variant.It could mean additional development burden.
- If we go with sensible defaults, then we hope that item implementors use the same conventions.
- Tool developers passing in names to the items is an escape hatch in case the conventions are different between item implementations.
- Item implementors have to take in the names instead of only default them.
If item implementors have consistent values for their
ItemLocation
s, then the inconsistency downside is alleviated partially.For example, extracting the host / port from a
Url
could be offloaded to the framework:#[derive(Debug)] enum ItemLocation { Host(ItemLocationHost), Url(Url), } struct ItemLocationHost { host: Host<String>, port: Option<u16>, } impl ItemLocation { fn from_url(url: &Url) -> Self { Self::Url(url.clone()) } } impl From<&Url> for ItemLocationHost { fn from(url: &Url) -> Self { let host = url .map(Url::host) .map(Host::to_owned) .expect("Expected URL to contain a host."); let port = url .map(Url::port_or_known_default); Self { host, port } } }
Item implementors will implement
Item::location_from
andItem::location_to
like so:impl Item for FileDownload { fn location_from( params_partial: &<Self::Params<'_> as Params>::Partial, data: Self::Data<'_>, ) -> Vec<ItemLocation> { params_partial .src() .as_ref() .map(|src| { vec![ ItemLocation::from_server(src), ItemLocation::from_url(src), ] }) .unwrap_or_else(Vec::new) } fn location_to( params: &<Self::Params<'_> as Params>::Partial, ) -> Vec<ItemLocation> { let path = params_partial .dest() .map(Path::to_string_lossy) .map(Cow::to_owned); vec![ ItemLocation::Localhost, ItemLocation::Path(path), ] } }
-
🟡 Developer specifies
ItemLocation
s (from and to) for every item.Removes the item location inconsistency issue, but also means every tool developer has to specify item locations for every item, where this work could be commonized.
This also addresses the problem B below, where the developer has control over the level of detail.
Essentially we are placing the burden of defining the outcome diagram onto the developer.
This isn't "nice" when the developer has to extract host names from URLs (common burden).
Maybe the
Item
implementor provides a method like this:impl Item for MyItem { fn item_location_from( &self, level: MyItemLocationLevel, params_partial: &<Self::Params<'_> as Params>::Partial, ) -> ItemLocation { match level { MyItemLocationLevel::Host => { let host = params_partial .src() .map(Url::host) .map(Host::to_owned); let port = params_partial .src() .map(Url::port_or_known_default); ItemLocation::Host { host, port } } _ => todo!(), } } }
This interface is awkward -- we don't know the params_partial at the point of flow definition (for the developer).
It could be the developer that defines that function above, combined with framework helpers:
let my_item_locations = MyItem::item_locations_spec() .with_locations_from(|my_item, params_partial| { // with the `From` implementations for 2. above: params_partial .src() .as_ref() .map(|src| { vec![ ItemLocation::from_server(src), ItemLocation::from_url(src), ] }) .unwrap_or_else(Vec::new) }) .with_locations_to(todo!("similar to above"));
-
🔴 Magically infer based on parameter names.
Too inaccurate.
-
🔴 Item implementors use a derive proc-macro.
Too many annotations -- probably easier to write the trait directly.
B: How does control over whether a group is drawn get defined?
- Users may not want so many levels of detail -- it can be overwhelming.
- Developers may want sensible defaults / not require them to set whether a group is drawn.
- Developers may want to set whether a group is drawn.
Options:
-
🟡 They can't, everything is always shown.
Not the best user experience -- too much detail can overwhelm.
-
🟡 Item implementor guidelines to not include too many layers.
May be okay? It means either we have too much / too little information sometimes.
Perhaps this is diminishing returns, and it doesn't matter too much.
-
🟡 Developer specifies
ItemLocation
s (from and to) for every item.Same as A:3 above.
-
🟡 Item takes in parameter whether each layer is visible.
Probably in the
Item
constructor, so we introduce a compilation error:fn new( /* existing params */ #[cfg(feature = "web")] layer_visibility: MyItemLayerVisibility, ) -> Self { /* .. */ } struct MyItemLayerVisibility { cloud: bool, availability_zone: bool, subnet: bool, }
In tandem with A:2 above, maybe it's not worth the complexity.
-
🟡 Draw everything, developer provides separate information for layers.
- Probably want the
Item
to export what layers it draws. - Developer passes a
Map<ItemLocation, DrawEnabled>
to the framework.
Feels weird / unintuitive as a design, not too weird as a user: "I see everything, I want to turn that one off.".
- Probably want the
C: Consistent Diagram Layout
If the ItemLocation
s changes, such as the number of ItemLocation
s change (hierarchy level change), or the ItemLocation
variants change (different rendering style applied), then the layout of the diagram can change:
- Position of each box can change: pushing what used to be "here", "there".
- Width / height of each box can expand: it doesn't fit in the screen.
- Colours can change: different
ItemLocation
type. - Edges can change position.
It can be confusing to follow if these keep changing, which is counter productive to the intent of communicating clearly which physical systems are interacting.
Options:
-
🟡 Don't solve it now, we don't know how much of a problem it is.
dot_ix
's HTMLDivDiag
is not very mature anyway. We may end up developing this a lot more.This is taking a bet that we won't have to do a large refactor to the Item API later. Probably alright since there aren't any consumers of Peace just yet.
-
🔴 Try and match nodes, and animate.
Too magic / inaccurate?
-
🟡 Require / request
Item
implementors to always provide the same number ofItemLocation
s.May mean every
ItemLocation
that is unknown, is still populated:let item_location_server = item_location_server.unwrap_or(ItemLocation::HostUnknown); let item_location_url = item_location_url.unwrap_or(ItemLocation::UrlUnknown); vec![ item_location_server, item_location_url, ]
Solution
We'll go with:
-
A:2: Item implementors provide
ItemLocation
s -
B:1: Everything is shown.
There is still the possibility to migrate to B:4 (
Item
s take in visibility flags) or B:5 (developers pass in visibility to framework) later. -
C:1: Don't solve the consistent diagram layout now.
What Rendering Makes Sense
Conceptually, Item
s can be thought of either an edge or a node:
- Edge: The item represents an action / work to between the source(s) and the destination(s).
- Node: The item represents the destination thing.
Use Cases
- Upload a file -- one source, one dest.
- Download a file -- one source, one dest.
- Launch servers -- one source (localhost), one dest (AWS).
- Wait for servers to start up -- multiple within (do we need the
ItemLocationTree
for the cloud provider / subnet context? or leverage previous resource tracking to work it out?). - Wait for endpoints to become available -- one source, multiple dest (query each endpoint).
- Do we want
ItemInteraction
s to be queried multiple times whileApply
is happening? -- i.e. some servers may have started up, and so we need theItem
to report that to us. - Notably, we want these
ItemInteraction
s to be queryable without the items actually existing -- so we can generate diagrams to demonstrate what would happen upon execution.
Naive
Consider the following diagram, which is the first attempt at rendering an outcome diagram on 2024-02-18 -- this uses edges / hierarchy to draw nodes and edges:
Notes
- It is not clear where the
app_download
Item
transfers the file from, or to. - It is not clear that the
iam_policy
,iam_role
, andinstance_profile
are defined in AWS. - The links between
iam_policy
,iam_role
, andinstance_profile
should probably be reversed, to indicate what references what. - The
s3_object
item transfers the downloaded application file, to the S3 bucket created bys3_bucket
, but it isn't clear whether we should be highlighting the link, or the nested node. - If we highlight the link, then note that it is a forward link (data is pushed), compared to point 3 which are backward links (references).
Hosts, Realms, and Edges
If we could show:
- The hosts of all network communication
- The "realm" where resources live in, which may be a cloud provider or a region within
- A node for each resource that is created/modified/deleted
- For each item, the edges and nodes that interact
then what is happening becomes slightly clearer:
Notes
- There is only one level of detail.
- It is useful to have expandable detail, e.g. hide a full URL, and allow the user to expand it if they need to.
- It is useful to show animated edges while that step is in progress, and hide them after.
Old idea for code
Item Implementor
File Download
impl Item for FileDownload {
fn location_from(&self, params_partial: &<Self::Params<'_> as Params>::Partial)
-> ItemLocation {
let address = params_partial
.src()
.map(Url::host_str)
.unwrap_or("<to_be_determined>")
.to_string();
let path = params_partial
.src()
.map(Url::path);
ItemLocation {
realm: Realm::Host { address },
path,
}
}
fn location_to(&self, params: &<Self::Params<'_> as Params>::Partial) -> ItemLocation {
let address = "localhost".to_string();
let path = params_partial
.dest()
.map(Path::to_string_lossy)
.map(Cow::to_owned);
ItemLocation {
realm: Realm::Host { address },
path,
}
}
}
S3Bucket
impl Item for S3Bucket {
fn location_from(&self, params_partial: &<Self::Params<'_> as Params>::Partial)
-> ItemLocation {
let address = "localhost"; // or None
ItemLocation::Host { address }
}
fn location_to(&self, params: &<Self::Params<'_> as Params>::Partial) -> ItemLocation {
ItemLocation::Cloud {
cloud_provider: CloudProvider::Aws,
global_or_region: GlobalOrRegion::Region(Region::ApSoutheast2),
subnet: None,
url: URL::from(".."),
name: params.name.unwrap_or("<to_be_determined>"),
}
}
}