Exercise 21.2 Gremlin Queries for Confluence
Objectives
The objective of this exercise is to help new users of Syndeia graph analysis to formulate Gremlin queries to analyze their graph with the Syndeia Web Dashboard. The specific learning objectives of this exercise are to create lists of
· Confluence artifacts of a specific artifact type
· Confluence artifacts in a specific Container
· Confluence artifacts connected as part of a specific Syndeia Project
Preparation
This exercise assumes the student has
· Syndeia Cloud 3.3 or 3.4 installed with a valid user account, and
· An existing Syndeia graph containing Confluence Pages connected to elements in other repositories.
Because the content of your Syndeia graph will be different, the specific examples in the following exercise instructions are only a guide and example for your actions. It is generally advisable to carry out these exercises in a non-production repository, a “sandbox”, set up for training and practice purposes.
See the tutorials under Syndeia Cloud Web-Dashboard/Page 19 – Syndeia Cloud Graph Analysis for an overview of this feature.
Background – Syndeia Cloud Data Model
Figure 1 provides a simplified schema for elements in the Syndeia Cloud graph. All graph nodes are either Repositories, Containers, or Artifacts, where each Artifact is owned by a Container and Each Container is owned by a Repository. Each has a Type; the set of ContainerTypes and ArtifactTypes are owned by the Repository. How the Confluence data model maps to the Syndeia Cloud data model is discussed in the next section.
Figure 1 Syndeia Cloud Schema (simplified)
Background – Confluence
As of Syndeia release 3.4, the Syndeia Web Dashboard can extract and display some model information from a Confluence repository. Figure 1 shows a tree view of this information, with labels identifying the Confluence element types. Note the different icons. The label color coding indicates how the Confluence element type is mapped to the Syndeia Cloud element types: Repository (green), Container (red), and Artifact (blue). The Syndeia Confluence integration supports standard Confluence Pages. A more complete diagram of the Confluence data model as it is understood by Syndeia is available through the web dashboard help menu on the left.
It is also important to understand the limitations of graph queries with respect to the Confluence repositories. As of Syndeia 3.4, graph queries cannot extract the internal structure of a Confluence repository, i.e. they cannot be used to obtain the full structure of the Confluence repository or internal (intra-model) relations between Confluence artifacts. In this way, graph queries are different than the Tree View or Graph View (Digital Thread Explorer) capabilities of the Repositories page in the Syndeia Web Dashboard. Graph queries are most useful in viewing inter-model connections from Confluence elements to other repositories.
Figure 2 Tree view of Confluence repository
Exercise
Log on to the Syndeia Cloud Web Dashboard (see Video 1.9) and click on the Graph Queries icon on the left border.
The first task is to compile a list of Confluence Artifacts of a specific type. Per Figure 1, ArtifactTypes are owned by (specific to) a Repository. We typically want to begin by creating a list of Artifact types available in such a Repository.
If we use Query Builder (Figure 3), we select ArtifactType from the pull-down menu under Label.
Figure 3 Graph Queries page (icon outlined in red) – Query Builder
To restrict the list of ArtifactTypes to our current Confluence repository, we click Filters. We will filter by the name of our Repository, so we select Repository from the pull-down menu at the top marked Property of. Under Property Key, we select the Name property and under Property Value, we enter Confluence @ Intercax. We then click the Plus (+) button to add the filter in the bottom list and the window should look similar to Figure 4. Click Close.
Figure 4 Query Builder Filters window
Back on the Graph Queries page, click Run. The results, a list of all ArtifactTypes in Confluence @ Intercax, may be displayed in table form as in Figure 5. Key ArtifactType properties in the table are Name and Key because we will use these in the next search. Click the Exports icon to export the list as a CSV file for future reference, if desired.
Note at the top of Figure 5, the Query Builder utility has created a Gremlin query. We could have performed the same search with the same results by going to the Raw Query mode and entering this query directly. g.V().has('sLabel','ArtifactType').where(outE().has('sLabel','ownedBy').inV().has('name','Confluence @ Intercax'))
The final Page of the first task is to generate a list of all Artifacts of type Page within the Confluence @ Intercax Repository. Note that Syndeia will return only those Confluence Pages that are connected within the Syndeia Cloud graph, not all Pages in the repository.
We can search by ArtifactType Name (Page) or Key (ART-TYPE220), which we got from the table in Figure 5. Generally, it is better to search by Key, which is unique within the Syndeia Cloud database, rather than Name, which is not unique.
Figure 5 Graph Queries page, ArtifactTypes results in table format (Export icon highlighted in red)
If we use Query Builder, we select Artifact from the pull-down menu under Label, as in Figure 6.
Figure 6 Query Builder, Artifact search
To restrict the list of Artifacts to the Confluence Page type, we click Filters. We will filter by the ArtifactType Key, so we select ArtifactType from the pull-down menu at the top marked Property of. Under Property Key, we select the sKey property and under Property Value, we enter ART-TYPE249, which we took from the table in Figure 5. After we click the Plus (+) icon, the Filters window should look like Figure 7. Click Close.
Figure 7 Query Builder Filters window, filter by ArtifactType sKey
Back on the Graph Queries page, click Run. The results, a list of all Artifacts of type ART-TYPE249, which is owned by the repository Confluence @ Intercax, may be displayed in table form as in Figure 8. Click the Exports icon to export the list as a CSV file for future reference, if desired.
Figure 8 Graph Queries page, Artifacts results in table format, truncated (Export icon outlined in red)
Note at the top of Figure 5, the Query Builder utility has created a Gremlin query. We could have performed the same search with the same results by going to the Raw Query mode and entering this query directly. g.V().has('sLabel','Artifact').where(outE().has('sLabel','hasType').inV().has('sKey','ART-TYPE249'))
The second task is to compile a list of Confluence Artifacts in a specific Confluence Space. Per Figure 2, Spaces in Confluence are Containers and are owned by (specific to) a Repository. We will begin by creating a list of Containers available in a Confluence Repository.
If we use Query Builder (Figure 9), we select Container from the pull-down menu under Label.
Figure 9 Graph Queries page (icon outlined in red) – Query Builder
To restrict the list of Containers to our current Confluence repository, we click Filters. We will filter by the name of our Repository, so we select Repository from the pull-down menu at the top marked Property of. Under Property Key, we select the Name property and under Property Value, we enter Confluence @ Intercax. We then click the Plus (+) button to add the filter in the bottom list and the window should look similar to Figure 10. Click Close.
Figure 10 Query Builder Filters window
Back on the Graph Queries page, click Run. The results, a list of all Containers in Confluence @ Intercax may be displayed in table form as in Figure 11. Key Container properties in the table are Name and Key because we will use these in the next search. Click the Exports icon to export the list as a CSV file for future reference, if desired.
Caution: The list of Confluence Containers listed in Figure 11 does not include all Spaces in the Confluence @ Intercax repository. Only those Containers that own Artifacts that are connected to other models (or are connected directly themselves) appear on the list. Other Confluence Containers that do not involve connections to other repositories are not part of the Syndeia Cloud graph and do not appear in Gremlin graph query results.Figure 11 Graph Queries page, Containers results in table format (Export icon highlighted in red)
Note at the top of Figure 11, the Query Builder utility has created a Gremlin query. We could have performed the same search with the same results by going to the Raw Query mode and entering this query directly.
g.V().has('sLabel','Container').where(outE().has('sLabel','ownedBy').inV().has('name','Confluence @ Intercax'))
The final part of the second task is to generate a list of all Artifacts in a specific Container within the Confluence @ Intercax Repository. Note that Syndeia will return only those Confluence Pages that are connected within the Syndeia Cloud graph, not all Pages in the repository.
We can search by Container Name (“Intercax Knowledge Base (1)”) or Key (CONT738), which we got from the table in Figure 11. Generally, it is better to search by Key, which is unique within the Syndeia Cloud database, rather than Name, which is not unique.
If we use Query Builder, we select Artifact from the pull-down menu under Label, as in Figure 12.
Figure 12 Query Builder, Artifact search
To restrict the list of Artifacts to the Confluence Product UAV, we click Filters. We will filter by the Container Key, so we select Container from the pull-down menu at the top marked Property of. Under Property Key, we select the sKey property and under Property Value, we enter CONT738, which we took from the table in Figure 11. After we click the Plus (+) icon, the Filters window should look like Figure 13. Click Close.
Figure 13 Query Builder Filters window, filter by Container sKey
Back on the Graph Queries page, click Run. The results, a list of all Artifacts in Container CONT738, which is owned by the repository Confluence @ Intercax, may be displayed in table form as in Figure 14. Note that only Confluence Pages that are part of the Syndeia Cloud graph appear; there may be other Confluence Pages in this Space without connections to other repositories that do not appear
Figure 14 Graph Queries page, Artifacts results in table format, truncated
Note at the top of Figure 14, the Query Builder utility has created a Gremlin query. We could have performed the same search with the same results by going to the Raw Query mode and entering this query directly.
g.V().has('sLabel','Artifact').where(outE().has('sLabel','ownedBy').inV().has('sKey','CONT738'))14. The third task is to compile a list of Confluence Artifacts that are connected as part of a specific Syndeia Project. Syndeia Projects are partitions within the Syndeia Cloud graph database that separate different projects or system models. Syndeia Projects are Containers owned by the Syndeia Repository. Unlike Confluence Spaces, Syndeia Projects contain only relations, the inter-model relations that define the “macrostructure” of the Digital Thread for that system or project. In this case, we are looking not for the Confluence Pages directly; we are looking for inter-model connections where one end is a Confluence Page.
We will begin by creating a list of Containers available in the Syndeia Repository.
If we use Query Builder (Figure 15), we select Container from the pull-down menu under Label.
Figure 15 Graph Queries page (icon outlined in red) – Query Builder
To restrict the list of Containers to the Syndeia repository, we click Filters. We will filter by the name of our Repository, so we select Repository from the pull-down menu at the top marked Property of. Under Property Key, we select the Name property and under Property Value, we enter Syndeia Repository. We then click the Plus (+) button to add the filter in the bottom list and the window should look similar to Figure 16. Click Close.
Figure 16 Query Builder Filters window
Back on the Graph Queries page, click Run. The results, a list of all Containers in the Syndeia Repository may be displayed in table form as in Figure 17. Key Container properties in the table are Name and Key because we will use these in the next search. Click the Exports icon to export the list as a CSV file for future reference, if desired.
Figure 17 Graph Queries page, Containers results in table format, truncated
Note at the top of Figure 17, the Query Builder utility has created a Gremlin query. We could have performed the same search with the same results by going to the Raw Query mode and entering this query directly.
g.V().has('sLabel','Container').where(outE().has('sLabel','ownedBy').inV().has('name','Syndeia Repository'))The next Page of the third task is to generate a list of all Relations within a specific Syndeia Project.
We can search by Container Name (“Dirk Sandbox 11”) or Key (DZSB11), which we got from the table in Figure 17. Generally, it is better to search by Key, which is unique within the Syndeia Cloud database, rather than Name, which is not unique.
If we use Query Builder, we select Relation from the pull-down menu under Label, as in Figure 18. Remember, the Syndeia Projects contain relations, not artifacts.
Figure 18 Query Builder, Artifact search
To restrict the list of Relations to a specific Syndeia Project, we click Filters. We will filter by the Container Key, so we select Container from the pull-down menu at the top marked Property of. Under Property Key, we select the sKey property and under Property Value, we enter DZSB11, which we took from the table in Figure 17. After we click the Plus (+) icon, the Filters window should look like Figure 19. Click Close.
Figure 19 Query Builder Filters window, filter by Container sKey
Back on the Graph Queries page, click Run. The results, a list of all Relations in Container DZSB11, which is owned by the Syndeia Repository, may be displayed in table form as in Figure 20. Note that all relations within the project appear, not just those with a Confluence artifact at one end.
Figure 20 Graph Queries page, Relations (Edges) results in table format, truncated
The final step is to identify the Confluence Pages that participate in these relations, but this cannot be done in Query Builder alone. Note at the top of Figure 20, the Query Builder utility has created a Gremlin query.
g.E().has('sLabel','Relation').has('container','DZSB11')
We will use the Gremlin query language to append an additional condition. First, we will add an additional traversal step to go to the vertices at the end of the relations. Since we don’t know whether the Confluence Page will have an incoming or outgoing relation in the Syndeia project, we use the bothV() step to cover both ends.
g.E().has('sLabel','Relation').has('container','DZSB11').bothV()
Next, we will check all vertices for ArtifactType. Going back to the table in Figure 5, we choose Confluence Page, ART-TYPE249.
g.E().has('sLabel','Relation').has('container','DZSB11').bothV().has(‘type’,’ART-TYPE249’)
If we select Raw Query and enter this in the Gremlin Query field, we generate the table in Figure 21, showing all Confluence Pages of ART-TYPE249 used in the Syndeia Project DZSB11.
Figure 21 Graph Queries page, Artifacts results in table format, truncated
There are alternate ways to approach the problem. If we wanted to search for Confluence Pages in a specific Confluence Project (CONT738) that were used in a Syndeia Project (DZSB11), we could reformulate the query using the first Page from Step 20 and the second Page from Step 13.
g.E().has('sLabel','Relation').has('container','DZSB11').bothV().where(outE().has('sLabel','ownedBy') .inV().has('sKey','CONT738'))