# Filtering and Flattening Speckle Data in .NET
TIP
Make sure you read the introduction to .NET before you dive in here! The most important sections you need to cover first are:
As you know, Speckle data is structured according to the conventions of the host application, domain or mental model of the developer. There is no single canonical way in which data is structured!
Generally, working with structured data is a bit more difficult, as you need to parse the "graph", or the tree that describes its structure. Nevertheless, this doesn't need to be so! What if we could access all the data inside a given commit and treat it just as any other list? Well, it's actually super easy!
# Step 1: Let's Flatten The Data
The extension method below flattens any Base
object into its constituent parts: it returns a list of all its sub-Base
s. Simply add this to your project somewhere and you're good to go.
TIP
Once we test this a bit more, we're probably going to add it to our Core SDK - so keep an eye out!
public static class Extensions
{
// Flattens a base object into all its constituent parts.
public static IEnumerable<Base> Flatten(this Base obj)
{
yield return obj;
var props = obj.GetDynamicMemberNames();
foreach (var prop in props)
{
var value = obj[prop];
if (value == null) continue;
if (value is Base b)
{
var nested = b.Flatten();
foreach (var child in nested) yield return child;
}
if (value is IDictionary dict)
{
foreach (var dictValue in dict.Values)
{
if (dictValue is Base lb)
{
foreach (var lbChild in lb.Flatten()) yield return lbChild;
}
}
}
if (value is IEnumerable enumerable)
{
foreach (var listValue in enumerable)
{
if (listValue is Base lb)
{
foreach (var lbChild in lb.Flatten()) yield return lbChild;
}
}
}
}
}
}
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
# Step 2: Let's Query The Data
Now that we have our flattening method in place, what can we do? Well - quite a lot! We can now use the power of LINQ to do complex queries on our dataset. For example, let's assume we want to get all the timber walls from a given building. How should we do that? Easy:
using System;
using System.Collections.Generic;
using System.Collections;
using System.Linq;
using Speckle.Core.Api;
using Speckle.Core.Models;
// Note: some boilerplate code removed.
// Receive a revit commit (note: you will need a local account on speckle.xyz for this to work!)
var data = Helpers.Receive("https://speckle.xyz/streams/0d3cb7cb52/commits/681cdd572c").Result;
var flatData = data.Flatten().ToList();
var timberWalls = flatData.FindAll(obj => obj is Objects.BuiltElements.Revit.RevitWall wall && wall.type == "Wall - Timber Clad");
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
Check out the actual filtered timber walls here, in 3D (opens new window)!
Having fun? Let's try a couple more examples! Here's a query that will return all the windows:
var windows = flatData.FindAll(obj => (string)obj["category"] == "Windows");
2
3
Here are the actual elements in our 3D viewer (opens new window).
For extra fun, let's extract all the rooms (opens new window):
var rooms = flatData.FindAll(obj => obj is Objects.BuiltElements.Room);
2
3
All the levels:
// Note: to get only the unique levels, we need to de-duplicate them.
var levels = flatData.FindAll(obj => obj is Objects.BuiltElements.Level).Cast<Objects.BuiltElements.Level>().GroupBy(level => level.name).Select(g => g.First()).ToList();
2
3
4
For a more complex query, let's try to create a summary of all the elements on each level. Here's how to achieve this with the power of LINQ:
var elementsByLevel = flatData.FindAll(obj => obj["level"] != null).GroupBy(obj => ((Base)obj["level"])["name"]);
foreach(var grouping in elementsByLevel) {
Console.WriteLine($"On level {grouping.Key} there are {grouping.Count()} elements.");
}
2
3
4
5
6
And the output:
On level Level 1 there are 74 elements.
On level Roof Line there are 1 elements.
On level Ceiling there are 4 elements.
On level Level 2 there are 64 elements.
On level Level 1 Living Rm. there are 14 elements.
On level Foundation there are 31 elements.
2
3
4
5
6
# Conclusion: Structured Data vs. Flat Data
Both structured data and flattened data have advantages and disadvanteges. The latter lends itself for ETL worflows and various classification based exercises, whereas the former allows for a better model. Dealing with structured data doesn't mean that we can't flatten it and benefit from all processing ease of flattened data. You can use this as a basis for quite a few automation exercises, such as:
- automatically compiling bills of materials
- checking model quality (ie, why are there two Level 1s,
Level 1
andLevel 1 Living Rm
in that model?) - creating custom schedules
- and more!