Section 3. Graphing any CSV - Hypergraphs and Your Data



We can transform any CSV-like file into an insightful graph using a hypergraph transformation, wherein we use the structure of the CSV table to create a graph of nodes, edges and properties. The intuition is that every unique value in the datatable is turned into a node and every row (e.g., the event or sample) is also turned into a node, and connected to its value nodes. The resulting graph, when clustered, reveals the relationships between rows and cell values. In this notebook example, we'll be uploading and analyzing a malware file report .

Using the hypergraph transformation on the list of samples reveals phenomena such as:

  • The malware files cluster into several families
  • Several malware families have attributes connecting them, suggesting they had the same authors
  • The nodes central to a cluster reveal attributes specific to a strain of malware
  • The nodes bordering a cluster reveal attributes that show up in a strain, but are unique to each instance in that strain



Example Notebook:





Next: In tool tutorials