Using a data analysis matrix

Here is an example I developed and used in 2015, when helping a UK consulting firm plan a data mining exercise using a data set that had 60+ cases and more than 70 potentially useful attributes. In this matrix…

  • Each blue column represents a grouping of a specific kind of case attribute. At the analysis stage any one of these could be used as an outcome in an EvalC3 data set
  • Each blue row represents a grouping of a specific kind of case attribute. At the analysis stage any one of these could be used as attribute which might be predictive of the outcome of interest in an EvalC3 data set
  • Cells represent relationships between specific types of attributes (rows) and specific type sof outcomes (columns) .
    • Colored (grey and yellow) cells represent those relationships that were of interest and which would be analysed
      • Initials in these cells represent the stakeholders with specific interest in this relationship
  • The cell values in the summary column on the right represent the level of  confidence in that row type of case attribute
  • The cell values in the summary row at the bottom represent the level of interest in the potential outcomes of interest represented by each column

screenshot-2017-01-09-15-43-42

The analysis that was carried out focused on the 23 colored in cells.