For now I’ll put aside the data privacy concerns and algorithmic opportunities this raises and instead concentrate on the shift this brings in understanding ‘my data’, in relation to the group’s data.
This was an amazing ‘signal’ in the data, but deeply individual and specific to the person. It displayed a recognized indicator for pregnancy, but one that is also an indicator for many other things. If the data had been part of a larger aggregated group it could have been seen as noise (especially if the additional pregnancy insight was unknown). If the data had been anonymous and maybe even had gender identifiers removed, then what? Would similar occurrences have skewed the group?
So how does my data fit with your data, and should it?
For years, IT departments resisted people using their own devices on corporate networks - but not anymore. Many of us now work in organizations that encourage us to "bring your own device". The same shift is happening with data. We are beginning to hear more and more about "bring your own data". In the same way as familiarity and expertise with a device improves productivity so it can with data and analytics. Of course the trick is balancing what must be governed and 'official' with what can be added to and augmented. Context, granularity and focus are key to knowing what the signal may mean at the level you are viewing the data at. For many of us navigating the depth and breadth of the data available will be the core skill for identifying those important signals.
This is the dizzying frontier the healthcare industry is facing, working from the extremely close up and granular view of the individual to the pulled back view of an entire populous. As electronic health records (EHR) become more sophisticated and connected, the more the view of the individual changes. The cadence changes and the EHR shifts from a historical document of record to a near real-time pulse. It becomes a useful diagnostics tool that we can augment with personal data supplied by the sensors an individual wears. That’s a lot of data, that’s a lot of messy, complex and sometimes unreliable data. But that is a rich and fertile ground for finding new signals and insights about groups and individuals. These layers; from official to informal, from individual to group, from clean to dirty, from exact to fuzzy, from past to present form the data-scape we are beginning to inhabit. Choosing the right frame for the data we have access to will be the key to drawing meaning from it.
Image by Lenilucho [CC BY-SA 3.0 (http://creativecommons.org/licenses/by-sa/3.0)], via Wikimedia Commons