About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
AMIA Annual Symposium 2021
Poster
Automatic Stratification of Tabular Health Data
Abstract
Stratifying an outcome of interest across sub-groups is a ubiquitous technique for better understanding tabular data. This work efficiently scales stratification across multiple features simultaneously to identify the strata with the most unexpectedly high (or low) outcomes. We identified an anomalous sub-group of neonatal mortality outcomes in a large global health study. Scanning over subsets of data is an alternative to fitting regression models or interpreting machine learning prediction models.