DS2 Predicate APIs
The DS2 approach to classifier development, evaluation, and integration involves three steps:
- Train, evaluate, and select candidate classifiers based on the actual presence or absence of the target condition in test data, using WEKA – a widely-used, general purpose data mining tool.
- Experiment with candidate classifiers in the Inference Analyzer – a visual environment custom-developed as part of the DS2 project to present individual patient records and show the results of reducers derived from the classifier-based predicates.
- Plug classifiers into OpenCDS and the larger Predicate/Reducer architecture in order to use them to help redact conditions from CCDs.
We designed two Application Programming Interfaces (APIs) to connect the classifiers developed in step 1 to the Inference Analyzer and OpenCDS Predicate-Reducer in steps 2 and 3:
- SimpleProbabilisticPredicate – For classifiers that work on one section of the medical record at-a-time, this API passes a simple one-dimensional list of clinical facts, such as a list of problem diagnoses or a list of medications, to the classifier.
- ProbabilisticPredicate – For classifiers that work on the entire patient record, this API passes a vMR object, containing all components of the patient’s medical record, to the classifier.