DS2 Data Scripts

Click the project name to open the project's Bitbucket repository, which contains documentation and build instructions.

ds2_data_scripts

Scripts and instructions for downloading, processing, and loading patient and concept data into the database used by NaiveBayesPredicate, RandomForestPredicate, and SimplePredicateReducerVisualizer. The scripts also generate WEKA ARFF files for classifier exploration and development, and concept files for the OpenCDS Predicate-Reducer.
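
For readers unfamiliar with the ARFF format that WEKA reads, the following is a minimal Python sketch of how such a file could be written. It is not the project's own generator: the function name `to_arff`, the relation name, and the attributes are all hypothetical, and the sketch does no quoting of names or values that contain spaces or commas.

```python
def to_arff(relation, attributes, rows):
    """Render rows as ARFF text.

    attributes is a list of (name, kind) pairs, where kind is either an
    ARFF type string such as "numeric" or a list of nominal values.
    Names and values are assumed to need no quoting.
    """
    lines = [f"@relation {relation}", ""]
    for name, kind in attributes:
        if isinstance(kind, (list, tuple)):
            # Nominal attributes list their allowed values in braces.
            kind = "{" + ",".join(kind) + "}"
        lines.append(f"@attribute {name} {kind}")
    lines += ["", "@data"]
    for row in rows:
        # One comma-separated line per instance, in attribute order.
        lines.append(",".join(str(value) for value in row))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical attributes and sample values, for illustration only.
    print(to_arff(
        "patients",
        [("has_diabetes", ["yes", "no"]), ("age", "numeric")],
        [("yes", 54), ("no", 37)],
    ))
```

The resulting text can be saved with an `.arff` extension and opened in the WEKA Explorer.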

Database Diagram for DS2 Data Scripts

The DS2 Data Scripts project creates and populates a simple SQLite database with patients, their diagnoses, and statistics based on the diagnosis categories to which the diagnoses belong. The tables and relationships in that database are illustrated below.
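
As a rough illustration of the kind of database described above, the following Python sketch builds an in-memory SQLite database and computes one per-category statistic. Every table name, column name, and sample row here is an assumption made for the example; the project's actual schema is defined in its repository and may differ.

```python
import sqlite3

# Hypothetical schema: the real table and column names may differ.
SCHEMA = """
CREATE TABLE patient (
    patient_id INTEGER PRIMARY KEY
);
CREATE TABLE diagnosis_category (
    category_id INTEGER PRIMARY KEY,
    name        TEXT NOT NULL
);
CREATE TABLE diagnosis (
    diagnosis_id INTEGER PRIMARY KEY,
    patient_id   INTEGER NOT NULL REFERENCES patient(patient_id),
    category_id  INTEGER NOT NULL REFERENCES diagnosis_category(category_id),
    code         TEXT NOT NULL
);
"""


def build_demo_db():
    """Create an in-memory database with a few made-up sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO patient VALUES (?)", [(1,), (2,)])
    conn.executemany(
        "INSERT INTO diagnosis_category VALUES (?, ?)",
        [(10, "diabetes"), (20, "hypertension")],
    )
    conn.executemany(
        "INSERT INTO diagnosis (patient_id, category_id, code) VALUES (?, ?, ?)",
        [(1, 10, "250.00"), (1, 20, "401.9"), (2, 10, "250.02")],
    )
    conn.commit()
    return conn


def patients_per_category(conn):
    """Count distinct patients with at least one diagnosis in each category."""
    rows = conn.execute(
        """
        SELECT c.name, COUNT(DISTINCT d.patient_id)
        FROM diagnosis_category AS c
        JOIN diagnosis AS d ON d.category_id = c.category_id
        GROUP BY c.category_id
        ORDER BY c.name
        """
    ).fetchall()
    return dict(rows)


if __name__ == "__main__":
    print(patients_per_category(build_demo_db()))
```

Counts of this kind are one example of a statistic that a classifier could draw on; the sketch does not attempt to reproduce the statistics the project actually stores.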