Data Mining/ Schema



Category: Science and Technology

Date Submitted: 12/09/2014 10:14 PM


CSE 601: Data Mining and Bioinformatics Project 1

Biomedical Data Warehouse OLAP System

AISHWARYA BALACHANDER – 5009 6795
MIHIR SHAH – 5009 7197
PRATEEK SOLANKI – 5009 6362
SINDUJA JAGADEESAN – 5009 6174


Table of Contents

Introduction
Biomedical Data Spaces
Part I: Implementation of Data Warehouse Schema
Part II: OLAP operations and Queries
  Query 1 and its implementation
  Query 2 and its implementation
  Query 3 and its implementation
  Query 4 and its implementation
  Query 5 and its implementation
  Query 6 and its implementation
Part III: OLAP operations to support Knowledge Discovery
  Query 1 and its implementation
  Query 2 and its implementation
Conclusion
References


Introduction

In this project, we were asked to implement a clinical and genomic data warehouse in MySQL, based on the BioStar schema design. The data warehouse must meet the following requirements:
1. Support regular and statistical OLAP operations.
2. Be robust to potential future changes.
3. Support knowledge discovery.

Biomedical Data Spaces

Because biomedical data is diverse and complex, our data warehouse design includes several modelling data spaces:
• Clinical data space
• Sample data space
• Microarray data space
• Proteomic data space
• Experiment data space
• Gene data space
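As a rough illustration of how data spaces like these can be linked and queried in a relational warehouse, the following minimal sketch builds three tiny tables and runs one roll-up style aggregation. Everything in it is an assumption made for illustration: the table names (patient, sample, microarray_fact), the column names, the function rollup_by_disease and the sample rows are hypothetical and are not taken from the BioStar schema, and SQLite (through Python's sqlite3 module) stands in for MySQL so that the example runs without a database server.

```python
import sqlite3

# Hypothetical, heavily simplified tables; NOT the actual BioStar schema.
# SQLite is used here only as a stand-in for MySQL.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE patient (              -- stands in for the clinical data space
    p_id    INTEGER PRIMARY KEY,
    disease TEXT NOT NULL
);
CREATE TABLE sample (               -- stands in for the sample data space
    s_id INTEGER PRIMARY KEY,
    p_id INTEGER NOT NULL REFERENCES patient(p_id)
);
CREATE TABLE microarray_fact (      -- stands in for the microarray data space
    s_id       INTEGER NOT NULL REFERENCES sample(s_id),
    gene_id    INTEGER NOT NULL,
    expression REAL NOT NULL
);
""")

# Made-up rows, purely to have something to aggregate.
conn.executemany("INSERT INTO patient VALUES (?, ?)",
                 [(1, "ALL"), (2, "ALL"), (3, "AML")])
conn.executemany("INSERT INTO sample VALUES (?, ?)",
                 [(10, 1), (11, 2), (12, 3)])
conn.executemany("INSERT INTO microarray_fact VALUES (?, ?, ?)",
                 [(10, 100, 2.0), (11, 100, 4.0), (12, 100, 10.0)])


def rollup_by_disease(connection):
    """Roll up expression measurements from the sample level to the disease level."""
    query = """
        SELECT p.disease,
               COUNT(DISTINCT p.p_id) AS patients,
               AVG(f.expression)      AS mean_expression
        FROM microarray_fact AS f
        JOIN sample  AS s ON s.s_id = f.s_id
        JOIN patient AS p ON p.p_id = s.p_id
        GROUP BY p.disease
        ORDER BY p.disease
    """
    return connection.execute(query).fetchall()


if __name__ == "__main__":
    for row in rollup_by_disease(conn):
        print(row)
```

The fact table holds only measurements and keys, while descriptive attributes such as the disease name live in the tables it joins to. Under that assumption, adding a new grouping attribute means changing a descriptive table rather than the measurements, which is one common way a design of this kind stays robust to later changes.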

Part I:

Implementation of Data Warehouse schema using MySQL...