Question Doc

Submitted by: Submitted by

Views: 89

Words: 263

Pages: 2

Category: Business and Industry

Date Submitted: 04/05/2014 11:15 AM

Report This Essay

4The question(s) that you are interested in answering (what you want to answer). 

1) Identify and categorize the major causes of death (health related) in the USA.

The data source that you will use (be specific), and how you plan on obtaining data.

2) The data source that we plan to use is from health category of the website, data.gov. 

There are a set of zipped csv files from Community Health Status Indicators (CHSI) that contains over 200 health measures from each of the 3141 United States counties. This dataset was created in Aug 26, 2013 and was updated in Feb 28, 2014.

Link:

http://catalog.data.gov/dataset/community-health-status-indicators-chsi-to-combat-obesity-heart-disease-and-cancer

What analysis or visualization you plan to do in your data project. 

3) Based on the information present in the files, we are looking at creating a heat map and map chart of places in the US that have the most number of deaths. At the same time a cross table will help to categorize the population based on age and race and will to help identify deaths in each category across years.

What kind of findings do you expect to see. 

4) We look at finding out the vulnerable populations and what reasons might relate to their health issues/early deaths. We can also see if these populations have any restrictions to health care and other health related risk factors. Occupational hazards can also be an indication of a certain type of health issue for a particular population.