Data Analysis Plus Excel 2016

There are a variety of methods that may be utilized to analyze data. Many statistical packages are available, including Microsoft Excel, which is free and can often be used for simple, efficient analysis.

Subsequently, you learn how to execute these theories in Excel. The tools of data analytics which you will learn include for instance: Correlation. Single OLS regression. Multiple OLS regression. Non linear regression. At the end of this course, you obtain a certification of completion. Note: We use Excel 2016, but other versions are fine too. Data Analysis Plus v9.0 (with VBA 6) Microsoft Excel 97 - 2016 on Windows OS Office 2001 for Mac OS Office 2004 for Mac OS.NOTE: Help file (.CHM) is a stand-alone reference and will not launch from within Excel.NOTE: With Excel 2013 Or Excel 2016, the 'Data Analysis' macros must be disabled to run the 'Data Analysis Plus' macros.

Using the table below as an example, several methods of data analysis in Excel will be examined, including the sort function and the Pivot Table. The sort function is best used for relatively small databases, while the Pivot Table is helpful for analyzing larger datasets and quickly grouping items. Utilizing the sort function on the data set below, it is possible to count the number of people with allergies and determine how many of them are male or female. The data is sorted first by diagnosis and then by gender.

Data Analysis Example

Hints for Analyzing Data

Before using the sort function or Pivot Tables, the data must be “cleaned.” This means that the first step in data analysis is to go through the data and ensure that the style of data entry is consistent within columns. In this case, for diagnoses, it is important to make sure that only one word, phrase, or abbreviation is used to describe each diagnosis. If multiple words are used to describe the same thing— for example, if “allergy” is written in the database as “allerg,” “allergy,” and “allergies” — the analysis will be more difficult, so it is best to choose one term and use it consistently. It may be necessary to change the terminology used in the dataset in order to be consistent throughout; such changes ought to be made at this preliminary stage. If there are multiple diagnoses for a single subject, it is important to list the diagnoses separately. It may be necessary to create additional columns labeled “Diagnosis 2” and “Diagnosis 3,” listing one diagnosis in each column. (If this is the case, it is possible to create multiple Pivot Tables and manually add the results together.)

Using the Sort Function in Excel

Using Excel 2016 for Windows, first select the data (“Control-A” selects all). On the top of the Excel tool bar, choose the “Data” tab. Then, click the sort function (circled below in blue). In the window that pops up, click “Sort by ‘Diagnosis.’” To sort again by gender, click the button in the upper-left corner of the window that says “Add Level.” Then, click “Gender” and the “OK” button. (See picture below.)

Sorting is a great tool to identify trends and to analyze small amounts of data. In the example above, once the data are sorted by diagnosis and then by gender, simply count the number of people with each diagnosis and record the gender breakdown, either manually or using the Excel “COUNTIFS” formula. To use the formula to count the number of females with an allergy diagnosis, select an empty cell and type “=COUNTIFS” followed by the range and criteria. For this example, the first range is D2:D23 and the first criteria is “Allergy.” The second range is C2:C23 and the second criteria is “F.” See the picture below for the proper formulaic notation.


Press “Enter,” and the number of individuals who have an allergy diagnosis and are female is revealed in the cell.


This formula is an excellent way to count specific data if it is too time consuming to count manually. Simply alter the range and criteria in the formula to examine different subgroups.

Using Pivot Tables in Excel

With large data sets, manually counting or using a formula to count can be tedious and create opportunities for error. Pivot Tables will automatically sort data and list values, producing efficient and accurate information. To create a Pivot Table, select the data, click on the “Insert” tab, and then select “Pivot Table.” (For Macs, click on the “Data” tab, followed by “Pivot Table.”)


The Pivot Table will open in a new sheet of the Excel file. The next step is to add values. On the right side, there is a box that says “Choose fields to add to report.” To first sort by diagnosis, drag the “Diagnosis” label (the one with the checkbox next to it) into the “Rows” box. It will look like this:

To next determine how many people had each diagnosis, drag “Diagnosis” (the one with the checkbox next to it) to the box with the heading “Values.” It should look like this:

To sort by gender, drag “Gender” to the box with the heading “Rows,” and Excel provides an automatic breakdown, which can be used to calculate percentages and to create graphs.

Alternatively, to sort first by gender, and then by diagnosis, switch the order of “Gender” and “Diagnosis” in the “Rows” box.

This is just one example of how Pivot Tables can be used. Fields can be added or removed as necessary. It may be helpful to practice dragging different fields to different categories in order to develop an understanding of how Pivot Tables work.

Creating a Data Display

Once the data are analyzed, it is often useful to create a display so that others can quickly and easily understand the results. One way to do this is to create a chart using Excel. First, create another table to more easily show the breakdown of number of males and females with a certain diagnosis. Do this by dragging the “Gender” field from the “Rows” category to “Columns.”

Next, click “PivotChart” under the “Analyze” tab, and select the option “Stacked Column.” This shows the number of males and females with each diagnosis, stacked on top of each other.

After the chart has been created, click on the green plus sign in the upper right corner of the chart to add chart and axis titles, add data labels, format the color scheme, and hide the field settings.

Data analysis plus excel 2016 download

Once the chart is customized, it clearly displays important trends in the data. For example, from this chart, one can quickly see that no females were diagnosed with conjunctivitis or presbyopia.


Data Analysis Plus Excel 2016 Free

The Importance of Reporting All Results

When analyzing data, it is critical to report all results, even if they seem insignificant. It is also essential to not lump data analyses together and make generalizations. For example, a researcher conducting a study on the effectiveness of a visual aid to increase knowledge of cataracts administers a 10-question survey to patients before and after showing them the visual aid. The researcher finds that the visual aid increases the overall number of questions answered correctly. This is a good start, but it is not enough. It is critical that the researcher analyze the results of each individual question. Just knowing that the intervention increases overall knowledge provides little information about the strengths and weaknesses of the intervention. Perhaps the intervention caused a significant increase in the number of people understanding what a cataract is, but not the number of people understanding proper post-operative procedures. This is important to know because the intervention can then be modified to better convey the necessary information.

The Excel Data Analysis toolpak should be enabled by default on every lab computer and computer available for checkout from the library. However, someone may have gone through and disabled the Toolpak for whatever reason, or the machine may have been overlooked by the computer tech staff (accidents happen; we're only human). If someone has gone through and manually disabled the Toolpak, you can follow the instructions below to re-enable it, or you can restart the machine you're working on. Restarting a computer in the computer labs, an email station, or one of the computers available for checkout from the library will restore that computer's default settings (which includes having the Data Analysis Toolpake enabled).

Table of Contents

Microsoft includes the Data Analysis Toolpak with every modern version of Excel for Windows, except for the version of Excel bundled with Windows RT.[1] For all other modern versions of Office for Windows, see the appropriate section below. If you encounter a Data Analysis Toolpak in a different language (e.g. German, French, Spanish), contact the library via the Request Tracker system. (See this article on our knowledge base for instructions on how to use the Request Tracker system to alert the library to an issue.)

Enabling the Data Analysis Toolpak in Excel 2007

  1. Open Excel.
  2. Click on the Office menu orb in the upper left hand corner of the application.
  3. At the bottom of the menu that pops up, there's an Excel Options button. Click that.
  4. The Excel Options box opens up on the Popular tab. Click on the Add-Ins tab (3rd from the bottom).
  5. At the bottom of the window, there's a Go... button. Click on it.
  6. Check the box next to Analysis Toolpak list item. Click OK.

And you're done. You can access the Data Analysis toolpak under the Data tab of Excel's Ribbon menu bar.

Enabling the Data Analysis Toolpak in Excel 2010

  1. Open Excel.
  2. Click on the File tab of the Excel Ribbon menu bar.
  3. Click on Options in the left column of the menu.
  4. In the Excel Options box that opens up, click the Add-Ins tab.
  5. At the bottom of the window, there's a Go... button. Click on it.
  6. Check the box next to Analysis Toolpak list item. Click OK.

And you're done. You can access the Data Analysis toolpak under the Data tab of Excel's Ribbon menu bar.

Enabling the Data Analysis Toolpak in Excel 2013

  1. Open Excel.
  2. Click on the File tab of the Excel Ribbon menu bar.
  3. At the bottom of the menu that pops up, click on Options.
  4. In the Excel Options box that opens up, click the Add-Ins tab.
  5. At the bottom of the window, there's a Go... button. Click on it.
  6. Check the box next to Analysis Toolpak list item. Click OK.

And you're done. You can access the Data Analysis toolpak under the Data tab of Excel's Ribbon menu bar.

Enabling the Data Analysis Toolpak in Excel for Mac

Microsoft decided to stop making a Data Analysis Toolpak available for Microsoft Office for Mac starting with Office 2008 (released in 2007). Microsoft has reported in its support documentation that Office 2016 for Mac (unreleased at the time of this writing, July 20th, 2015) will once again include a Data Analysis Toolpak.

For versions of Office for Mac prior to Office 2016, Microsoft states that the Data Analysis Toolpak is not included, and You must install third-party Data Analysis tools, such as StatPlus:mac LE.[2]

Enabling the Data Analysis Toolpak in Excel 2016 for Mac

The following instructions are based on Microsoft's published support documentation as of July 7, 2015, and apply only to Office 2016 for Mac. Microsoft may change their documentation at any time without warning. Go to the support documentation for up to date instructions.

  1. Open Excel.
  2. Tools menu > Add-Ins....
  3. Click the Data Analysis Toolpak option to enable it. Click OK.

And you're done. You can access the Data Analysis toolpak under the Data tab of Excel's Ribbon menu bar.

Resources & Links

Footnotes

[1] (n.d.) Load the Analysis ToolPak in Excel 2013 ⇗ Retrieved July 20, 2015.

[2] (2015, July 9) How to find and install Data Analysis ToolPak or Solver for Excel for Mac ⇗ Retrieved July 20, 2015.

Links

Data Analysis Plus Excel 2016

Tags: Microsoft Office, software