Completion requirements
Unit 5: Data Analytics
5a. Explain the difference between describing and analyzing data
- How is data analytics used to study data?
- How do we draw conclusions and support managerial decisions?
Analytics is where information becomes intelligence. It is transformed from disparate data points that can be described in terms of data sets into patterns resulting from the analysis. This is where the real brainwork of the analytic process takes place. The methods are myriad and are highly dependent upon both the available inputs and the requirements for your particular project.
Managing and extracting valuable meaning from big data is not only a science challenge but, more than anything, a leadership challenge. Becoming a big-data-enabled organization requires a culture of empowerment, trust, transparency, and inquiry. These qualities allow analytics to be woven throughout an organization's fabric, which elevates the investment and commitment to analytics. Across the literature, it is acknowledged that big data's managerial and leadership challenges outrank the technical challenges associated with utilizing big data to solve business goals.
Conclusions are based on an analysis of the data. Descriptive data is beneficial for presentation and facilitating conclusions but is not a conclusion itself.
To review, see Data Analysis.
5b. Apply various analytic techniques to various datasets to make analytic estimates
- How do you use analytic techniques to find meaning in data?
- What are some of the more commonly used classes of analytic techniques?
- What are the criteria for selecting the best graphics to display data to a particular audience?
Predictive analytics leverages statistical models and machine learning to enable managers to predict future outcomes with varying degrees of statistical confidence. Descriptive analytics collects historical data from reporting, scorecards, clustering, and various other sources of information, enabling managers to highlight trends and identify them. Decision analytics uses data-driven models and visualizes outcomes of specific organizational behaviors allowing managers to visualize the various results of different strategic approaches. Prescriptive analytics uses optimization and simulation, enabling managers to produce recommended decisions through analytical modeling.
- Prescriptive – What we should do based on the data
- Diagnostic – Identify causal relationships
- Cognitive – Use artificial intelligence and machine learning
- Descriptive – Summarization and aggregation of data
- Predictive – Determine the likelihood of future behavior
The absolute frequency is the number of times a value occurs in the data. The relative frequency is the ratio of the number of times a value occurs in the total number of values. The cumulative relative frequency is the summation of all relative frequencies and adds up to 1 (or 100%).
Analytics has defined stages of development, from descriptive analytics to cognitive analytics. As an organization moves along with analytics development, the benefits of the outcomes and implementation complexity increase.
Many analytic techniques can be applied to the attributes of a particular decision problem space. It is important to understand these techniques and under what circumstances each is most appropriate.
- Clustering - Used to group related attributes in sets that have common characteristics
- Classification - Identifying attributes and assigning them to sets based on their characteristics
- Prediction - Using the past values of an attribute to assign a future value
- Profiling - Searching for attributes that have preselected characteristics
- Smoothing - Taking the average of an attribute over time
When displaying data to an audience, it's important to choose the right choice to help them quickly understand the point being made. Some simple charts that can be used include:
- Line charts for comparing trends, multiple datasets over time, or correlations
- Area charts for comparing change over time from two or more variables
- Column charts for showing frequency distribution and comparing datasets
- Bar charts for ranking datasets or comparing datasets
- Pie charts for comparing datasets as percentages of a whole.
5c. Determine what kinds of scenarios and simulations would be most useful for your business case
- How is scenario construction used in decision-making?
- How are simulation techniques used to model real-world systems?
Simulation of a system is the operation of a model in terms of time or space, which helps analyze the performance of an existing or proposed system. Simulations are similar to scenarios, although today, simulations often take the place of computational models representing some problem to be solved that might be too expensive or dangerous to attempt in the real world. These computer simulations enable analysts to see what happens in a given situation, then ask, "What happens if X is changed?" Simulations are often used to experiment with environmental conditions or to predict behavior, such as consumers in a marketplace when a new competitor is introduced. The simulation process should follow a defined procedure.
Once the outcomes from the decision are captured, it is important to return to determine if they were supported by the analysis. The results can then be utilized to better inform the analysis for future uses.
The following figure illustrates the basic process of simulation. We create a model of the real world that we represent as a "black box." Within this box are all the mathematical details of our model. We then subject our model to various inputs and observe the outputs that result. Assuming that we have created a reasonable model of the real-world situation, the output from our model should be a very realistic approximation of the results that would be obtained in the real world.

To review, see Simulation Approach to Decision-Making.
Unit 5 Vocabulary
This vocabulary list includes the terms that you will need to know to successfully complete the final exam.
- area chart
- bar chart
- classification
- clustering
- cognitive
- diagnostic
- frequency
- frequency distributions
- line chart
- prescriptive
- prediction
- profiling
- scenario
- simulation
- smoothing