BI System Effectiveness: Step 3

Methods and results

Step 3

Constructing the prototype of prediction model

The LR analysis was used to model dichotomous outcome variables and forecast relationships between the dependent variable and a set of independent explanatory variables. The LR model constructed a two-way classification system as a substitute for linear discriminant analysis, and avoided the unreasonable assumption that the binary type covariance matrix must be equal. Based on the all 17 input variables of the BISE and enterprise characteristics derived from the measurement instrument of step 2, this step first adopted LR analysis to model the influence and explanatory power of prediction variables.

Analytical results demonstrated that the value of Cox-Snell\(R^2\) was 0.583 and that of Nagelkerke \(R^2\) was 0.808, suggesting that the model had receivable prediction power. Table 5 also indicates that the four predictive performance measures, accuracy, precision, recall, and F1-measure, of the prototype LR model, were 89.61, 78.13, 96.15 and 86.21 %, respectively, leaving the effectiveness of the prototype unproven. Additionally, the two prediction variables were \(X_{15}\) and \(X_{13}\), with the estimated values of 3.703 and 3.408, and \(p\)-values were 0.003 and 0.006 (<0.05), respectively, which indicated good explanatory power. Meanwhile, under the prototype model, the remaining 15 prediction variables had less explanatory power. To improve the prototype model, this study first extracted the critical input variables and structured the association rules, and then constructed the refined LR model to enhance the predictive power and accuracy. The associated calculation work is presented below.

Identifying the critical prediction indicators and association rules

The DT algorithm was one of the methods used in data mining for knowledge discovery, and systematically analyzed the data to identify rules and relations for use in data classification and prediction. This algorithm comprised a supervised learning method for data mining. The classification tree was adopted, and included parameter setting and standards for calculating divergence. Model accuracy was assessed using the actual DT performance to calculate the proportion correctly classified as judgment. This study administered the CART algorithms, and the splitting criteria, impurity measures and Gini criterion Breiman et al. were as described below.

At node \(t\), the optimal split \(s\) was selected to maximize a splitting criterion \(Δi(s,t)\). When the impurity measure (i(t)) for a node was defined, the splitting criterion corresponded to a decrease in impurity. \(ΔI(s,t) = p(t)Δi(s,t)\) was labeled the improvement.

i(t) = \sum\limits_{i,j} {C\left( {i\left| j \right.} \right)p} \left( {i\left| t \right.} \right)p\left( {j\left| t \right.} \right)

\( \Delta i\left( {s,t} \right) = i\left( t \right) - p_{L} i\left( {t_{L} } \right) - p_{R} i\left( {t_{R} } \right) \)

\(p(t)\) The probability of a case in node \(t. p(j|t)\) The probability of a case in class \(j\) given that it falls into node \(t. C(i|j)\) The cost of miss-classifying a class\( j\) case as a class I case. \(C(j|j) = 0\)

Analytical results demonstrated that the structure of DT that obtained the most accurate classification, including the minimum number of cases, was two in the total branching nodes and the maximum DT depth was five hierarchies. The performance measures, accuracy rate, precision rate, recall rate, and F1-measure rate, of the structure of DT were 94.81, 92.31, 92.31 and 92.31 %, respectively (see Table 5). Furthermore, Fig. 2 illustrates the tree structure of the DT algorithm. The details of each terminal node included description of rule paths, categories of belonging, numbers entering the node, and analysis of category purity. The tree structure was such that five paths (namely, association rules) existed from the root node to the leaf nodes.

Fig. 2

Results of tree structure of the DT algorithm

Rule A: If \(X_{12}\)= {1,2,3} and \(X_{15}\) = {1,2,3}, then low BISE. P = 95.20 %.
Rule B: If \(X_{12}\)= {1,2,3} and \(X_{15}\) = {4,5}, then high BISE. P = 85.70 %.
Rule C: If \(X_{12}\)= {4,5} and \(X_{16}\) = {1,2}, then low BISE. P = 80.00 %.
Rule D: If \(X_{12}\)= {4,5}, \(X_{16}\) = {3,4,5} and \(X_1\) = {1,2,3}, then high BISE. P = 66.70 %.
Rule E: If \(X_{12}\)= {4,5}, \(X_{16}\) = {3,4,5} and \(X_1\) = {4,5}, then high BISE. P = 100 %.

\(Xi\) = variable of BISE

The extent of promotion of or support for \(Xi\); very low = 1, low = 2, medium = 3, high = 4, very high = 5

\(P\) = purity of predictive association rule

Analyzing all five prediction rules revealed that the first critical determining attribute and indicators of BISE is the variable \(X_{12}\): promotion of the accumulation of business intelligence is a human competency attribute. Based on rules A and B, the second is variable \(X_{15}\): support for new service development and customer acquisition. Meanwhile, rule C showed that the third is \(X_{16}\): support for service continuity and customer retention. The above two critical determining indicators are embedded in support for specific business processes attributes. Finally, the last is \(X_{1}\): promotion of data availability is a technological attribute, based on analysis of rules D and E.

Developing the refined prediction model

To improve the prototype prediction model constructed based on the first LR analysis in "Constructing the prototype of prediction model" section, this study adopted four critical prediction indicators (variables) of BISE derived from the classification and prediction model by using the DT algorithm to refine the prototype prediction model to enhance improve its predictive power and accuracy.

Analytical results demonstrated that the suggested prediction equation was as follows:

\(p = \dfrac{{e^{f(x)} }}{{1 + e^{f(x)} }} \,\)

\(\ln \left( {\frac{p}{1 - p}} \right) = f(x) = - 3 8. 4 8 2 { } + 2. 8 0 0 * X 1 2 { } + 2. 7 5 9 * X 1 5 { } + 3.862 * X 1 6 { } + 2.477 * X 1 { }\)

The probability (P) in the range 0–1 is used to identify the BISE in the BI implementation, where the value of \(P\) is close to 1 means high BISE and the value of P is close to 0 means low BISE. The four critical prediction indicators included \(X_{1}\) ,\(X_{12}\),\(X_{15}\) and \(X_{16}\) , and achieved the good explanatory power (see Table 4). The variable of \(X_{16}\) : support for service continuity and customer retention had the highest predictive power.

Based on the Hosmer–Lemeshow goodness of fit test, the \(X^2\) value was 6.777 and the p-value was 0.561(>0.05), demonstrating that the model and data were suitable and the model had good overall fitness. The value of Cox-Snell \(R^2\)was 0.621 and that of Nagelkerke \(R^2\) was 0.860, which signified the LR model had good predictive and explanatory capability. The omnibus test \(X^2\) was 74.641 and the p-value was 0.000(<0.05), which means the model was capable of predicting the BISE.

Table 4 Results of LR analysis

Variables	Estimate	Standard error	Wald \(X^2\)	p-values
Constant	−38.482	11.501	11.196	0.001***
\(X_{16}\): support for service continuity and customer retention	3.862	1.617	5.704	0.017**
\(X_{12}\): promotion of the accumulation of business intelligence	2.800	1.158	5.850	0.016**
\(X_{15}\): support for new service development and customer acquisition	2.759	1.208	5.219	0.022**
\(X_{1}\): promotion of data availability	2.477	1.120	4.892	0.027**
Model fit properties	Omnibus test \(x^2\) = 74.864, p-value = 0.000***
	Hosmer–Lemeshow test \(x^2\) = 6.777, p-value = 0.561
	Cox-Snell \(R^2\)=0.622
	Nagelkerke\(R^2\)=0.862

*** \(p < 0.01\); ** \(p < 0.05\)

Additionally, Table 5 list the results of the four predictive performance measures - accuracy, precision, recall, and F1-measure of the refined LR model, and the comparison between actual conditions and test results of the two LR models and the structure of DT. For the refined LR model, the total predictive accuracy, precision, recall, and F1-measure were 94.81, 92.31, 92.31 and 92.31 %, respectively. The results demonstrated that the refined LR model exhibited a better predictive performance in terms of accuracy, precision, recall and F1-measure, than the prototype LR model, and the same predictive performance as the structure of DT. These two models had the same four predictive performance measures because of the small sample sizes; when the analyzed samples small, the probability that the two models are equally accurate is high.

Table 5 Results of predictive performance measures of the LR model and the DT structure

Methods	Groups	Actual condition
Methods	Groups	Low BISE group	High BISE group
LR model (prototype)	Test result
	Low BISE group	25(TP)	7(FP)
	High BISE group	1(FN)	44(TN)
	Accuracy	89.61 %
	Precision	78.13 %
	Recall	96.15 %
	F1-measure	86.21 %
LR model (refined)	Test result
	Low BISE group	24(TP)	2(FP)
	High BISE group	2(FN)	49(TN)
	Accuracy	94.81 %
	Precision	92.31 %
	Recall	92.31 %
	F1-measure	92.31 %
DT structure	Test result
	Low BISE group	24(TP)	2(FP)
	High BISE group	2(FN)	49(TN)
	Accuracy	94.81 %
	Precision	92.31 %
	Recall	92.31 %
	F1-measure	92.31 %

1. TP true positive; FP false positive; FN false negative; TN true negative

A live case

Introduction

AC Company, a financial services firm, was founded in Taiwan in the 1990s. The company mostly provides loans and sale intermediation in the used equipment market. Recently, the company developed and implemented BI systems based on information technology (IT), substantially improving its business agility and operational innovativeness.

BI system architecture

The information system of AC Company includes customer relationship management systems, enterprise resource planning systems (loan management information systems and equipment financing information systems), supply chain management systems, internal auditing systems, administrative information systems, and financial management systems. The company uses data mining techniques, multidimensional database techniques, data warehouse tools, and SQL2008 tools to integrate all of these information systems into its BI systems.

The BI systems of AC Company are IT-driven and function-oriented systems for analyzing data and presenting actionable information to executives and decision-makers to help them make business and management decisions. BI systems include several data processing tools, and support operation and management applications that enable the company effectively to collect data from internal systems that support loan activities and operations, and external sources that are associated with suppliers, customers, government, and related organizations. BI systems also analyze or virtualize obtained data to create reports for executives and decision-makers that provide the results of analyses on which they can act.

BI system effectiveness

The implementation of BI systems has enabled AC Company to transform collected data into useful information and reports that help to improve decision-making and management, and to construct a featured knowledge database that accumulates BI and increases BISE. For example, recently, to cope with the competition in Chinese loan markets, the company adopted a business strategy to develop innovative loan service systems to improve customer service, and to use the aforementioned information systems and BI systems in its subsidiary in mainland China to open up new loan markets.

Executives of the company examined the effectiveness of implemented BI systems using the four key indicators of BISE, which were extracted by logistic regression analysis as follows.

BI systems strongly support (level 4 on the Likert scale) service continuity and customer retention (\(X_{12}\)).
BI systems provide very effectively (level 5 on the Likert scale) promote the accumulation of business intelligence (\(X_{12}\)).
BI systems strongly support (level 4 on the Likert scale) new service development and customer acquisition (\(X_{15}\)).
BI systems provide very effectively (level 5 on the Likert scale) promote data availability (\(X_{1}\)).

Course Introduction

Course Syllabus

Unit 1: Business Intelligence Overview

1.1: What is Business Intelligence?

Business Intelligence

Introduction to Business Intelligence

1.1.1: What Business Intelligence is Not

Frontiers of Business Intelligence and Analytics

Business Intelligence Dashboards

1.1.2: Business Intelligence vs. Competitive Intelligence

What is Competitive Intelligence?

1.1.3: From Systems Engineering to Business Engineering

Information Architecture Analysis

Systems Engineering

Business Engineering

1.2.1: Contemporary Applications

Business Intelligence in ERP

Improving Outcomes with Business Intelligence

How Businesses Use Information

1.2.2: BI Approaches for Each Lifecycle Stage

The Business Cycle

Big Data Analytics in Supply Chain Management

1.2.3: BI for Prediction

Goal-Oriented BI

Big Data Analytics

BI System Effectiveness

Data Mining Analytics for BI and Decision Support

1.3: The Future of BI

Future Trends in Information Systems

Internet Trends

Trends in Information Technology

Technology Trends in the COVID-19 Pandemic

The Future of BI

1.3.1: Adapting Business Models to Globalization and Technology

Global Business Strategies for Responding to Cultural Differences

Internationalization and the Need of Business Model Innovation

1.3.2: Maintaining the Firm-Centric Approach

Designing BI Solutions in the Era of Big Data

1.3.3: Incorporating Data from the Internet of Things (IoT)

The Internet of Things

The Cognitive Internet of Things and Big Data

Data Science in Heavy Industry and the Internet of Things

Causality and Variables

The Internet of Things is Revolutionary

Unit 1 Discussion

Unit 1 Study Resources

Unit 1 Review Video

Study Guide: Unit 1

Unit 1 Assessment

Unit 1 Assessment

Unit 2: BI as Business Support

2.1: Defining the Problem

Choice and Happiness

2.1.1: Framing Internal Client Discussions

Overview of Managerial Decision-Making

2.1.2: Drafting the Terms of Reference (TOR)

Defining the Scope of your Project

Developing Terms of Reference

2.1.3: Negotiating the Project Scope

Scope Planning

Negotiation

2.2: The Art and Science of Decision-Making

Decision-Making in Management

Decision-Making Processes in the Workplace

2.2.1: Thinking about Thinking

Experience vs. Memory

Evidence Logs and Metacognitive Logs

2.2.2: Use Analysis, or "Go with Your Gut"?

Problem Solving, Thinking, and Intelligence

Using a Heuristics Checklist

2.2.3: Decision-Making Approaches

Decision-Making Tools

2.2.4: Structuring Decision-Making Effectively

RAPID Decision-Making

2.3: Using Data to Make Decisions

Business Intelligence Dashboards

2.3.1: Everyday Data

2.3.2: Why Expert Judgement is No Better than Yours

Why You Think You're Right Even if You're Wrong

2.3.3: How Forecasting can Help Decision-Making