Thursday, 7 June 2012

Chapter 1. Business Scenario and SAP BW


The objective of data warehousing is to analyze data from diverse sources to support decision making. To achieve this goal, we face two challenges:


  • Poor system performance. A data warehouse usually contains a large volume of data. It is not an easy job to retrieve data quickly from the data warehouse for analysis purposes. For this reason, the data warehouse design uses a special technique called a star schema.
  • Difficulties in extracting, transferring, transforming, and loading (ETTL) data from diverse sources into a data warehouse. Data must be cleansed before being used. ETTL has been frequently cited as being responsible for the failures of many data warehousing projects. You would feel the pain if you had ever tried to analyze SAP R/3 data without using SAP BW.

SAP R/3 is an ERP (Enterprise Resources Planning) system that most large companies in the world use to manage their business transactions. Before the introduction of SAP BW in 1997, ETTL of SAP R/3 data into a data warehouse seemed an unthinkable task. This macro-environment explained the urgency with which SAP R/3 customers sought a data warehousing solution. The result is SAP BW from SAP, the developer of SAP R/3.

In this chapter we will introduce the basic concept of data warehousing. We will also discuss what SAP BW (Business Information Warehouse) is, explain why we need it, examine its architecture, and define Business Content.

First, we use sales analysis as an example to introduce the basic concept of data warehousing.

1.1. Sales Analysis—A Business Scenario

Suppose that you are a sales manager, who is responsible for planning and implementing sales strategy. Your tasks include the following:
  • Monitoring and forecasting sales demands and pricing trends
  • Managing sales objectives and coordinating the sales force and distributors
  • Reviewing the sales activities of each representative, office, and region
Suppose also that you have the data in Tables 1.1 through 1.3 available about your firm's materials, customers, and sales organization.
Table 1.1. MATERIALS
Material NumberMaterial NameMaterial Description
MAT001TEAIce tea
MAT002COFFEEHot coffee
MAT003COOKIEFortune cookie
MAT004DESKComputer desk
MAT005TABLEDining table
MAT006CHAIRLeather chair
MAT007BENCHWood bench
MAT008PENBlack pen
MAT009PAPERWhite paper
MAT010CORNAmerica corn
MAT011RICEAsia rice
MAT012APPLENew York apple
MAT013GRAPEFRUITFlorida grapefruit
MAT014PEACHWashington peach
MAT015ORANGECalifornia orange

Table 1.2. CUSTOMERS
Customer IDCustomer NameCustomer Address
CUST001Reliable Transportation Company1 Transport Drive, Atlanta, GA 23002
CUST002Finance One Corp2 Finance Avenue, New York, NY, 10001
CUST003Cool Book Publishers3 Book Street, Boston, MA 02110
CUST004However Forever Energy, Inc.4 Energy Park, Houston, TX 35004
CUST005Easy Computing Company5 Computer Way, Dallas, TX 36543
CUST006United Suppliers, Inc.6 Suppliers Street, Chicago, IL 61114
CUST007Mobile Communications, Inc.7 Electronics District, Chicago, IL 62643
CUST008Sports Motor Company8 Motor Drive, Detroit, MI 55953
CUST009Swan Stores9 Riverside Road, Denver, CO 45692
CUST010Hollywood Studio10 Media Drive, Los Angeles, CA 78543
CUST011One Source Technologies, Inc.11 Technology Way, San Francisco, CA 73285
CUST012Airspace Industries, Inc.12 Air Lane, Seattle, WA 83476

Table 1.3. SALES ORGANIZATION
Sales RegionSales OfficeSales RepresentativeSales Representative ID
EASTATLANTAJohnSREP01
NEW YORKSteveSREP02
MarySREP03
MIDWESTDALLASMichaelSREP04
LisaSREP05
CHICAGOKevinSREP06
ChrisSREP07
WESTDENVER[*]SamSREP08
LOS ANGELESEugeneSREP09
SEATTLEMarkSREP10

[*] Prior to January 1, 2000, the Denver office was in the Midwest region.
You also have three years of sales data, as shown in Table 1.4.
Table 1.4. SALES DATA
Customer IDSales Representative IDMaterial NumberPer Unit Sales PriceUnit of MeasureQuantity SoldTransaction Date
CUST001SREP01MAT0012Case119980304
CUST002SREP02MAT0022Case219990526
CUST002SREP02MAT0035Case319990730
CUST003SREP03MAT0035Case420000101
CUST004SREP04MAT00450Each519991023
CUST004SREP04MAT005100Each619980904
CUST004SREP04MAT005100Each719980529
CUST005SREP05MAT006200Each819991108
CUST006SREP06MAT00720Each920000408
CUST007SREP07MAT0083Dozen1020000901
CUST007SREP07MAT0083Dozen119990424
CUST008SREP08MAT0083Dozen219980328
CUST008SREP08MAT0092Case319980203
CUST008SREP08MAT0101U.S. pound419991104
CUST009SREP09MAT0111.5U.S. pound520000407
CUST010SREP10MAT0111.5U.S. pound620000701
CUST010SREP10MAT0111.5U.S. pound719990924
CUST010SREP10MAT0122U.S. pound819991224
CUST010SREP10MAT0133Case920000308
CUST011SREP10MAT0141U.S. pound1019980627
CUST012SREP11MAT0142U.S. pound119991209
CUST012SREP11MAT0153Case219980221
CUST012SREP11MAT0152Case320000705
CUST012SREP11MAT0153.5Case420001225

The data in these tables represent a simplified business scenario. In the real world, you might have years of data and millions of records.
To succeed in the face of fierce market competition, you need to have a complete and up-to-date picture of your business and your business environment. The challenge lies in making the best use of data in decision support. In decision support, you need to perform many kinds of analysis.
This type of online analytical processing (OLAP) consumes a lot of computer resources because of the size of data. It cannot be carried out on anonline transaction processing (OLTP) system, such as a sales management system. Instead, we need a dedicated system, which is the data warehouse.