版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡介
1、Chapter 2:Data Warehousing,Business Intelligence: A Managerial Perspective on Analytics (3rd Edition),Learning Objectives,Understand the basic definitions and concepts of data warehousesLearn different types of data w
2、arehousing architectures; their comparative advantages and disadvantagesDescribe the processes used in developing and managing data warehousesExplain data warehousing operations,(Continued…),Learning Objectives,Explain
3、 the role of data warehouses in decision supportExplain data integration and the extraction, transformation, and load (ETL) processesDescribe real-time (a.k.a. right-time and/or active) data warehousingUnderstand data
4、 warehouse administration and security issues,Opening Vignette…,Isle of Capri Casinos Is Winning with Enterprise Data WarehouseCompany backgroundProblem descriptionProposed solutionResultsAnswer & discuss the ca
5、se questions.,Questions for the Opening Vignette,Why is it important for Isle to have an EDW?What were the business challenges or opportunities that Isle was facing?What was the process Isle followed to realize EDW? Co
6、mment on the potential challenges Isle might have had going through the process of EDW development.What were the benefits of implementing an EDW at Isle? Can you think of other potential benefits that were not listed in
7、 the case?Why do you think large enterprises like Isle in the gaming industry can succeed without having a capable data warehouse/business intelligence infrastructure?,Main Data Warehousing Topics,DW definitionCharacte
8、ristics of DWData Marts ODS, EDW, MetadataDW FrameworkDW Architecture & ETL ProcessDW DevelopmentDW Issues,What is a Data Warehouse?,A physical repository where relational data are specially organized to provid
9、e enterprise-wide, cleansed data in a standardized format“The data warehouse is a collection of integrated, subject-oriented databases designed to support DSS functions, where each unit of data is non-volatile and rele
10、vant to some moment in time”,A Historical Perspective to Data Warehousing,Characteristics of DWs,Subject orientedIntegratedTime-variant (time series)NonvolatileSummarizedNot normalizedMetadataWeb based, relationa
11、l/multi-dimensional Client/server, real-time/right-time/active …,Data Mart,A departmental small-scale “DW” that stores only limited/relevant data Dependent data mart A subset that is created directly from a data war
12、ehouse Independent data martA small data warehouse designed for a strategic business unit or a department,Other DW Components,Operational data stores (ODS)A type of database often used as an interim area for a data
13、 warehouseOper marts - an operational data mart. Enterprise data warehouse (EDW)A data warehouse for the enterprise. Metadata Data about data. In a data warehouse, metadata describe the contents of a data warehous
14、e and the manner of its acquisition and use,Application Case 2.1,A Better Data Plan: Well-Established TELCOs Leverage Data Warehousing and Analytics to Stay on Top in a Competitive IndustryQuestions for DiscussionWhat
15、are the main challenges for TELCOs?How can data warehousing and data analytics help TELCOs in overcoming their challenges?Why do you think TELCOs are well suited to take full advantage of data analytics?,A Generic DW F
16、ramework,Application Case 2.2,Data Warehousing Helps MultiCare Save More LivesQuestions for DiscussionWhat do you think is the role of data warehousing in healthcare systems?How did MultiCare use data warehousing to
17、improve health outcomes?,DW Architecture,Three-tier architectureData acquisition software (back-end)The data warehouse that contains the data & softwareClient (front-end) software that allows users to access and a
18、nalyze data from the warehouseTwo-tier architectureFirst two tiers in three-tier architecture is combined into one… sometimes there is only one tier?,DW Architectures,3-tier architecture,2-tier architecture,1-tier
19、Architecture?,Data Warehousing Architectures,Issues to consider when deciding which architecture to use:Which database management system (DBMS) should be used? Will parallel processing and/or partitioning be used? W
20、ill data migration tools be used to load the data warehouse?What tools will be used to support data retrieval and analysis?,A Web-Based DW Architecture,Alternative DW Architectures,Alternative DW Architectures,Each arch
21、itecture has advantages and disadvantages!Which architecture is the best?,Ten factors that potentially affect the architecture selection decision,Information interdependence between organizational unitsUpper management
22、’s information needsUrgency of need for a data warehouseNature of end-user tasksConstraints on resources,Strategic view of the data warehouse prior to implementationCompatibility with existing systemsPerceived abili
23、ty of the in-house IT staffTechnical issuesSocial/political factors,Teradata Corp. DW Architecture,Data Integration and the Extraction, Transformation, and Load (ETL) Process,ETL = Extract Transform LoadData integrati
24、on Integration that comprises three major processes: data access, data federation, and change capture. Enterprise application integration (EAI)A technology that provides a vehicle for pushing data from source system
25、s into a data warehouse Enterprise information integration (EII) An evolving tool space that promises real-time data integration from a variety of sources, such as relational or multidimensional databases, Web service
26、s, etc.,Data Integration and the Extraction, Transformation, and Load (ETL) Process,ETL (Extract, Transform, Load),Issues affecting the purchase of an ETL toolData transformation tools are expensiveData transformation
27、tools may have a long learning curveImportant criteria in selecting an ETL toolAbility to read from and write to an unlimited number of data sources/architecturesAutomatic capturing and delivery of metadataA history
28、of conforming to open standardsAn easy-to-use interface for the developer and the functional user,Data Warehouse Development,Data warehouse development approachesInmon Model: EDW approach (top-down) Kimball Model: Dat
29、a mart approach (bottom-up)Which model is best?Table 2.3 provides a comparative analysis between EDW and Data Mart approachOne alternative is the hosted warehouse,Application Case 2.5,Starwood Hotels & Resorts Ma
30、nages Hotel Profitability with Data WarehousingQuestions for DiscussionHow big and complex are the business operations of Starwood Hotels & Resorts?How did Starwood Hotels & Resorts use data warehousing for be
31、tter profitability?What were the challenges, the proposed solution, and the obtained results?,Additional Data Warehouse Considerations Hosted Data Warehouses,Benefits:Requires minimal investment in infrastructureFree
32、s up capacity on in-house systemsFrees up cash flowMakes powerful solutions affordableEnables solutions that provide for growthOffers better quality equipment and softwareProvides faster connections… more in the bo
33、ok,Representation of Data in DW,Dimensional Modeling A retrieval-based system that supports high-volume query accessStar schema The most commonly used and the simplest style of dimensional modelingContain a fact tabl
34、e surrounded by and connected to several dimension tablesSnowflakes schema An extension of star schema where the diagram resembles a snowflake in shape,The ability to organize, present, and analyze data by several dime
35、nsions, such as sales by region, by product, by salesperson, and by time (four dimensions)Multidimensional presentation Dimensions: products, salespeople, market segments, business units, geographical locations, distr
36、ibution channels, country, or industryMeasures: money, sales volume, head count, inventory profit, actual versus forecastTime: daily, weekly, monthly, quarterly, or yearly,Multidimensionality,Star versus Snowflake Sche
37、ma,Analysis of Data in DW,OLTP vs. OLAP…OLTP (online transaction processing)Capturing and storing data from ERP, CRM, POS, …The main focus is on efficiency of routine tasksOLAP (Online analytical processing)Conver
38、ting data into information for decision supportData cubes, drill-down / rollup, slice & dice, …Requesting ad hoc reportsConducting statistical and other analyses Developing multimedia-based applications…more in
39、 the book,OLAP vs. OLTP,OLAP Operations,Slice - a subset of a multidimensional arrayDice - a slice on more than two dimensionsDrill Down/Up - navigating among levels of data ranging from the most summarized (up) to the
40、 most detailed (down)Roll Up - computing all of the data relationships for one or more dimensions Pivot - used to change the dimensional orientation of a report or an ad hoc query-page display,OLAP,Slicing Operations o
41、n a Simple Tree-DimensionalData Cube,Variations of OLAP,Multidimensional OLAP (MOLAP)OLAP implemented via a specialized multidimensional database (or data store) that summarizes transactions into multidimensional view
42、s ahead of time Relational OLAP (ROLAP)The implementation of an OLAP database on top of an existing relational database Database OLAP and Web OLAP (DOLAP and WOLAP); Desktop OLAP,…,Technology Insights 2.2Hands-On DW
43、 with MicroStrategy,A wealth of teaching and learning resources can be found at TUN portalwww.teradatauniversitynetwork.com The available resources include scripted demonstrations, assignments, white papers, etc…,DW
44、Implementation Issues,Identification of data sources and governanceData quality planning, data model designETL tool selectionEstablishment of service-level agreementsData transport, data conversionReconciliation pro
45、cessEnd-user supportPolitical issues… more in the book,Successful DW ImplementationThings to Avoid,Starting with the wrong sponsorship chainSetting expectations that you cannot meetEngaging in politically naive beh
46、aviorLoading the data warehouse with information just because it is availableBelieving that data warehousing database design is the same as transactional database designChoosing a data warehouse manager who is technol
47、ogy oriented rather than user oriented… more in the book,Failure Factors in DW Projects,Lack of executive sponsorshipUnclear business objectivesCultural issues being ignoredChange managementUnrealistic expectations
48、Inappropriate architectureLow data quality / missing informationLoading data just because it is available,Massive DW and Scalability,ScalabilityThe main issues pertaining to scalability:The amount of data in the ware
49、houseHow quickly the warehouse is expected to growThe number of concurrent usersThe complexity of user queries Good scalability means that queries and other data-access functions will grow linearly with the size of t
50、he warehouse,Real-Time/Active DW/BI,Enabling real-time data updates for real-time analysis and real-time decision making is growing rapidlyPush vs. Pull (of data)Concerns about real-time BINot all data should be updat
51、ed continuouslyMismatch of reports generated minutes apartMay be cost prohibitiveMay also be infeasible,Enterprise Decision Evolution and Data Warehousing,Real-Time/Active DW at Teradata,Traditional versus Active DW,D
52、W Administration and Security,Data warehouse administrator (DWA)DWA should…h(huán)ave the knowledge of high-performance software, hardware and networking technologiespossess solid business knowledge and insightbe familiar
53、with the decision-making processes so as to suitably design/maintain the data warehouse structurepossess excellent communications skillsSecurity and privacy is a pressing issue in DWSafeguarding the most valuable asse
54、ts Government regulations (HIPAA, etc.)Must be explicitly planned and executed,The Future of DW,Sourcing…Web, social media, and Big DataOpen source softwareSaaS (software as a service)Cloud computingInfrastructure
55、…ColumnarReal-time DWData warehouse appliancesData management practices/technologiesIn-database & In-memory processing New DBMSAdvanced analytics…,Free of Charge DW Portal for Teaching & Learning,www.Tera
56、dataUniversityNetwork.comPassword to signup: ,,End of the Chapter,Questions, comments,All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by a
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 眾賞文庫僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 基于商務(wù)智能的物流數(shù)據(jù)分析.pdf
- 商務(wù)數(shù)據(jù)分析考核題目
- 商務(wù)智能技術(shù)在高考數(shù)據(jù)分析及志愿填報(bào)中的應(yīng)用.pdf
- 酒店前臺信息系統(tǒng)中數(shù)據(jù)分析和智能商務(wù)的應(yīng)用.pdf
- 商務(wù)智能在TS集團(tuán)直銷數(shù)據(jù)分析中的應(yīng)用研究.pdf
- 《商務(wù)數(shù)據(jù)分析與應(yīng)用》-教學(xué)設(shè)計(jì)
- 面向商務(wù)智能的公共交通數(shù)據(jù)分析系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn).pdf
- 23831.人壽保險(xiǎn)行業(yè)商務(wù)智能數(shù)據(jù)分析系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)
- 商務(wù)數(shù)據(jù)分析模塊樣題及答案
- 淺談數(shù)據(jù)分析在電子商務(wù)中的應(yīng)用
- 基于Hadoop的智能電網(wǎng)數(shù)據(jù)分析平臺.pdf
- 24823.多視角數(shù)據(jù)分析算法研究
- 智能井壓力數(shù)據(jù)分析方法研究.pdf
- 面向社交商務(wù)的大數(shù)據(jù)分析方法研究.pdf
- 《電子商務(wù)數(shù)據(jù)分析與應(yīng)用》教學(xué)教案
- 【數(shù)據(jù)分析】店鋪經(jīng)營數(shù)據(jù)分析表
- 商業(yè)智能在電信數(shù)據(jù)分析中的應(yīng)用.pdf
- 智能電網(wǎng)電流數(shù)據(jù)分析儀的設(shè)計(jì).pdf
- 電子商務(wù)數(shù)據(jù)分析平臺的設(shè)計(jì)與實(shí)現(xiàn).pdf
- 服裝銷售數(shù)據(jù)分析與管理
評論
0/150
提交評論