Introduction To Data Warehousing & Business Intelligence .

3y ago
32 Views
5 Downloads
1.10 MB
73 Pages
Last View : 1m ago
Last Download : 3m ago
Upload by : Jerry Bolanos
Transcription

Introduction to Data Warehousing & Business Intelligence SystemsIntroduction to Data Warehousing &Business Intelligence SystemsStudent GuideIntroduction to Agile Methods by Evan Leybourn is licensed under aCreative Commons Attribution-ShareAlike 3.0 Australia License http://creativecommons.org/licenses/by-sa/3.0/au/ Evan Leybournevan@theagiledirector.comTwitter: @eleybournIntroduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 1 of 73

Introduction to Data Warehousing & Business Intelligence SystemsOTHER WORKS BYEVAN LEYBOURNDIRECTING THE AGILE ORGANISATION –BY EVAN LEYBOURNhttp://theagiledirector.com/book Embrace change and steal a march onyour competitors Discover the exciting adaptiveapproach to management Become the Agile champion for yourorganisationBusiness systems do not always end up the way that we first plan them. Requirements canchange to accommodate a new strategy, a new target or a new competitor. In thesecircumstances, conventional business management methods often struggle and a differentapproach is required.Agile business management is a series of concepts and processes for the day-to-daymanagement of an organisation. As an Agile manager, you need to understand, embodyand encourage these concepts. By embracing and shaping change within your organisationyou can take advantage of new opportunities and outperform your competition.Using a combination of first-hand research and in-depth case studies, Directing the AgileOrganisation offers a fresh approach to business management; applying Agile processespioneered In the IT and manufacturing industries.Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 2 of 73

Introduction to Data Warehousing & Business Intelligence SystemsTABLE OF CONTENTSOther Works by Evan Leybourn . 2Directing the Agile Organisation – by Evan Leybourn . 2Table of Contents. 3Introduction . 6A Note About The Associated Slides . 7After Completing This Course . 7Business Intelligence Principles . 8Creating Information from Data. 9Business Intelligence Phases . 9Centralised Repository . 10Combining Multiple Sources . 10Owning Your Data . 11Exercise (Your Environment) . 12Information Management . 13Garbage In, Garbage Out (GIGO) . 14Exercise (GIGO) . 15Information Management . 16Data Acquisition . 17Exercise (Data Acquisition). 18Data Reduction. 19Reverse Engineering . 19Data Analysis . 20Identifying Potential Data Sources . 21Exercise (Data Sources) . 22Creating Business Cases for Data Sources . 23Exercise (Business Case). 24Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 3 of 73

Introduction to Data Warehousing & Business Intelligence SystemsTypes of Sources . 25Databases . 26Documents . 27Websites . 27Research . 29Log Files. 29Corporate Emails . 29Accessing the Data Sources. 31Source Type . 31Connection . 31Query . 31Building a Data Dictionary . 31Exercise (Data Dictionary) . 32A Business Intelligence System is Never Complete . 33Long Term benefits. 33Data Warehouse Design . 34The Consolidation Database . 35Data Marts . 35Exercise (Data Sources) . 36DBMS Choices . 37A Data Warehouse Table . 37Star Schema. 39Snowflake Schema . 40Exercise (Schema) . 41Historical Data . 42Database Triggers . 42Indexes. 42Extraction, Transformation and Load . 44Data Validation . 45Data Integrity . 46Data Standardisation . 47Exercise (Validation). 48Extraction . 49Transformation . 50Adding Data . 51Deleting Data. 52Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 4 of 73

Introduction to Data Warehousing & Business Intelligence SystemsModifying Data . 53A Quick Overview of Regular Expressions (REGEX) . 54Splitting Data . 57Joining Data . 58Dropping Rows . 59Exercise (Transformation) . 60Load . 61Resolving Errors . 62Historical Extraction . 62Reporting . 63Writing Good Reports . 64Exercise (Reports) . 66Data Access Control . 67Organisational Status and Dashboarding . 68Exercise (Dashboard) . 69Scheduled Reporting . 70Narrowing your Results . 70Mashups . 70Web Services . 71References . 73Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 5 of 73

Introduction to Data Warehousing & Business Intelligence SystemsINTRODUCTION“Computers are getting smarter all the time. Scientists tell us that soon they will beable to talk to us. (And by ‘they’, I mean ‘computers’. I doubt scientists will ever beable to talk to us.)”- Dave BarryNotes:Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 6 of 73

Introduction to Data Warehousing & Business Intelligence SystemsA NOTE ABOUT THE ASSOCIATED SLIDESThe presentation material for this course (also released under a Creative Commons BY-SAlicense) was created in Prezi.You can locate the presentation here: TER COMPLETING THIS COURSEAfter completing this course, you should be able to do the following. Identify any organisational requirement for a Data Warehouse or BusinessIntelligence application.Understand how to improve an organisation's data and information.Understand what is involved in the creation and ongoing administration of aneffective Business Intelligence system.Identify and analyse potential data sources inside and outside an organisation andhow to use that data to improve the business intelligence of an organisation.Design a reporting plan that suits an organisational environment and improveinformation management.Use a Business Intelligence system to improve data integrity and quality.Create a database schema suitable for a Business Intelligence application.Notes:Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 7 of 73

Introduction to Data Warehousing & Business Intelligence SystemsBUSINESS INTELLIGENCE PRINCIPLES“Computers are useless. They can only give you answers.”- Pablo PicassoNotes:Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 8 of 73

Introduction to Data Warehousing & Business Intelligence SystemsCREATING INFORMATION FROM DATAThe first step in any Business Intelligence project is to identify the data requirements of anorganisation. There are two areas that need to be covered.1. What data they have, and2. What information they need.For the sake of clarity, the terms data and information convey different meanings.1. Data is the raw output of any database, website, log files or other data source.2. Information is the processed and refined version of the data for human usage.BUSINESS INTELLIGENCE PHASESDesigning a Business Intelligence system can be a complicated and time-consumingprocess. There are many factors, both technical and organisational, to consider and it is notalways possible to resolve all of these in a timely manner.There are four major areas to designing and creating a Business Intelligence system;1.2.3.4.Analysis and Reverse EngineeringDesign of the Consolidation DB and Data MartsExtraction and Transformation of the dataBusiness level reporting on the dataBusiness Intelligence development can benefit from an iterative approach to development.Once you have identified your data sources, focus on a single data source and take itthrough all four areas of development.Using an iterative approach you can have a limited production Business Intelligence systemup and running in a short period of time, which users can be trained on and issues can beresolved.Notes:Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 9 of 73

Introduction to Data Warehousing & Business Intelligence SystemsCENTRALISED REPOSITORYAll data within a Business Intelligence system is stored in a central repository called a“Consolidation Database”.A consolidation database allows an organisation to; Gain access to valuable data from multiple sources from a central location,increasing efficiency and reliability.Standardise and simplify the way information is managed within an organisation.Develop cross-application reporting to improve business intelligence.Identify obsolete, duplicate and redundant data across multiple applications.COMBINING MULTIPLE SOURCESAs mentioned before, one of the most important aspects of a Business Intelligence system isthe ability to combine multiple sources into a single consolidated database. However thereare some issues that should be considered. How to resolve conflicts between data from different sources relating to the samerecord.How to correlate records that may have different primary keys to identify the record.How to accurately and efficiently store the data in the consolidation database, suchthat it does not duplicate existing records.Notes:Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 10 of 73

Introduction to Data Warehousing & Business Intelligence Systems How to differentiate identical primary keys that refer to different records in the sameconsolidation table. (E.g. companies and universities in an organisation table.)When a Business Intelligence system is being designed, it is important to keep theseproblems in mind. How each organisation resolves these problems is specific to each datasource.A good Extraction, Transformation and Load (ETL) design, and robust consolidationdatabase will mitigate most of these problems. Thoroughly testing the ETL design withrepresentative data from all data sources will limit potential issues.OWNING YOUR DATAOne of the largest issues facing organisations today is the lack of data ownership. More andmore organisations are purchasing software off-the-shelf and deploying them in missioncritical environments.The problem occurs when it comes time to migrate from one vendor to another and the datais locked within a proprietary format, which you cannot access.Open source software and some closed source vendors have started to promote the conceptof “Owning Your Data”. The premise is that your data should always be available andaccessible regardless of license or software issues.A Business Intelligence system can be central to this issue. By extracting information into theBusiness Intelligence system, you have complete ownership of all your data. It can alsoidentify vendors and applications which restrict access to your data.Case Study: A large professional organisation.At a very early stage of the Business Intelligence development process, a vendor of a criticalCRM application refused to allow analysis or extraction of the data into the BusinessIntelligence system, citing that by giving the organisation access they would have access tothe vendor’s intellectual property.It was eventually decided to perform the extracts from the nightly backup dumps of thedatabase, which were stored on the servers. Whilst not ideal, it was the only solutionavailable.Notes:Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 11 of 73

Introduction to Data Warehousing & Business Intelligence SystemsEXERCISE (YOUR ENVIRONMENT)What BI Application do you use?What do you want out of it?What Information do you have?What do you think is involved in a BI project (time, resources, etc)?Notes:Introduction to Data Warehousing & Business Intelligence Systems(cc)-by-sa – Evan LeybournPage 12 of 73

Introduction to Data Warehousing & Business Intelligence SystemsINFORMATION MANAGEMENT“On two occasions I

Introduction to Data Warehousing & Business Intelligence Systems Notes: Introduction to Data Warehousing & Business Intelligence Systems (cc)-by-sa – Evan Leybourn Page 7 of 73 A NOTE ABOUT THE ASSOCIATED SLIDES The presentation material for this course (also released under a Creative Commons BY-SA license) was created in Prezi.

Related Documents:

Data Warehousing on AWS AWS Whitepaper Introduction Data Warehousing on AWS Publication date: January 15, 2021 (Document histor y and contributors (p. 23)) Enterprises across the globe want to migrate data warehousing to the cloud to improve performance and lower costs. This whitepaper discusses a modern approach to analytics and data warehousing

Data warehousing fundamentals for IT professionals / Paulraj Ponniah.—2nd ed. p. cm. Previous ed. published under title: Data warehousing fundamentals. Includes bibliographical references and index. ISBN 978-0-470-46207-2 (cloth) 1. Data warehousing. I. Ponniah, Paulraj. Data warehousing

Data warehousing is the process of creating, populating and querying a data warehouse. It is a framework for deriving information from data. Data mining is the process of identifying and interpreting intrinsic patterns in data to solve a business problem. A vital discovery that propelled the development of data warehousing was the

Introduction 1 Big Data: The evolution of data warehousing 2 Oracle Database 12c and Oracle Exadata: A Data Warehouse as a Foundation for Big Data 3 . However, data warehousing is undergoing a major transition. The benefits of data warehouses are currently being realized in most organizations, partially if not wholly. .

706.520 Data Integration and Large‐Scale Analysis -02 Data Warehousing, ETL, and SQL/OLAP Matthias Boehm, Graz University of Technology, WS 2020/21 Data Warehouse Architecture Data Warehousing S1 Data Warehouse (consolidated raw data, aggregates, metadata) S3 S4 S2 Asyncreplication, and ETL vs ELT Materialized, non‐

Notes for Unit 1 & 2 UNIT I DATA WAREHOUSING Data warehousing Components -Building a Data warehouse -- Mapping the Data Warehouse to . data warehouse that can provide data for reporting and analysis on a section, unit, department or operation in the company, e.g. sales, payroll, production. Data marts are sometimes complete

Chapter 11: Data Warehousing and Business Intelligence New England Data Management Community Data Warehousing Emerged in the 1980's as technology evolved and Decision Support Systems really began to take hold, in order to reduce data redundancy, improve consistency and allow organizations to make better decisions.

Archaeological Illustration ARCL0036 UCL - INSTITUTE OF ARCHAEOLOGY COURSE NUMBER: ARCL0036 Archaeological Finds Illustration 2018/2019 Year 2, 0.5 unit 15 Credits Co-ordinator: Stuart Laidlaw Co-ordinator's e-mail tcfasjl@ucl.ac.uk Co-ordinator's room number is 405 Telephone number 020 7679 4743 Internal 24743 The Turnitin 'Class ID' is 3884493 and the 'Class Enrolment Password' is IoA1819 .