Dw Presentation 20010925

download Dw Presentation 20010925

of 23

Transcript of Dw Presentation 20010925

  • 7/31/2019 Dw Presentation 20010925

    1/23

    Data Administration

    Data Warehouse Implementation

    9/25/01

  • 7/31/2019 Dw Presentation 20010925

    2/23

    DW Current Resources

    Server - IBM RS6000 S7A with 4 engines, 4GB memory,200GB disk space, running AIX 4.3.0.

    Data100GB disk space used to house FRS, FES, HRS andSIS Oracle extracts. This is essentially a data dump of the

    IDMS data, and views still need to be created.

    Staff

    DBA - Henry 80%, Song 20%, Short 10%, Chen 10%

    DA - Kisil 70%, Cheesman 50%, Claunch 50%, Carter 100%

  • 7/31/2019 Dw Presentation 20010925

    3/23

    DW Tasks

    DBA Extract data, design Oracle DB, load data, andproduction support (i.e. monitor system and DB performance,enforce security, schedule backups, etc.)

    Data Administration User interface, develop requirementsdocument for all DW projects, evaluate data quality, createDB views, develop specialized reports, test, train users, andcoordinate projects.

    Both Infrastructure design (with Systems staff), and toolevaluation (ETL, OLAP and desktop reporting) with help fromthe C/S group.

  • 7/31/2019 Dw Presentation 20010925

    4/23

    Where are we now?

    Data Staging Area

    Oracle Extract of FRS, FES,

    HRS, and SIS No cleansin

    Source Data

    IDMS Sybase

    Data Inventory

    Element names, formats, anddefinitions

    FRS, FES, HRS and SIS

    SAS Data Mining Software

    Software purchasedPlanning implementation

  • 7/31/2019 Dw Presentation 20010925

    5/23

    Where do we want to be in three

    years?

    Data Warehouse Cleansed Subset of Detail Data Subset of Summary

    Data Multiple Years of Data Periodic Updates Strategic

    External DataCensus Data, Benchmark,Salary Surveys, Economic

    Data

    Data Staging Area Extract Data Transform Data Quality Assurance Create Metadata

    Operational DataIDMS

    Oracle

    Sybase

    Ad Hoc Query Repository Copy of Source Data Operational Daily Updates All Elements

    Minimum Number of Years ofData

    Data Mart #?

    Data Mart #2Resource Management

    Data Mart #1Course Management

    Subset of DW Summarized in specific

    manner Tactical

    MetadataSAS Data Mining

    Server

    OLAP Server

    Ad Hoc andOperational

    Reports

    Tell me what mayhappen, or what is

    interesting?

    Tell me what

    happened?

    Tell me whathappened and

    why?

    Give me informationto help me achievespecific goals!

  • 7/31/2019 Dw Presentation 20010925

    6/23

    What is a

    Decision Support System

    EIS

    Data Mart

    Data Warehouse

    Operational Data Store

    Covansys

    High Level Summarized DataFor Top Executives

    (Pre-programmed DASHBOARD)

    Addresses SpecificSubject Area

    Collection Of Integrated SubjectOriented Databases(Historical)

    Time-Current, Integrated

    Databases(Tactical-Power Users)

    Components of a Decision Support System

  • 7/31/2019 Dw Presentation 20010925

    7/23

    How do we get there?

    Educate users.

    Develop detailed requirements documents,including Information Value Chains for all

    goals which the DW/DM is expected toaddress.

    Data Mart approach.

    Phased implementation.

    Additional resources.

  • 7/31/2019 Dw Presentation 20010925

    8/23

    Educate Users

    BasicsWhat is a Data Warehouse? Create a

    single-source-of-truth.What its not! (All the data,with daily updates and online access.)

    Change in cultureLets make better decisionsbased on objective analysis of data.

    Set realistic expectations - No silver bullet. It can helpyou make better decisions, but you still have to beresponsible for implementing those decisions.

    Focus on institutional goalsWhat is it we need toachieve? What metrics do we need to evaluate our

    progress in attaining goals?

    Importance of business sponsorsMake timelybusiness decisions and support IT requests for additionalfunds.

  • 7/31/2019 Dw Presentation 20010925

    9/23

    Course Management (I.V.C.)Business Functions and Goals Optimize course offerings to meet student need.

    Improvement Opportunities Increase number of high demand courses/sections Increase maximum enrollment in sections Eliminate or reduce frequency of low demand courses Improve course meeting patterns and delivery mode Performance Measures

    # and % decrease of students who do not getany section of the course requested

    # and % decrease of low demand courses # and % increase in enrollment % usage of classroom capacity % decrease in length of time to graduate

    # and % increase in courses taught throughpreferred mode

    Business Questions What are the characteristics of high/low demand courses? What characteristics of the student are related to demand? What courses can be eliminated? Which courses should/can be moved to smaller/larger facilities? What impact does the meeting time and location have on demand? What improvements can be made with/without additional money?

    Data Model

    Data Mart/Warehouse

    (American Management Systems, Inc.)

    Courses

    Student

    Enrollment

    Defines

    Economic Data

    Course Demand

    College Budgets Degree Reqs.

    Facilities

    Available Faculty

  • 7/31/2019 Dw Presentation 20010925

    10/23

  • 7/31/2019 Dw Presentation 20010925

    11/23

    DW Subject Areas

  • 7/31/2019 Dw Presentation 20010925

    12/23

    Accounts (dollars for faculty and supplies)

    Assets (facilities and equipment)

    Benchmark Data(what and when coursesoffered, degree requirements)

    Courses (meeting pattern, teaching mode,requirements)

    Department/College (who owns the course,degree offered and requirements)

    Economic Data(salary and demand byprofession)

    Faculty (distribution of effort, availability to teach)Faculty Applicants (who and how many)

    Population Data(who, where and what volume)

    Staff (instructional support)

    Students (classification, course demand)

  • 7/31/2019 Dw Presentation 20010925

    13/23

    Accounts (dollars for services, financial aid)

    Assets (facilities and equipment, housing)

    Benchmark Data(enrollment patterns)

    Courses (meeting pattern, teaching mode,requirements, professor)

    Department/College (who owns the course,

    degree requirements, degrees offered)Economic Data(general condition of economy,hot professions)

    Faculty (quality, student evaluations)

    Population Data(who, where and what volume)Staff (instructional support, student service area)

    Students (demographics, course demand, GPA)

    Student Prospects (demographics, contacts)

    Student Applicants (demographics, contacts)

  • 7/31/2019 Dw Presentation 20010925

    14/23

    Project Schedule

    Phase I, 2001-02

    Infrastructure & Planning

    Phase II, 2002-03

    Phase III, 2003-04

  • 7/31/2019 Dw Presentation 20010925

    15/23

    1) IDMS Data Dump to Oracle100% complete.

    2) Create Views for Data Dump (Ad HocReporting Repository) - Target completion 3Q 01-02.

    3) WebFOCUS ImplementationTarget completion01/01/02.

    4) Data Mining Tools for IR staffPurchased, andplanning installation. Target installation completion 12/1/01.

    5) Establish Enterprise Standards for Key DataAnalysis and recommendations are ongoing.

    6) Identify and Prioritize Data Mart DevelopmentOngoing work with Data Stewards.

  • 7/31/2019 Dw Presentation 20010925

    16/23

    7) Select Desktop Reporting Tools Preliminaryevaluation beginning by DA, DBA and C/S staff. Selectedtool(s) will be the IT standard and supported by our staff.Target completion 1/1/02. Choices: Crystal Reports

    Infomaker SAS

    WebFOCUS

    Cognos

    8) Select ETL Tools Preliminary comparison, and funds

    will be requested for 02-03. Choices: Ascential

    IBM

    Informatica

    Oracle

    SAS

  • 7/31/2019 Dw Presentation 20010925

    17/23

    9) Develop basic Data Marts andCorresponding DW Functionality

    CPE Six years of data loaded in Oracle; SAStesting underway. Target completion 11/1/01.

    GASB Requirements phase. Target projectcompletion 2/1/02.

    Course ManagementDM Requirementsphase. Target completion 4/1/02.

    Enrollment Management DM Preliminarymeetings. Data will be added incrementally,with target project completion 1/1/03.

  • 7/31/2019 Dw Presentation 20010925

    18/23

    1) ETL Tools Purchase and install by 11/1/02.

    2) Create Metadata Will need to be created for the existingData Marts, as well as the new. Target completion forexisting DM 2/1/03.

    3) Create Additional Data Marts Complete Enrollment DM

    by 7/1/03, and start/complete two or more of the following:Department Mgt., Diversity & Equity Initiatives, GrantManagement, Research Management and ResourceManagement (accounts, personnel, etc.).

    4) Evaluate OLAP Tools Choices could include Cognos andEssbase. Select tool by 1/1/03, and request funds for 03-04.

    5) Evaluate External Data Needs - Identify external dataneeded for the Data Marts, locate source(s), estimate costand request funds for 03-04. Target completion 3/1/03.

    6) Begin Preliminary Development of EIS -Executive

    Information System.

  • 7/31/2019 Dw Presentation 20010925

    19/23

    1) OLAP Tools Purchase and install by 11/1/03.

    2) Create Additional Data Marts Start/complete twoor more of the following: Department Mgt., Diversity& Equity Initiatives, Grant Management, ResearchManagement and Resource Management.

    3) Purchase and Load External Data - Targetcompletion 7/1/04.

    4) Continue Development of EIS - ExecutiveInformation System.

  • 7/31/2019 Dw Presentation 20010925

    20/23

    New DW Resources

    Server (Existing) $84,300Additional 4 engines and 4GBmemory for Data Warehouse Server.

    Servers (New)Additional servers will need to be purchased forthe Ad Hoc Reporting Repository, the Data Staging Area, theData Mining and/or OLAP tools, and for one or more of the Data

    Marts. Size, type and cost will vary depending on function. Goalis for remaining servers to be Intel-based to minimize cost. Costunknown.

    Storage Additional disk space for DW server, and for other

    servers using Enterprise Storage System. Exact amounts foronline storage and for archiving will be identified during theproject requirements phase. Cost unknown.

  • 7/31/2019 Dw Presentation 20010925

    21/23

    New Resources (Cont.)

    Training $21,200 InfrastructureHow do you design a DW environment?Staff Training (TDWI Seminar $7,600 for 2 FTE)

    Oracle DB Training ($7,200 for 4 FTE)

    Technical Training Analysis and reporting tools. ($6,400 for

    4 FTE) Onsite visits to other universities.

    Consulting on Infrastructure Design Covansys,Gavroshe, Oracle, IBM, etc. Cost unknown.

    Staff $140,000 for 2 FTE DBA 1 FTE (Totally devoted to DW development.)

    Reporting 1 FTE (Totally devoted to development of the EISand/or institutional reports from DW environment.)

  • 7/31/2019 Dw Presentation 20010925

    22/23

    DW Critical Issues

    Personnel ResourcesNeed to keep people focused on DW inlight of competing projects (i.e., Sybase/Omnis Conversion, WWTE, IPP,IDMS maintenance.)

    TrainingIT and users.

    ConsultingAssistance for infrastructure planning. This is new to us!

    Culture Change Focus on goals rather than data elements. Tell uswhat, not how! Value in making decision based on data. Proof ofconcept.

    Requirements GatheringNeeds to be thorough, and heavily relieson timely user decisions. Defining who the users are on any given

    project could be political.Budget/FundingFor additional hardware, software and people.

    Business SponsorThe Data Warehouse is not another ITproject. It is an enterprise initiative!

  • 7/31/2019 Dw Presentation 20010925

    23/23

    Data Administration

    QUESTIONS?