The Kimball Lifecycle
This article relies too much on references to primary sources. (December 2010) (Learn how and when to remove this template message) |
The Kimball Lifecycle is a methodology for developing data warehouses, and has been developed by Ralph Kimball and a variety of colleagues.
Program/Project planning
According to Kimball et al., this phase is the start of the lifecycle. It is a planning phase in which project is a single iteration of the lifecycle while program is the broader coordination of resources. When launching a project/program Kimball et al. suggests following three focus areas:
- Define and Scope project
- Plan project
- Project management
Program/Project Management
This is an ongoing discipline in the project. The purpose is to keep the project/program on course, develop a communication plan and manage expectations.
Business Requirements Definition
This phase/milestone of the project is about making the project team understand the business requirements. Its purpose is to establish a foundation for all the following activities in the lifecycle. Kimball et al. makes it clear that it is important for the project team to talk with the business users and be prepared to focus on listening and to document the interview. An output of this step is the Enterprise bus matrix.
Technology Track
The top track holds two milestones:
- Technical Architecture Design is supposed to create a framework for the DW/BI system. The main focus in this phase is to create a plan for the application architecture, while considering business requirements, technical environment and the planned strategic technical directions.
- Product Selection & Installation use the architecture plan to identify what components are needed to complete the DW/BI project. This phase then selects, installs and tests the products.
Data Track
Dimensional Modeling is a process in which the business requirements are used to design dimensional models for the system.
Physical Design is the phase where the database is designed. It involves the database environment as well as security.
ETL Design & Development is the design of some of the heavy procedures in the DW/BI-system (Extract, Transform, Load). Kimball et al. suggests four parts to this process, which are further divided into 34 subsystems (Kimball et al., 2008):
- Extracting data
- Cleaning and conforming data
- Delivering data for Presentation
- Managing the ETL system
Business Intelligence Application Track
BI Application Design deals with designing and selecting some applications to support the business requirements. BI Application Development use the design to develop and validate applications to support the business requirements.
Deployment
When the three tracks are complete they all end up in the final deployment. This phase requires planning and should include pre-deployment testing, documentation, training and maintenance and support.
Maintenance
When the deployment has finished the system will need proper maintenance to stay alive. This includes data reconciliation, execution and monitoring and performance tuning.
Growth
As the project can be seen as part of the larger iterative program, it is likely that the system will want to expand. There will be projects to add new data as well as reaching new segments of the business areas. The lifecycle then starts over again.
References
Kimball, R., Ross, M., Thornthwaite, W., Mundy, J., & Becker, B. (2008). The data warehouse lifecycle toolkit (2nd ed.). Wiley Publishing, Inc. ISBN:978-0-470-14977-5