Data Warehouse Database Design
Listing of Data Warehouse Tables
DIM |
|
FACT |
|
HIST |
|
DW |
|
|
PPI |
|
|
STG |
||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
||||
Dimension modelling reference tables |
|
Dimension modelling fact tables |
|
Daily balance tables derived from Data Warehouse fact tables. |
|
Lookup tables copied from mifos OLTP data. |
|
|
|
|
|
Staging tables. Not used for reporting. |
||||
dim_date |
|
fact_loan_disbursals |
|
hist_loan_balances |
|
dw_account_action |
|
|
dw_ppi_survey |
|
|
stg_personnel_names_and_name_changes |
||||
dim_product |
|
fact_loan_repayments |
|
hist_loan_arrears |
|
dw_currency |
|
|
ppi_category_likelihood_bands |
|
|
stg_customer_and_account_updates |
||||
dim_office |
|
fact_loan_fees_and_charges |
|
hist_savings_balances |
|
dw_fee |
|
|
fact_ppi_survey_results |
|
|
stg_customer_type1_columns |
||||
dim_personnel |
|
fact_customer_fees_and_charges |
|
|
|
dw_fund |
|
|
|
|
|
stg_loan_type1_columns |
||||
dim_customer |
|
fact_savings_transactions |
|
|
|
dw_mfi_configuration |
|
|
|
|
|
|
||||
dim_loan |
|
fact_client_attendances |
|
|
|
Lookup tables for Data Warehouse convenience |
|
|
|
|
|
stg_etl_run_history |
||||
dim_savings |
|
Schedule tables cut down and copied to Data Warehouse to allow some processing of Due Vs Collected type functionality |
|
|
|
dw_account_charge_type |
|
|
|
|
|
|
||||
|
|
dw_loan_schedules |
|
|
|
dw_arrears_band_monthly |
|
|
|
|
|
|
|
|
|
|
|
|
dw_loan_fee_schedules |
|
|
|
dw_arrears_band_weekly |
|
|
|
|
|
|
|
|
||
|
|
dw_customer_schedules |
|
|
|
|
|
|
|
|
|
|
||||
|
|
dw_customer_fee_schedules |
|
|
|
|
|
|
|
|
|
|
WIP: Items below to be updated to match the table listing above
DIM Tables
FACT Tables
HIST Tables
The following 'hist' tables hold daily balances for 'active' accounts and are derived from other data warehouse tables. These tables do increase the size of the data warehouse database substantially but offer improved performance and simpler querying.
hist_loan_balances and hist_loan_arrears provides the simplest and quickest way of identifying PAR values at MFI, branch, loan officer, center and groups levels at any point in time.
hist_loan_balances |
Contains daily balances for 'active' loan accounts (good and bad standing status's). Doesn't included reversed transactions as they affect the 'as of' situation. SELECT cust.display_name, l.loan_account_id, l.loan_amount, l.loan_status, l.disbursement_date, ifnull(lb.principal_outstanding, 0.000) balance FROM dim_loan l LEFT JOIN dim_customer cust ON cust.customer_key = l.customer_key LEFT JOIN hist_loan_balances lb ON lb.loan_account_id = l.loan_account_id and lb.as_of_date = '2010-08-01' WHERE (l.valid_from <= '2010-08-01' and l.valid_to > '2010-08-01') ORDER BY cust.display_name, l.loan_account_id |
hist_loan_arrears |
Contains daily balances for over due loans (hist_loans_overdue is a more correct table name). |
hist_savings_balances |
Contains daily balances for 'active' savings accounts (active and inactive status's). |
DW Tables
PPI Tables
STG Tables
DW Picture
Overview - Dimension tables surrounding fact tables
A representation (slightly out of date) of how DW 1.0 solves the gnarly aspect of reporting on transaction totals and arrears aging information at MFI/Branch/Loan Officer/Center levels on current and historical date ranges is ...
Star Schema Involving fact_loan_repayments
A representation of how reporting on loan repayments at MFI/Branch/Loan Officer/Center levels on current and historical date ranges is generally is solved by DW 1.0 is ...
Metrics Reporting
A representation of how DW 1.0 solves reporting on 'metrics' at MFI/Branch/Loan Officer/Center levels on current and historical date ranges is ...
(put dim hierarchy piccy here)
High level DW table Info
These tables are currently deployed and being road tested during report development on the development reporting sandbox. Access to this box is restricted but for those with access... the database is called mifos_secdep_dwh and early (poorly presented by me) reports are running from sandbox-reporting.mifos.org/pentaho/Home under the solutions directory JPWWIP/DW standard reports.
Very Simple Dimension Tables used for filtering and grouping
Table Name |
Description |
ETL Approach ('type 2' means a change results in a new entry being added) |
|
---|---|---|---|
dim_date |
Very typical date dimension table used in star schema for handy date processing e.g. processing by quarter or month or 2nd day of the week. |
Copied from what Jos had in his and Roland's book :) but can be added to (by hand or otherwise) to give whatever data categorisation is relevant. |
|
|
|
|
|
dim_account_action |
Just a list of account actions that appear on mifos account_trxn table. |
Very simple. Just copied from mifos values. No need to manage 'type 2' changes. |
|
dim_currency |
list of currencies. Currency doesn't feature much in current reports but probably will. |
Very simple. Just copied from mifos values. No need to manage 'type 2' changes. |
|
dim_product |
Just a list of products from mifos table prd_offering with a couple of extra columns to cater for savings product specifics. |
Very simple. Just copied from mifos values. No need to manage 'type 2' changes. |
|
Pretty Simple Dimension Tables used for filtering and grouping
Table Name |
Description |
ETL Approach |
|
---|---|---|---|
dim_office |
Just a list of offices from the mifos table office. Does include the search_id column that does the 'hierarchy' magic... to help with hierarchy processing in reports (i.e. easy to pick up all branches under a head office. |
The was one I wanted to capture movement of, for example, branches to other regional offices. However, that change is not record in mifos. So, for now, this is treated as a simple copy of current information. |
|
|
|
|
|
dim_personnel |
Just a list of MFI people from the mifos table personnel. Most relevant because it holds the loan officers... didn't call it dim_loan_officer because wanted to take all MFI people especially as people can 'turn into' and 'out of' loan officers over time. |
Apply 'type 2' when loan officers move branch. This means we keep a history of where the loan officer has worked. |
|
Core Dimension Tables
Table Name |
Description |
ETL Approach |
|
---|---|---|---|
dim_customer |
Very key dimension table which holds center, group and client details |
Apply 'type 2' on hierarchy and status changes. The ETL for this is explained elsewhere but is probably the only 'difficult' bit of the ETL. |
|
dim_loan |
Very key dimension table for loans. |
Apply 'type 2' on hierarchy and status changes. The ETL for this is explained elsewhere but is probably the only 'difficult' bit of the ETL. |
|
dim_savings |
Very key dimension table for savings. |
Apply 'type 2' on hierarchy and status changes. The ETL for this is explained elsewhere but is probably the only 'difficult' bit of the ETL. |
|
The transaction 'fact' Tables
Table Name |
Description |
ETL Approach |
|
---|---|---|---|
fact_savings_transactions |
Key transaction table holding details of savings deposits, withdrawals and interest payments (also adjustments for deposits and withdrawals) |
Transactions simply extracted from mifos table account_trxn and the correct dimension entries applied. |
|
fact_loan_repayments |
Key transaction table holding details of loan repayments (and reversals), write-offs, reschedules |
Transactions simply extracted from mifos table account_trxn and the correct dimension entries applied. |
|
fact_loan_disbursals |
Useful transaction table holding disbursal transactions (and reversals) |
Transactions simply extracted from mifos table account_trxn and the correct dimension entries applied. |
|
fact_client_attendances |
Simple transaction table hold client meeting attendance info. Not used as much as other fact tables but can report on attendance, absenteeism etc. by all levels. |
Transactions simply extracted from mifos equivalent and the correct dimension entries applied. Can be updated. |
|
Arrears Aging/Provisioning Tables
Table Name |
Description |
ETL Approach |
|
---|---|---|---|
hist_loan_arrears |
Snapshot table which contains loan arrears information on a particular date. Has been difficult and expensive to report on |
Scheduled as necessary (expected daily) and derived from data already in DW database. |
|
dim_arrears_band_monthly |
Convenience table. A pre-done monthly banding for arrears aging. Can be edited to meet specific customer needs. |
DW Static data (but can be altered). ETL job uses the data |
|
dim_arrears_band_weekly |
Convenience table. A pre-done weekly banding for arrears aging. Can be edited to meet specific customer needs. |
DW Static data (but can be altered). ETL job uses the data |
|
Intermediate Staging Tables (not for reporting from)
Table Name |
Description |
ETL Approach |
|
---|---|---|---|
stg_customer_and_account_updates |
Staging table - into which are put all new customers and account and all changes |
Loaded from empty when ETL Run |
|
stg_personnel_names_and_name_changes |
Staging table - when a center, group or client changes loan officer... the old and new loan officer |
Loaded from empty when ETL Run |
|
stg_customer_type1_columns |
Staging table - latest value of customer columns not involved in type 2 updates. Allows columns to |
Loaded from empty when ETL Run |
|
stg_loan_type1_columns |
Staging table - latest value of loan account columns not involved in type 2 updates. Allows columns to |
Loaded from empty when ETL Run |
Other
Table Name |
Description |
ETL Approach |
|
---|---|---|---|
dw_mfi_configuration |
currently contains: |
Loaded from prd_type mifos table |
|
dw_loan_schedules |
loan schedule details (was a 'fact' table but decided against that as found it very hard to keep schedule entries tracked to the dim_loan entries. |
Transactions simply extracted from mifos equivalent. Only links to the loan_account_id not a specific entry in dim_loan. Need to be updated if the schedule dates change (if disbursed on a date different to the expected date). |
|
dw_account_charge_type |
Indicates the type of fee...charge/fee/penalty - use on the fee transaction tables |
preloaded with DW. |
|