Apply Minimum Data Requirements for Analysis

1. Abstract

Reliable analysis in Board Foresight starts with datasets that meet minimum standards for history, completeness, and consistency. Ensuring that data satisfies these requirements before it is explored or modeled helps produce stable insights, clearer correlations, and more trustworthy forecasts.

2. Context

Apply this best practice before using any dataset in Discover, adding it to a Workbench, or including it in a Predict job. These checks should be part of standard data preparation for all time-series indicators, whether internal or external.

3. Content

3.1 Why It Matters

Time-series analysis relies on patterns observed over time. Datasets with limited history, excessive gaps, or inconsistent structure often produce misleading insights, even if tools technically allow them to be used.

When minimum data requirements are not met:

Correlations may appear strong but fail to hold over time
Trends may reflect short-term noise rather than underlying behavior
Models may overfit recent periods and perform poorly out of sample

Applying minimum data standards ensures that insights drawn from Discover, Workbenches, and Predict are grounded in sufficient historical context and are more likely to generalize as conditions change.

3.2 How to Apply

Before using a dataset in analysis, confirm that it meets the following minimum requirements:

History
- At least 5 years of data, or 60 monthly observations
- This provides exposure to multiple business and economic cycles
- 3 years of monthly data is the minimum viable history for use in the Discover and Predict engines.
Completeness
- Fewer than 5% missing values across the full series
- Any missing periods should be addressed prior to data load using documented, business-aware methods
Frequency
- Monthly data is preferred for most analyses
- Quarterly data may be acceptable but requires longer history to achieve the necessary number of observations
Continuity
- No gaps in the time series
Quality
- Values fall within reasonable business ranges
- Units, scales, and definitions are consistent over time

If a dataset does not meet these criteria, consider extending it with historical archives, supplementing it with related indicators, or excluding it from analysis until requirements are met.

3.3 Example

An analyst wants to include a retail sales indicator that contains only 36 months of history. Before modeling, they locate archived financial reports and extend the series to cover five full years. With sufficient history now available, the indicator can be confidently used in Predict.

3.4 Common Pitfalls

Accepting short histories because the model “still runs”
Mixing monthly and quarterly indicators in the same Workbench without adjustment
Ignoring small but recurring gaps that truncate training windows
Using datasets with silent definition or unit changes over time