How Scaler estimates missing energy meter data – Scaler

Purpose of this article

This article explains how Scaler estimates missing energy meter data using a transparent, weighted methodology, and how estimated values are used in analytics, exports, and reporting.

What is estimation in Scaler?

Estimation fills missing or incomplete portions of energy meter data to create a continuous and consistent dataset.

Estimation is applied when:

A meter reading covers only part of a month

One or more months of consumption data are missing

Data is delayed or partially reported

Estimated values are modeled, not measured, and are clearly identified in analytics and exports.

Important: Estimation is different from normalization. Estimation fills data gaps; normalization adjusts data for comparability.

Where estimation is used

Analytics Portal

Location: Analytics Portal → Portfolio → Analytics → Energy

How to view estimates:

Select View → Estimated (Scaler algorithm)

Estimated values appear alongside actual data

Asset-level charts show estimated portions as stacked bars

Data Collection Portal exports

Location: Data Collection Portal → Portfolio → Meter List → Download

Export column: estimated_linear_extrapolation_consumption

Purpose: Complements actual_monthly_consumption to provide a complete time series

Estimated values are included to support:

Continuous analytics

Benchmarking and scoring

External reporting workflows that expect uninterrupted time series

How the estimation model works

Scaler uses a linear, history-weighted estimation model based on each meter's actual past consumption.

The model follows these steps:

Detects missing periods: Identifies incomplete months at the meter level, even if only a few days are missing.

References recent history: Looks back up to 6 months of valid historical consumption for the same meter.

Normalizes to daily averages: Converts historical monthly values into average daily consumption.

Applies weighted logic: More recent months are weighted more heavily using a decay function.

Estimates the gap: Missing days or months are filled using the weighted daily average.

Assigns a confidence score: Each estimate is scored based on data availability and proximity to actual readings.

Confidence scoring

Each estimated value includes a confidence score to help assess reliability.

The confidence score considers:

Amount of historical data available: More recent, complete history results in higher confidence.

Distance from known data: Estimates closer to actual readings have higher confidence than those further away.

Note: Confidence scores are guidance indicators, not guarantees of accuracy.

Key benefits of Scaler’s estimation approach

Avoids flat or arbitrary assumptions

Uses real, meter-specific historical data

Supports like-for-like scoring and benchmarking

Improves portfolio comparability

Aligns with expectations of external reporting frameworks

Limitations to consider

Does not adjust for seasonality

Assumes recent consumption patterns remain relevant

Uses a fixed 6-month lookback window

Estimated values should not replace measured data when actual readings are available

Occupancy rates (Average vacancy rate) are not directly factored into estimation calculations. However, occupancy is implicitly reflected in the consumption data itself. Since Scaler estimates missing months using the past six months of actual consumption (with greater weight on recent months), any occupancy-related usage patterns are already captured in the historical data that drives the estimates.

How estimation differs from normalization

Estimation	Normalization
Fills missing or unreported consumption	Adjusts reported consumption to a common basis
Creates modelled values where data is missing	Scales existing values without inventing new usage
Uses historical patterns to estimate gaps	Uses structural factors (time, occupancy, weather)
Focused on data continuity	Focused on comparability
Answers: “What likely happened when data is missing?”	Answers: “How would this data compare under equal conditions?”

Note: Time-based normalization adjusts reported consumption to a full-year equivalent but does not estimate unreported usage; estimation is only applied when data is missing at the meter level.

See Normalization methodologies in Scaler for details on normalization logic.