How can we help? 👋

How Scaler estimates missing energy meter data

Learn how Scaler estimates missing or incomplete energy meter data, when estimated values are applied, and how to interpret estimated consumption and confidence scores.

Purpose of this article

This article explains how Scaler estimates missing energy meter data using a transparent, weighted methodology, and how estimated values are used in analytics, exports, and reporting.


What is estimation in Scaler?

Estimation fills missing or incomplete portions of energy meter data to create a continuous and consistent dataset.

Estimation is applied when:

  • A meter reading covers only part of a month
  • One or more months of consumption data are missing
  • Data is delayed or partially reported

Estimated values are modeled, not measured, and are clearly identified in analytics and exports.

Important: Estimation is different from normalization. Estimation fills data gaps; normalization adjusts data for comparability.


Where estimation is used

Analytics Portal

Location: Analytics Portal → Portfolio → Analytics → Energy

How to view estimates:

  1. Select View → Estimated (Scaler algorithm)
  1. Estimated values appear alongside actual data
  1. Asset-level charts show estimated portions as stacked bars

Data Collection Portal exports

Location: Data Collection Portal → Portfolio → Meter List → Download

Export column: estimated_linear_extrapolation_consumption

Purpose: Complements actual_monthly_consumption to provide a complete time series

Estimated values are included to support:

  • Continuous analytics
  • Benchmarking and scoring
  • External reporting workflows that expect uninterrupted time series

How the estimation model works

Scaler uses a linear, history-weighted estimation model based on each meter's actual past consumption.

The model follows these steps:

  1. Detects missing periods: Identifies incomplete months at the meter level, even if only a few days are missing.
  1. References recent history: Looks back up to 6 months of valid historical consumption for the same meter.
  1. Normalizes to daily averages: Converts historical monthly values into average daily consumption.
  1. Applies weighted logic: More recent months are weighted more heavily using a decay function.
  1. Estimates the gap: Missing days or months are filled using the weighted daily average.
  1. Assigns a confidence score: Each estimate is scored based on data availability and proximity to actual readings.

Confidence scoring

Each estimated value includes a confidence score to help assess reliability.

The confidence score considers:

  • Amount of historical data available: More recent, complete history results in higher confidence.
  • Distance from known data: Estimates closer to actual readings have higher confidence than those further away.

Note: Confidence scores are guidance indicators, not guarantees of accuracy.


Key benefits of Scaler’s estimation approach

  • Avoids flat or arbitrary assumptions
  • Uses real, meter-specific historical data
  • Supports like-for-like scoring and benchmarking
  • Improves portfolio comparability
  • Aligns with expectations of external reporting frameworks

Limitations to consider

  • Does not adjust for seasonality
  • Assumes recent consumption patterns remain relevant
  • Uses a fixed 6-month lookback window
  • Estimated values should not replace measured data when actual readings are available
  • Occupancy rates (Average vacancy rate) are not directly factored into estimation calculations. However, occupancy is implicitly reflected in the consumption data itself. Since Scaler estimates missing months using the past six months of actual consumption (with greater weight on recent months), any occupancy-related usage patterns are already captured in the historical data that drives the estimates.

How estimation differs from normalization

Estimation
Normalization
Fills missing or unreported consumption
Adjusts reported consumption to a common basis
Creates modelled values where data is missing
Scales existing values without inventing new usage
Uses historical patterns to estimate gaps
Uses structural factors (time, occupancy, weather)
Focused on data continuity
Focused on comparability
Answers: “What likely happened when data is missing?”
Answers: “How would this data compare under equal conditions?”

Note: Time-based normalization adjusts reported consumption to a full-year equivalent but does not estimate unreported usage; estimation is only applied when data is missing at the meter level.

See Normalization methodologies in Scaler for details on normalization logic.

Did this answer your question?
😞
😐
🤩