ML vs SPC March 10, 2026

Predictive AI vs. Statistical SPC: What Formation Data Actually Needs

SPC is designed for stable processes. Formation cycling is an electrochemical transformation. The right tool depends on what you're actually measuring.

By Priya Nair, Lead Data Scientist

Data visualization comparing SPC control charts with ML model inference outputs for battery formation

Statistical process control is a mature discipline with a clear founding assumption: the process being monitored is operating in a state of statistical control around a stable mean, and departures from that mean are detectable as signals against a background of common-cause variation. SPC works beautifully when that assumption holds. The question for battery formation monitoring is whether that assumption ever actually holds — and the honest answer is: only partially, and not in the ways that matter most.

Formation cycling isn't a stable process around a mean. It's an electrochemical transformation. Each cell starts in a fundamentally different thermodynamic state than it ends in. The "process" being monitored isn't stamping a part to a tolerance — it's inducing a one-time irreversible chemical change that defines a cell's electrochemical character for its entire service life. That distinction has real consequences for which analytical tools are appropriate.

What SPC Does Well at Formation

Let's be precise about where SPC earns its place on the formation floor, because it does have one.

SPC control charts on scalar formation outputs — first-cycle coulombic efficiency, nominal capacity at C/5, internal resistance at 50% SOC — are appropriate for detecting assignable-cause variation in those metrics when the line is running at steady state. If your formation line runs 10,000 cells per day at a mean CE of 98.2% with a process standard deviation of 0.15%, a Western Electric rule violation on a Shewhart X-bar chart (8 consecutive points on the same side of the mean, or 1 point beyond 3σ) will catch step changes in line behavior. Equipment failure, major chemistry lot transitions, gross protocol errors — these all manifest as assignable-cause signals that SPC is designed to find.

For this category of signal — sudden, large, affect-many-cells-at-once — SPC is adequate. It's cheap, interpretable, auditable, and your QA team already knows how to operate it.

Where SPC Breaks Down

The problem is that the defect modes with the highest cost-per-event aren't the ones SPC catches. They're the slow drifts, the spatially-correlated anomalies, and the feature-shape deviations that don't move the scalar summary statistics outside control limits.

The Scalar Aggregation Problem

A dQ/dV curve from a single formation cycle contains thousands of data points. The charge curve and discharge curve each carry distinct electrochemical information. Peak positions reflect the cathode phase transition voltages; peak widths reflect kinetic limitations and electrode tortuosity; valley depths encode relative capacity ratios between active materials.

When you reduce that curve to a single CE number, you discard approximately 99.9% of the information it contains. The CE number tells you: was total charge returned within spec? It does not tell you: did the SEI form uniformly? Were there any sub-threshold lithium plating events? Is the dQ/dV valley at 3.45V deeper than the reference distribution for this chemistry and temperature?

SPC can only monitor what you measure. If you measure scalars, you get scalar monitoring. The information gap isn't a failure of SPC as a technique — it's a consequence of the measurement choices that preceded the monitoring system design.

The Non-Stationarity Problem

An SPC chart requires a stable reference distribution to define the control limits. Formation data is conditionally non-stationary: the mean and variance of formation metrics shift with electrolyte lot, ambient humidity, cathode coating lot, and seasonal temperature variation (particularly relevant for facilities without full HVAC in their formation rooms). Every time a new electrolyte lot arrives, the CE mean shifts by a small but real amount. Every season, temperature-dependent kinetics affect early-cycle charge behavior.

A naive SPC chart set against a 3-month rolling baseline will either have control limits that are too wide (accommodating real process shifts as common-cause variation) or generate false alarms every time a material lot transition occurs. Either outcome erodes operator trust in the monitoring system — and an alert system that operators learn to ignore is worse than no alert system at all.

The High-Dimensional Correlation Problem

The most predictive features in formation data are not individual metrics — they are relationships between metrics. The ratio of first-cycle CE to second-cycle CE. The correlation between early-step IR and final-step capacity. The relationship between charge-curve peak position variance and the distribution of discharge-curve valley depth across a population of cells from the same formation run.

These relationships are high-dimensional. Monitoring them with SPC requires either reducing them to hand-crafted scalar features (losing information again) or running multivariate SPC (Hotelling T² or similar), which is technically valid but requires an extremely stable reference covariance structure and becomes unreliable as soon as chemistry or equipment configuration changes.

What Machine Learning Actually Adds — and Where It Doesn't

Consider a specific scenario: a 2170-format NMC-811 line running an Arbin BT-2000 bank, 500 channels, 48-hour formation protocol. The line has been running for 8 months and has accumulated approximately 4 million cell-formation records. The goal is to predict which cells in the current formation run will fail the post-formation aging self-discharge test, before the aging test is complete.

The relevant features for this prediction task aren't available as single numbers — they require extracting shape features from the dQ/dV curves of cycles 1 and 2, computing the IR evolution across the protocol steps, and correlating those features with the temperature history of the formation chamber during those specific cycles. That's a roughly 200-feature input space per cell.

An SPC framework cannot operate in a 200-dimensional space in any practically useful way. A gradient-boosted tree or a 1D convolutional model trained on the time-series telemetry can learn the relevant feature interactions from historical failure labels without requiring a human to specify which features matter. The model learns that the combination of "CE > 98.0% AND dQ/dV valley depth ratio < 0.87 AND IR step increase between cycle 1 and cycle 3 > 0.4 mΩ" predicts self-discharge failure at 3× the baseline rate — a conjunction that no individual threshold rule would capture.

We're not claiming that ML eliminates the need for domain knowledge — the feature extraction and training label design require deep process understanding. What we're saying is that the right analytical tool for a complex, high-dimensional, conditionally non-stationary time-series prediction problem is not Shewhart control charts. The physics doesn't care about tool familiarity.

The Practical Hybrid Architecture

On production lines where we've had the chance to observe both approaches operating in parallel, the appropriate architecture isn't "replace SPC with ML" — it's layered monitoring with complementary roles:

SPC layer: Real-time scalar monitoring of CE, capacity, and IR against adaptive control limits. Catches assignable-cause step changes. Fast, auditable, explainable to QA auditors. Required for ISO 9001 and IATF 16949 documentation.
Feature-extraction layer: Per-cell dQ/dV curve analysis, IR evolution tracking, and protocol-step cross-correlation. Runs on the raw cycler telemetry, outputs per-cell feature vectors.
Predictive model layer: Population-aware anomaly scoring and end-of-process failure prediction, operating on the feature vectors. Generates risk scores and actionable process parameter recommendations, not just pass/fail flags.

The SPC layer keeps QA compliance intact. The predictive layer catches what SPC misses. Neither replaces the other; they cover different parts of the signal space.

The goal isn't to monitor formation data more aggressively. The goal is to act on information that's already in the telemetry — before the formation run ends, before the cells move to aging, before the defect propagates through the rest of the production process and becomes a warranty event.