Skip to content

Quantum Advantage in Encoding

Quantum data encoding is not automatically better than classical feature engineering. Quantum advantage is conditional — it depends on the encoding, the data, and the downstream task. This page clarifies when quantum encodings can outperform classical alternatives and what theoretical guarantees exist.


The Central Question

  Can a quantum encoding produce a feature space that a classical
  computer cannot efficiently replicate?

  If yes  →  Potential quantum advantage
  If no   →  Quantum encoding adds cost without benefit

When Quantum Advantage Is Possible

1. Non-Simulable Encodings with Hard Kernels

The quantum kernel \( k(x, x') = |\langle\psi(x)|\psi(x')\rangle|^2 \) can be computed by running the encoding circuit on a quantum processor. For certain encodings (IQP, ZZ Feature Map), computing this kernel classically is believed to require exponential resources.

Theoretical result

If IQP circuits could be efficiently simulated classically, the polynomial hierarchy would collapse to the third level (Bremner, Montanaro & Shepherd, 2016). This is considered extremely unlikely, providing evidence that the IQP quantum kernel is classically intractable.

2. Data with Quantum-Relevant Structure

Quantum advantage is most likely when the data has structure that aligns with what the quantum encoding naturally captures:

  • Pairwise feature interactions → IQP, ZZ Feature Map
  • Symmetry → Equivariant encodings
  • High-dimensional correlations → Amplitude encoding
  • Data generated by quantum processes → native quantum encodings

3. Equivariant Encodings on Symmetric Problems

When data possesses a known symmetry (rotational, permutation, cyclic), equivariant encodings can exploit this structure in ways that generic classical models may need exponentially many parameters to learn.


When Quantum Advantage Is NOT Present

  • Simulable encodings (Angle, Basis): these can always be replaced by a classical feature map at equal or lower cost
  • Random data: no structure for the encoding to exploit
  • Trivially separable problems: a linear classifier suffices — no encoding advantage
  • Encodings that don't match the data structure: using IQP on data with no pairwise interactions wastes resources

The Expressibility-Trainability Tradeoff

More expressive encodings can represent richer functions but are harder to train:

                             Potential
                             Advantage
                                │         ╱
                                │       ╱
                                │     ★  Sweet spot
                                │   ╱
                                │ ╱
  No advantage ─────────────────┼──────────────► Expressibility
                          Barren plateaus
                          (untrainable)

The optimal encoding sits in a regime where it is expressive enough to capture the relevant features but trainable enough to be optimised in practice.


Summary

Condition Advantage?
Non-simulable encoding + quantum-structured data Likely
Equivariant encoding + symmetric data Likely
Non-simulable encoding + random data Unlikely
Simulable encoding + any data No
Any encoding + trivially separable data No

Further Reading

  • Havlicek et al., Supervised learning with quantum-enhanced feature spaces, Nature 567 (2019)
  • Schuld & Killoran, Quantum machine learning in feature Hilbert spaces, PRL 122 (2019)
  • Kubler et al., The inductive bias of quantum kernels, NeurIPS 2021
  • Encodings Reference — detailed properties of each encoding