Kernel Design

Kernel Design#

The Q-statistic measures the strength of spatial dependence in a feature, but the kernel decides which spatial patterns count as “dependent”. Smooth large-scale gradients (low-frequency), sharp boundaries between neighbouring spots (high-frequency), and graph-defined neighbourhoods are all valid choices, and they are captured by different kernels. Two genes can swap which one scores higher just by swapping the kernel.

This page is the practical guide. For the underlying math, see Theoretical Results (Theorems 1-2 and the CAR derivation).

Default recommendations#

The choice splits along two axes: what pattern you are looking for (smooth gradient or sharp local variation) and how the data sits in space (irregular coordinates, graph, or regular grid).

Data layout	Pattern of interest	Pick
Coordinate cloud	Smooth, large-scale gradient	`NUFFTKernel` with `method="matern"` (`nu=1.5`).
Coordinate cloud (graph-defined)	Smooth, large-scale gradient	`MatrixKernel` with `method="car"` (`rho=0.9`, `k_neighbors=4`).
Coordinate cloud (graph-defined)	Sharp variation between neighbours	`MatrixKernel` with `method="graph_laplacian"`.
Regular rasterised grid (Visium HD, imaging)	Smooth, large-scale gradient	`FFTKernel` with `method="car"` or `method="matern"`. Both have polynomial spectral decay, which keeps power on mid- and high-frequency modes.
Small `n` (under ~5 000), sanity check	Any	`MatrixKernel`. The dense matrix path is the simplest.

If you don’t yet know which pattern type you want, start with CAR or Matérn. They look for smooth gradients and still report calibrated p-values when the true signal is sharp, just with lower power.

Code:

from quadsv import NUFFTKernel, MatrixKernel, FFTKernel

# Irregular coords, smooth pattern (default)
kernel = NUFFTKernel(coords, method="matern", bandwidth=25.0, nu=1.5)

# Irregular coords, graph-flavoured pattern
kernel = MatrixKernel.from_coordinates(
    coords, method="car", rho=0.9, k_neighbors=4
)

# Regular rasterised grid (CAR)
kernel = FFTKernel(
    shape=(1000, 1000), method="car", rho=0.9, neighbor_degree=1
)

# Regular rasterised grid (Matérn)
kernel = FFTKernel(
    shape=(1000, 1000), method="matern", bandwidth=4.0, nu=1.5
)

The backend keyword on DetectorIrregular selects between NUFFTKernel and MatrixKernel. See Quick Start.

Picking a method#

Kernel `method`	What it captures	Spectral decay
`"gaussian"`	Soft, distance-based smooth patterns.	Exponential in \(\|\omega\|^2\). Drops mid- and high-frequency structure under the noise floor quickly.
`"matern"`	Distance-based smooth patterns with tunable smoothness.	Polynomial, rate \(-(2\nu + d)\) in \(\|\omega\|\). `nu` controls smoothness, `bandwidth` controls the cut-off frequency. `nu=1.5` is a common starting point.
`"car"`	Graph-based smooth patterns.	Polynomial, rate \(-2\) in \(\|\omega\|\) near the cut-off. `rho` close to `1` pushes the cut-off lower in frequency. Strictly positive definite for any `rho < 1`. Supports the FFT path on regular grids.
`"graph_laplacian"`	Graph-based local variation: sharp differences between neighbouring spots.	High-pass; spectrum grows like \(\|\omega\|^2\) at low frequency. Pairs with CAR. Use when you care about boundaries or textures rather than smooth gradients.
`"moran"`	Classical autocorrelation.	Indefinite spectrum, suffers from spectral cancellation. Avoid for SVG detection. Available for backwards comparison with legacy methods. See Theoretical Results Theorem 2.

Spectral decay rate#

The kernel spectrum \(\lambda(\omega)\) is a frequency filter: \(Q(f) = \sum_\omega |\hat{f}_\omega|^2 \lambda(\omega)\). How \(\lambda\) falls off at large \(|\omega|\) decides which spatial frequencies the test still has power for.

Two regimes show up across the kernels in this library:

Exponential decay (Gaussian). \(\lambda(\omega) \propto \exp(-\sigma^2 |\omega|^2 / 2)\), where \(\sigma\) is the bandwidth. Power on a frequency \(\omega\) collapses to numerical zero once \(|\omega| \gtrsim 1/\sigma\). Anything finer than that scale is invisible to the test.
Polynomial decay (Matérn, CAR). Power falls off as a fixed power of \(|\omega|\), so even modes well past the cut-off keep some weight. The test loses power gradually rather than abruptly.

For dense rasterised data (Visium HD, imaging), interesting biology often shows up at fine scales, so the polynomial tail of Matérn or CAR usually wins over Gaussian even when the dominant signal is smooth.

How decay rates depend on hyper-parameters:

Kernel	Spectrum (large \(\|\omega\|\), dimension \(d\))	Hyper-parameters
Gaussian	\(\lambda(\omega) \propto e^{-\sigma^{2}\|\omega\|^{2}/2}\)	`bandwidth` \(\sigma\) sets the cut-off frequency \(\sim 1/\sigma\). There is no smoothness knob; decay is always exponential in \(\|\omega\|^{2}\).
Matérn	\(\lambda(\omega) \propto \|\omega\|^{-(2\nu + d)}\)	`nu` \(\nu\) controls the polynomial decay rate (larger \(\nu\) = faster decay = smoother). `bandwidth` \(\sigma\) sets the cut-off \(\sim 1/\sigma\). `nu=1.5` in 2-D gives rate \(-5\); `nu=0.5` (exponential covariance) gives rate \(-3\); `nu` \(\to \infty\) recovers Gaussian.
CAR (regular grid)	\(\lambda(\omega) \propto \|\omega\|^{-2}\) past the cut-off	`rho` \(\rho\) controls the cut-off frequency \(\sim \sqrt{(1-\rho)/\rho}\). `rho` close to `1` places the cut-off near DC, which puts most weight on large-scale structure but still keeps the \(\|\omega\|^{-2}\) tail. `neighbor_degree` shifts the very-high-frequency behaviour through the adjacency symbol, but the asymptotic rate stays \(\|\omega\|^{-2}\).
Graph Laplacian (regular grid)	\(\lambda(\omega) \propto \|\omega\|^{2}\) at low \(\|\omega\|\)	High-pass with no smoothing knob. `neighbor_degree` controls how local “local” is.

Tuning the hyper-parameters#

Bandwidth (Gaussian, Matérn). A reasonable starting value is the median pairwise distance:

import numpy as np
from scipy.spatial.distance import pdist

bandwidth0 = float(np.median(pdist(coords)))
kernel = NUFFTKernel(coords, method="matern", bandwidth=bandwidth0, nu=1.5)

ρ (CAR). Larger rho means stronger smoothing.

rho = 0.5: moderate smoothing.
rho = 0.9: strong smoothing (the default).
rho close to 1: maximum smoothing. The spectrum becomes very heavy-tailed.

k_neighbors (graph kernels). Larger k means a denser graph and more global patterns.

k between 5 and 10: sparse, local.
k = 30 or more: dense, global.

If you sweep a small grid of values, look at how \(Q\) and the trace move with the parameter. Neither metric replaces a held-out validation set, but together they highlight runs where the kernel collapses onto a single mode.

Custom kernels#

Two extension points cover almost everything: a custom matrix and a custom subclass.

1. Custom adjacency or distance matrix

import numpy as np
from quadsv.kernels import MatrixKernel

# Build a CAR precision from a custom adjacency
W = ...                                        # (n, n) adjacency
precision = np.eye(W.shape[0]) - 0.9 * W       # I - ρW
kernel = MatrixKernel.from_matrix(
    precision, method="car", is_precision=True
)

2. Subclass an ABC from quadsv.kernels

Backend authors get two extension points, both in quadsv.kernels:

quadsv.kernels.Kernel is the universal ABC. Subclass it for any custom kernel that you can express through a single self._K buffer.
quadsv.kernels.MatrixKernelBase is the matrix-family base used by MatrixKernel. Subclass it if you need the dense / sparse / sparse-precision auto-switching machinery for a new matrix backend.

Neither ABC is re-exported from the top-level quadsv namespace. Always import them through quadsv.kernels.

The Kernel ABC ships default implementations of Kx(), xtKx(), xtKy(), trace(), square_trace(), and eigenvalues() that all read off the single self._K buffer. Override _build_kernel() to plug in your own matrix:

import numpy as np
from quadsv.kernels import Kernel

class MyKernel(Kernel):
    def _build_kernel(self):
        # Return any (n, n) symmetric PSD matrix.
        return self.params["K"]

K = np.eye(100)
kernel = MyKernel(n=100, method="custom", K=K)

Override Kx() only if you have a faster operator than K @ x. That is what the FFT and NUFFT backends do.