pysiglib.sig_kernel_gram

pysiglib.sig_kernel_gram#

Added in version v0.2.1.

sig_kernel_gram(path1, path2, dyadic_order, static_kernel=None, time_aug=False, lead_lag=False, end_time=1.0, n_jobs=1, max_batch=-1, return_grid=False)[source]#

Given batches of paths \(\{x_i\}_{i=1}^B\) and \(\{y_i\}_{i=1}^B\), computes the gram matrix of signature kernels

\[G = (k_{x_i, y_j})_{i,j = 1}^B.\]

The signature kernel of two \(d\)-dimensional paths \(x,y\) is defined as

\[k_{x,y}(s,t) := \left< S(x)_{[0,s]}, S(y)_{[0, t]} \right>_{T((\mathbb{R}^d))}\]

where the inner product is defined as

\[\left< A, B \right> := \sum_{k=0}^{\infty} \left< A_k, B_k \right>_{\left(\mathbb{R}^d\right)^{\otimes k}}\]
\[\left< u, v \right>_{\left(\mathbb{R}^d\right)^{\otimes k}} := \prod_{i=1}^k \left< u_i, v_i \right>_{\mathbb{R}^d}.\]

Optionally, a static kernel can be specified. For details, see the documentation on static kernels.

Parameters:
  • path1 (numpy.ndarray | torch.tensor) – The first underlying path or batch of paths, given as a numpy.ndarray or torch.tensor. For a single path, this must be of shape (length_1, dimension). For a batch of paths, this must be of shape (batch_size_1, length_1, dimension).

  • path2 (numpy.ndarray | torch.tensor) – The second underlying path or batch of paths, given as a numpy.ndarray or torch.tensor. For a single path, this must be of shape (length_2, dimension). For a batch of paths, this must be of shape (batch_size_2, length_2, dimension).

  • dyadic_order (int | tuple) – If set to a positive integer \(\lambda\), will refine the paths by a factor of \(2^\lambda\). If set to a tuple of positive integers \((\lambda_1, \lambda_2)\), will refine the first path by \(2^{\lambda_1}\) and the second path by \(2^{\lambda_2}\).

  • static_kernel (None | pysiglib.StaticKernel) – Static kernel. If None (default), the linear kernel will be used. For details, see the documentation on static kernels.

  • time_aug (bool) – If set to True, will compute the signature of the time-augmented path, \(\hat{x}_t := (t, x_t)\), defined as the original path with an extra channel set to time, \(t\). This channel spans \([0, t_L]\), where \(t_L\) is given by the parameter end_time.

  • lead_lag (bool) – If set to True, will compute the signature of the path after applying the lead-lag transformation.

  • end_time (float) – End time for time-augmentation, \(t_L\).

  • n_jobs (int) – (Only applicable to CPU computation) Number of threads to run in parallel. If n_jobs = 1, the computation is run serially. If set to -1, all available threads are used. For n_jobs below -1, (max_threads + 1 + n_jobs) threads are used. For example if n_jobs = -2, all threads but one are used.

  • max_batch (int) – Maximum batch size to run in parallel. If the computation is failing due to insufficient memory, this parameter should be decreased. If set to -1, the entire batch is computed in parallel.

  • return_grid (bool) – If True, returns the entire PDE grid.

Returns:

Gram matrix of signature kernels

Return type:

numpy.ndarray | torch.tensor

Note

Ideally, any array passed to pysiglib.sig_kernel_gram should be both contiguous and own its data. If this is not the case, pysiglib.sig_kernel_gram will internally create a contiguous copy, which may be inefficient.

Note

When called via pysiglib.torch_api, the default behaviour is to reconstruct the PDE grids during backpropagation. This is done to avoid memory allocation issues for large batch sizes.


Citation#

If you found this library useful in your research, please consider citing the paper:

@article{shmelev2025pysiglib,
  title={pySigLib-Fast Signature-Based Computations on CPU and GPU},
  author={Shmelev, Daniil and Salvi, Cristopher},
  journal={arXiv preprint arXiv:2509.10613},
  year={2025}
}