Introduction
============

Objective
---------

The ``mrinversion`` package solves the linear inverse problems involving Fredholm integrals of
the first kind, which are often encounted in magnetic resonance. The package currently
supports inversion of

- pure shielding anisotropic NMR lineshape into a distribution of the second-rank traceless
  symmetric shielding tensor principal components, and
- NMR relaxometry measurement into a distribution of relaxation parameters.

.. whose frequency
.. contributions are assumed to arise predominantly from the second-rank traceless
.. symmetric tensors.

**Pure shielding anisotropic lineshape inversion**

In the case NMR lineshape inversion, the pure anisotropic frequency spectra corresponds to the
cross-sections of the 2D isotropic *v.s.* anisotropic correlation spectrum, such as the
2D One Pulse (TOP) MAS, phase adjusted spinning sidebands (PASS), magic-angle turning
(MAT), extended chemical shift modulation (XCS), magic-angle hopping (MAH), magic-angle
flipping (MAF), and Variable Angle Correlation Spectroscopy (VACSY). A key feature of all
these 2D isotropic/anisotropic correlation spectra—--either as acquired or after a shear
transformation—--is that the anisotropic cross-section can be modeled as a linear
combination of subspectra,

.. math::
    :label: eq_0a

    s(\nu| \delta_\text{iso}) = \int_{\bf R} \mathcal{K}(\nu, {\bf R}) f({\bf R} | \delta_\text{iso}) d{\bf R},

where :math:`s(\nu| \delta_\text{iso})` is the observed anisotropic cross-section at a
given isotropic shift, :math:`\delta_\text{iso}`, :math:`\mathcal{K}(\nu, {\bf R})` represents
a simulated subspectrum of a nuclear spin system with a given set of parameters, :math:`{\bf R}`,
and :math:`f({\bf R} | \delta_\text{iso})` is the probability of the respective set of
parameters. In Eq. :eq:`eq_0a`, :math:`{\bf R}` represents the anisotropic and asymmetry
parameters of the shielding tensor.

**Relaxometry inversion**

For inversion of relaxometry measurements, the signal growth or decay from spin relaxation is
modeled as,

.. math::
    :label: eq_0b

    s(t| \nu) = \int_{\bf R} \mathcal{K}(t, {\bf R}) f({\bf R} | \nu) d{\bf R},

where :math:`s(t, \nu)` is the observed signal relaxation at a given frequency cross-section,
:math:`\nu`, :math:`\mathcal{K}(t, {\bf R})` is a simulated kernel of spin relaxation
with a given set of parameters, :math:`{\bf R}`, and :math:`f({\bf R} | \nu)` is the probability
of the respective set of parameters. In Eq. :eq:`eq_0b`, :math:`{\bf R}` represents the relaxation
parameters---:math:`T_1`, :math:`T_2`.

Note, both Eq. :eq:`eq_0a`, and Eq. :eq:`eq_0b` are Fredholm integrals of the first kind.

..  and the inverse of the forward
.. computation, i.e., calculating :math:`f({\bf R})` from :math:`s(\nu| \delta_\text{iso})`, is often
.. an ill-posed problem.

.. When expressed in a matrix notation, Eq. :eq:`eq_0` corresponds to

.. .. math::
..     :label: eq_0_matirx

..     {\bf s} = {\bf K \cdot f},


Generic Linear problem
----------------------

Linear inverse problems on Fredholm integral of the first kind are frequently
encountered in the scientific community and have the following generic form

.. math::
   :label: eq_1

    {\bf s} = {\bf K \cdot f},

where :math:`{\bf K} \in \mathbb{R}^{m\times n}` is the transforming kernel (matrix),
:math:`{\bf f} \in \mathbb{R}^n` is the unknown and desired solution, and
:math:`{\bf s} \in \mathbb{R}^m` is the known signal, which includes the
measurement noise. When the matrix :math:`{\bf K}` is non-singular and :math:`m=n`,
the solution to the problem in Eq. :eq:`eq_1` has a simple closed-form solution,

.. math::
    :label: eq_2

    {\bf f} = {\bf K}^{-1} \cdot {\bf s}.

The deciding factor whether the solution :math:`{\bf f}` exists in Eq. :eq:`eq_2`
is whether or not the kernel :math:`{\bf K}` is invertible.
Often, most scientific problems with practical applications suffer from singular,
near-singular, or ill-conditioned kernels, where :math:`{\bf K}^{-1}` doesn't exist.
Such types of problems are termed as *ill-posed*. The inversion of a purely anisotropic
NMR spectrum to the distribution of the tensorial parameters is one such ill-posed
problem.


Regularized linear problem
''''''''''''''''''''''''''

A common approach in solving ill-posed problems is to employ the regularization
methods of form

.. math::
    :label: eq_3

    {\bf f^\dagger} = \underset{{\bf f} > 0}{\text{argmin}} \left(
        \|{\bf K \cdot f} - {\bf s}\|^2_2 + g({\bf f})
    \right),

where :math:`\|{\bf z}\|_2` is the *l-2* norm of :math:`{\bf z}`, :math:`g({\bf f})`
is the regularization term, and :math:`{\bf f}^\dagger` is the regularized solution.
The choice of the regularization term, :math:`g({\bf f})`, is often based on prior
knowledge of the system for which the linear problem is defined. For anisotropic NMR
spectrum inversion, we choose the smooth-LASSO regularization.

.. Elastic net regularization
.. ''''''''''''''''''''''''''

.. When the matrix, :math:`{\bf J}_i`, in Eq. :eq:`slasso` is identity, the regularization
.. term is the elastic net regularization.


.. For example, in a more familiar linear-inverse problem, the inverse Fourier transform, the two dimensions are the frequency and time dimensions, where the frequency dimension undergoes the inverse transformation, and the time dimension is where the inversion method transforms the data.

.. _l1_intro:

l1 regularization
"""""""""""""""""

The l1 regularized linear model minimizes the objective function,

.. math::
    :label: l1

    \| {\bf K \cdot f - s} \|^2_2 + \lambda  \| {\bf f} \|_1 ,

where :math:`\lambda` is the regularization hyperparameter controlling the sparsity
of the solution :math:`{\bf f}`.


.. _smooth_lasso_intro:

Smooth-LASSO regularization
"""""""""""""""""""""""""""

Our prior assumption for the distribution of the tensor parameters is that it should
be smooth and continuous for disordered and sparse and discrete for crystalline
materials. Therefore, we employ the smooth-lasso method, which is a linear model
that is trained with the combined l1 and l2 priors as the regularizer. The method
minimizes the objective function,

.. math::
    :label: slasso

    \| {\bf K \cdot f - s} \|^2_2 + \alpha \sum_{i=1}^{d} \| {\bf J}_i \cdot {\bf f} \|_2^2
                + \lambda  \| {\bf f} \|_1 ,

where :math:`\alpha` and :math:`\lambda` are the hyperparameters controlling the
smoothness and sparsity of the solution :math:`{\bf f}`. The matrix :math:`{\bf J}_i`
typically reflects some underlying geometry or the structure in the true solution. Here,
:math:`{\bf J}_i` is defined to promote smoothness along the :math:`\text{i}^\text{th}`
dimension of the solution :math:`{\bf f}` and is given as

.. math::
    {\bf J}_i = {\bf I}_{n_1} \otimes \cdots \otimes {\bf A}_{n_i}
                \otimes \cdots \otimes {\bf I}_{n_{d}},

where :math:`{\bf I}_{n_i} \in \mathbb{R}^{n_i \times n_i}` is the identity matrix, and
:math:`{\bf A}_{n_i}` is the first difference matrix given as

.. math::
    {\bf A}_{n_i} = \left(\begin{array}{ccccc}
                    1 & -1 & 0 & \cdots & \vdots \\
                    0 & 1 & -1 & \cdots & \vdots \\
                    \vdots & \vdots & \vdots & \vdots & 0 \\
                    0 & \cdots & 0 & 1 & -1
                \end{array}\right) \in \mathbb{R}^{(n_i-1)\times n_i}.

The symbol :math:`\otimes` is the Kronecker product. The terms,
:math:`\left(n_1, n_2, \cdots, n_d\right)`, are the number of points along the
respective dimensions, with the constraint that :math:`\prod_{i=1}^{d}n_i = n`,
where :math:`d` is the total number of dimensions in the solution :math:`{\bf f}`,
and :math:`n` is the total number of features in the kernel, :math:`{\bf K}`.

Understanding the *x-y* plot
----------------------------

A second-rank symmetric tensor, :math:`{\bf S}`, in a three-dimensional space, is
described by three principal components, :math:`s_{xx}`, :math:`s_{yy}`, and
:math:`s_{zz}`, in the principal axis system (PAS). Often, depending on the context of
the problem, the three principal components are expressed with three new parameters
following a convention. One such convention is the Haeberlen convention, which defines
:math:`\delta_\text{iso}`, :math:`\zeta`, and :math:`\eta`, as the isotropic shift,
anisotropy, and asymmetry parameters, respectively. Here, the parameters :math:`\zeta`
and :math:`\eta` contribute to the purely anisotropic frequencies, and determining the
distribution of these two parameters is the focus of this library.

Defining the inverse grid
''''''''''''''''''''''''''

When solving any linear inverse problem, one needs to define an inverse grid before
solving the problem. A familiar example is the inverse Fourier transform, where
the inverse grid is defined following the Nyquist–Shannon sampling theorem. Unlike
inverse Fourier transform, however, there is no well-defined sampling grid for the
second-rank traceless symmetric tensor parameters. One obvious choice is
to define a two-dimensional :math:`\zeta`-:math:`\eta` Cartesian grid.

As far as the inversion problem is concerned, :math:`\zeta` and :math:`\eta`
are just labels for the subspectra. In simplistic terms, the inversion problem solves
for the probability of each subspectrum, from a given pre-defined basis of subspectra,
that describes the observed spectrum. If the subspectra basis is defined over a
:math:`\zeta`-:math:`\eta` Cartesian grid, multiple
:math:`(\zeta, \eta)` coordinates points to the same subspectra. For
example, the subspectra from coordinates :math:`(\zeta, \eta=1)` and
:math:`(-\zeta, \eta=1)` are identical, therefore, distinguishing these
coordinates from the subspectra becomes impossible.

The issue of multiple coordinates pointing to the same object is not new. It is
a common problem when representing polar coordinates in the Cartesian basis. Try describing
the coordinates of the south pole using latitudes and longitudes. You can define the latitude,
but describing longitude becomes problematic. A similar situation arises in the context of
second-rank traceless tensor parameters when the anisotropy goes to zero. You can specify
the anisotropy as zero, but defining asymmetry becomes problematic.

Introducing the :math:`x`-:math:`y` grid
""""""""""""""""""""""""""""""""""""""""

A simple fix to this issue is to define the :math:`(\zeta, \eta)` coordinates
in a polar basis. We, therefore, introduce a piece-wise polar grid representation of the
second-rank traceless tensor parameters, :math:`\zeta`-:math:`\eta`, defined as

.. math::
    :label: zeta_eta_def

    r_\zeta = | \zeta_ | ~~~~\text{and}~~~~
    \theta = \left\{ \begin{array}{l r}
                \frac{\pi}{4} \eta      &: \zeta \le 0, \\
                \frac{\pi}{2} \left(1 - \frac{\eta}{2} \right) &: \zeta > 0.
             \end{array}
            \right.

Because Cartesian grids are more manageable in computation, we re-express the above polar
piece-wise grid as the *x*-*y* Cartesian grid following,

.. math::
    :label: x_y_def

    x = r_\zeta \cos\theta ~~~~\text{and}~~~~ y = r_\zeta \sin\theta.

In the *x*-*y* grid system, the basis subspectra are relatively distinguishable. The
``mrinversion`` library provides a utility function to render the piece-wise polar grid
for your matplotlib figures. Copy-paste the following code in your script.

.. plot::
    :format: doctest
    :context: close-figs
    :include-source:

    >>> import matplotlib.pyplot as plt # doctest: +SKIP
    >>> from mrinversion.utils import get_polar_grids # doctest: +SKIP
    ...
    >>> plt.figure(figsize=(4, 3.5)) # doctest: +SKIP
    >>> ax=plt.gca() # doctest: +SKIP
    >>> # add your plots/contours here.
    >>> get_polar_grids(ax) # doctest: +SKIP
    >>> ax.set_xlabel('x / ppm') # doctest: +SKIP
    >>> ax.set_ylabel('y / ppm') # doctest: +SKIP
    >>> plt.tight_layout() # doctest: +SKIP
    >>> plt.show() # doctest: +SKIP

.. _fig1_introduction:
.. figure:: _static/null.*

    The figure depicts the piece-wise polar :math:`\zeta`-:math:`\eta` grid represented on
    an `x`-`y` grid. The radial and angular grid lines represent the magnitude of
    :math:`\zeta` and :math:`\eta`, respectively. The blue and red shading represents the
    positive and negative values of :math:`\zeta`, respectively. The radian grid lines are
    drawn at every 0.2 ppm increments of :math:`\zeta`, and the angular grid lines are
    drawn at every 0.2 increments of :math:`\eta`. The `x` and `y`-axis are :math:`\eta=0`,
    and the diagonal :math:`x=y` is :math:`\eta=1`.

If you are familiar with the matplotlib library, you may notice that most code lines are
the basic matplotlib statements, except for the line that says *get_polar_grids(ax)*.
The :py:meth:`~mrinversion.utils.get_polar_grids` is a utility function that generates
the piece-wise polar grid for your figures.

Here, the shielding anisotropy parameter, :math:`\zeta`, is the radial dimension,
and the asymmetry parameter, :math:`\eta`, is the angular dimension, defined using Eqs.
:eq:`zeta_eta_def` and :eq:`x_y_def`. The region in blue and red corresponds to the
positive and negative values of :math:`\zeta`, where the magnitude of the anisotropy
increases radially. The *x* and the *y*-axis are :math:`\eta=0` for the negative and positive
:math:`\zeta`, respectively. When moving towards the diagonal from *x* or *y*-axes, the
asymmetry parameter, :math:`\eta`, uniformly increase, where the diagonal is
:math:`\eta=1`.