(path-filtering)=
# Activity 7: Large clusters and path filtering

So far you have seen that multiple scattering calculations with MsSpec can use two different algorithms: *matrix inversion* or the *Rehr-Albers series expansion*. When matrix inversion becomes impossible because the kinetic energy is too high and the number of atoms is too large, the series expansion is the alternative. This algorithm requires very little memory, but it processes the scattering paths sequentially, which can lead to very long calculation times.

In this activity, we will explore how to configure MsSpec to reduce this calculation time when computing the signal from a deep emitter in Si(001).



:::

## The number of scattering paths

To fix ideas, we will first evaluate how many scattering paths need to be computed for an emitter in the subsurface (2{sup}`nd` plane) or in the bulk (7{sup}`th` plane) of a Si(001) cluster.

::::{tab-set}

:::{tab-item} <i class="fa-solid fa-circle-question"></i> Quiz
Create a Si(001) cluster, 40 Å in diameter and 2 planes thick, with the emitter in the subsurface plane (atoms below the emitter can be ignored at high kinetic energies).

1. For an emitter in the subsurface, we can use single scattering (see {ref}`forward-scattering`). How many paths would be generated for this calculation?

2. Same question for an emitter in the 7{sup}`th` plane. If each scattering path could be processed in only 1 µs, how long would such a calculation take?

```{note}
Remember that:
1. For an emitter in plane $p$, the scattering order has to be at least the number of planes *above* the emitter.
2. The number of scattering paths of order $n$ corresponds to the number of ways of arranging up to $n$ atoms (taking order into account).
```


::::

:::{toggle}

To get the total number of paths generated by a cluster of $N$ atoms up to order $M$, use the following formula:

```{math}
:label: eq-nbpaths
\sum_{i=0}^{i=M} (N-1)^i
```
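The formula above can be evaluated with a few lines of Python. This is only a minimal sketch: the function name `n_paths` and the constants are illustrative choices, not part of MsSpec, and the 1 µs per path is the assumption used throughout this activity.

```python
def n_paths(n_atoms, max_order):
    """Total number of scattering paths up to max_order for n_atoms atoms."""
    # Sum of (N-1)^i for i = 0..M, as in the formula above
    return sum((n_atoms - 1) ** i for i in range(max_order + 1))


TIME_PER_PATH = 1e-6                    # seconds per path (assumption)
SECONDS_PER_YEAR = 365.25 * 24 * 3600

n_atoms = 739                           # cluster size quoted in the figure caption
for order in range(1, 7):
    paths = n_paths(n_atoms, order)
    seconds = paths * TIME_PER_PATH
    print(f"order {order}: {paths:.3e} paths, {seconds:.3e} s "
          f"({seconds / SECONDS_PER_YEAR:.3e} years)")
```

For 739 atoms at 6{sup}`th` order, this estimate is on the order of $10^{17}$ paths, which corresponds to thousands of years at 1 µs per path.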

:::{figure-md} nbpaths-fig
<img src="fig1.jpg" alt="path filtering" width="600px" align="center">

The time needed to compute all scattering paths for increasing cluster size and scattering order (up to 6{sup}`th` order with 739 atoms). One path is assumed to be calculated within 1 µs.
:::

:::

## Path filtering in MsSpec

As you may expect, not all paths contribute significantly to the total intensity. This is why we can filter out some scattering paths and drastically reduce the computation time. MsSpec offers several filters for this. The 3 most common filters are:
1. the `forward_scattering` filter, which accepts only paths in which each scattering angle lies within a cone of defined aperture
2. the `backward_scattering` filter, which is similar to the previous one but for the backscattering direction
3. the `distance` filter, which rejects all paths longer than a threshold distance

The following figure illustrates the effect of these filters on scattering paths.

:::{figure-md} filters-fig
<img src="filters.jpg" alt="path filtering" width="600px" align="center">

Some examples of scattering paths with the `forward_scattering`, `backward_scattering` and `distance` filters selected. The accepted forward angle is 45°, the accepted backscattering angle is 20° and the threshold distance is $6a_0$, where $a_0$ is the lattice parameter. Note that the yellow path is rejected, but it could have been accepted if the `off_cone_events` option had been set to a value > 1.
:::
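To make the geometric meaning of these three filters concrete, here is a toy sketch in plain Python. It does not use or reproduce the MsSpec implementation: the helper names (`scattering_angles`, `path_length`, `accept`) are hypothetical, the cone apertures are assumed to be half-angles measured from the forward (0°) and backward (180°) directions, and the `off_cone_events` option is deliberately ignored.

```python
import math


def scattering_angles(path):
    """Scattering angle in degrees at each intermediate point of a path."""
    angles = []
    for a, b, c in zip(path, path[1:], path[2:]):
        u = [q - p for p, q in zip(a, b)]   # incoming direction at b
        v = [q - p for p, q in zip(b, c)]   # outgoing direction at b
        dot = sum(x * y for x, y in zip(u, v))
        norm = math.hypot(*u) * math.hypot(*v)
        cos_angle = max(-1.0, min(1.0, dot / norm))  # guard against rounding
        angles.append(math.degrees(math.acos(cos_angle)))
    return angles


def path_length(path):
    """Total length of the path (sum of its segment lengths)."""
    return sum(math.dist(a, b) for a, b in zip(path, path[1:]))


def accept(path, forward_deg=45.0, backward_deg=20.0, max_length=None):
    """Return True if the path passes the toy angle and distance filters."""
    for angle in scattering_angles(path):
        in_forward_cone = angle <= forward_deg
        in_backward_cone = angle >= 180.0 - backward_deg
        if not (in_forward_cone or in_backward_cone):
            return False
    if max_length is not None and path_length(path) > max_length:
        return False
    return True


# Emitter at the origin, one scatterer straight above, then straight on
straight = [(0, 0, 0), (0, 0, 1), (0, 0, 2)]
# Same scatterer, but the electron leaves at 90 degrees
bent = [(0, 0, 0), (0, 0, 1), (1, 0, 1)]
print(accept(straight), accept(bent))
```

Here a path is simply a list of 3D points (emitter first, then the successive scatterers). A real filter would also need to account for the direction toward the detector after the last scatterer.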

## Application to a deep plane in a Si(001) sample

The following script will compute the contribution of a Si(2p) atom in the 4{sup}`th` plane of a Si(001) cluster at scattering order 3.

With all scattering paths taken into account, this calculation took 15 minutes.

(msd-paper)=
:::{seealso}
Based on this paper by S. Tricot *et al.*
[J. Electron. Spectrosc. Relat. Phenom. **256** 147176 (2022)](https://doi.org/10.1016/j.elspec.2022.147176)
:::


::::{tab-set}

:::{tab-item} <i class="fa-solid fa-circle-question"></i> Quiz

The following script is almost complete. Try to define the path filtering options (no backscattering, accept all paths with forward angles < 40° and reject paths longer than the diameter of the cluster).

```{literalinclude} Si001.py
:lineno-match:
:emphasize-lines: 37-41
```

1. How long did your calculation take?
2. How does it compare to the calculation with **all** scattering paths up to order 3?
3. What proportion of the scattering paths of order 3 were actually taken into account?

:::

::::

```{toggle}
The calculation took a few seconds and the result is very close to that of the calculation with all scattering paths.

Only 0.01% of the 3{sup}`rd` order paths were actually taken into account.

:::{figure-md} si-fig
<img src="results.png" alt="Si polar scan" width="600px" align="center">

Si(2p) polar scan: contribution of an emitter in the 4{sup}`th` plane with all 7 114 945 scattering paths taken into account (orange curve) and with only 1525 filtered paths (blue curve).

:::

:::{literalinclude} Si001_completed.py
:lineno-match:
:emphasize-lines: 37-41
:::

``` 