From 180898a6b0b2c3eaad75417d88cab94f36f7d1d3 Mon Sep 17 00:00:00 2001
From: rnhmjoj <rnhmjoj@inventati.org>
Date: Fri, 29 May 2020 19:32:03 +0200
Subject: [PATCH] ex-4: review

---
 notes/sections/4.md | 183 +++++++++++++++++++++++---------------------
 1 file changed, 94 insertions(+), 89 deletions(-)

diff --git a/notes/sections/4.md b/notes/sections/4.md
index 80775df..db38cfd 100644
--- a/notes/sections/4.md
+++ b/notes/sections/4.md
@@ -1,16 +1,16 @@
 # Exercise 4
 
-## Kinematic dip PDF derivation
+## Kinematic dip derivation
 
-Consider a great number of non-interacting particles, each of which with a
-random momentum $\vec{P}$ with module between 0 and $P_{\text{max}}$ randomly
-angled with respect to a coordinate system {$\hat{x}$, $\hat{y}$, $\hat{z}$}.
-Once the polar angle $\theta$ is defined, the momentum vertical and horizontal
-components of a particle, which will be referred as $\vec{P_v}$ and $\vec{P_h}$
-respectively, are the ones shown in @fig:components.  
-If $\theta$ is evenly distributed on the sphere and the same holds for the
-module $P$, which distribution will the average value of $|P_v|$ follow as a
-function of $P_h$?
+Consider a great number of non-interacting particles having random momenta
+$\vec{P}$, with magnitude between 0 and $P_{\text{max}}$, at an angle $\theta$
+wrt to some coordinate system ($\hat{x}$, $\hat{y}$, $\hat{z}$).
+The vertical and horizontal components of a particle momentum, which will be
+referred as $\vec{P_v}$ and $\vec{P_h}$ respectively, are shown in
+@fig:components.  
+If $\theta$ is uniformly distributed on the unit sphere and $P$ is uniformly
+distributed in $[0, P^\text{max}]$, what will be the average value
+$|P_v|$ of the particles with a given $P_h$?
 
 \begin{figure}
 \hypertarget{fig:components}{%
@@ -41,10 +41,11 @@ function of $P_h$?
 }
 \end{figure}
 
+
 Since the aim is to compute $\langle |P_v| \rangle (P_h)$, the conditional
 distribution probability of $P_v$ given a fixed value of $P_h = x$ must first
 be determined. It can be computed as the ratio between the probability of
-getting a fixed value of $P_v$ given $x$ over the total probability of
+getting a fixed value of $P_v$ given $x$ to the total probability of
 getting that $x$:
 $$
   f (P_v | P_h = x) = \frac{f_{P_h , P_v} (x, P_v)}
@@ -52,137 +53,141 @@ $$
   = \frac{f_{P_h , P_v} (x, P_v)}{I}
 $$
 
-where $f_{P_h , P_v}$ is the joint pdf of the two variables $P_v$ and $P_h$ and
+where $f_{P_h , P_v}$ is the joint PDF of the two variables $P_v$ and $P_h$ and
 the integral $I$ runs over all the possible values of $P_v$ given a certain
 $P_h$.  
-$f_{P_h , P_v}$ can simply be computed from the joint pdf of $\theta$ and $P$
-with a change of variables. For the pdf of $\theta$ $f_{\theta} (\theta)$, the
+$f_{P_h , P_v}$ can simply be computed from the joint PDF of $\theta$ and $P$
+with a change of variables. For the PDF of $\theta$ $f_{\theta} (\theta)$, the
 same considerations done in @sec:3 lead to:
 $$
   f_{\theta} (\theta) = \frac{1}{2} \sin{\theta} \chi_{[0, \pi]} (\theta)
 $$
 
-whereas, being $P$ evenly distributed:
+whereas, being $P$ uniform:
 $$
   f_P (P) = \chi_{[0, P_{\text{max}}]} (P)
 $$
 
 where $\chi_{[a, b]} (y)$ is the normalized characteristic function which value
 is $1/N$ between $a$ and $b$ (where $N$ is the normalization term) and 0
-elsewhere. Being a couple of independent variables, their joint pdf is simply
-given by the product of their pdfs:
+elsewhere. Since $P,\theta$ are independent variables, their joint PDF is
+simply given by the product:
 $$
   f_{\theta , P} (\theta, P) = f_{\theta} (\theta) f_P (P)
   = \frac{1}{2} \sin{\theta} \chi_{[0, \pi]} (\theta)
     \chi_{[0, P_{\text{max}}]} (P)
 $$
+and they are related to the vertical and horizontal components
+by a standard polar coordinate transformation:
 
-Given the new variables:
-$$
+\begin{align*}
   \begin{cases}
     P_v = P \cos(\theta) \\
     P_h = P \sin(\theta)
   \end{cases}
-$$
-
-with $\theta \in [0, \pi]$, the previous ones can be written as:
-$$
+  &&
   \begin{cases}
     P = \sqrt{P_v^2 + P_h^2} \\
-    \theta = \text{atan}_2 ( P_h/P_v ) :=
-      \begin{cases}
-        \text{atan} ( P_h/P_v )       &\incase P_v > 0 \\
-        \text{atan} ( P_h/P_v ) + \pi &\incase P_v < 0
-      \end{cases} 
+    \theta = \text{atan2}(P_h, P_v)
   \end{cases}
+\end{align*}
+
+where:
+
+- $\theta \in [0, \pi]$,
+
+- and atan2 is defined by:
+$$
+\begin{cases}
+  \arctan(P_h/P_v) &\incase P_v > 0 \\
+  \arctan(P_h/P_v) + \pi &\incase P_v < 0 \\
+  \pi/2 &\incase P_v = 0
+\end{cases}
 $$
 
-which can be shown having Jacobian:
+The Jacobian of the inverse transformation is easily found to be:
 $$
-  J = \frac{1}{\sqrt{P_v^2 + P_h^2}}
+  |J^{-1}| = \frac{1}{\sqrt{P_v^2 + P_h^2}}
+$$
+Hence, the PDF written in the new coordinates is:
 $$
-
-Hence:
-$$  
   f_{P_h , P_v} (P_h, P_v) =
-    \frac{1}{2} \sin[ \text{atan}_2 ( P_h/P_v )]
-    \chi_{[0, \pi]} (\text{atan}_2 ( P_h/P_v )) \cdot \\
-    \frac{\chi_{[0, p_{\text{max}}]} \left( \sqrt{P_v^2 + P_h^2} \right)}
+    \frac{1}{2} \sin\left[ \text{atan2}(P_h, P_v) \right]
+    \chi_{[0, \pi]} \left[\text{atan2}(P_h, P_v)\right] \cdot \\
+    \frac{\chi_{[0, p_{\text{max}}]} \left(\sqrt{P_v^2 + P_h^2} \right)}
     {\sqrt{P_v^2 + P_h^2}}
 $$
 
-from which, the integral $I$ can now be computed. The edges of the integral
-are fixed by the fact that the total momentum can not exceed $P_{\text{max}}$:
+The integral $I$ can now be computed. Note that the domain
+is implicit in the characteristic function:
 $$
-  I = \int
-  \limits_{- \sqrt{P_{\text{max}}^2 - P_h}}^{\sqrt{P_{\text{max}}^2 - P_h}}
-  dP_v \, f_{P_h , P_v} (x, P_v)
+  I(x) = \int_{-\infty}^{+\infty} dP_v \, f_{P_h , P_v} (x, P_v)
+       = \int \limits_{- \sqrt{P_{\text{max}}^2 - P_h}}
+         ^{\sqrt{P_{\text{max}}^2 - P_h}}
+         dP_v \, f_{P_h , P_v} (x, P_v)
 $$
 
-after a bit of maths, using the identity:
+With some basic calculus and the identity
 $$
-  \sin[ \text{atan}_2 ( P_h/P_v )] = \frac{P_h}{\sqrt{P_h^2 + P_v^2}}
+  \sin[ \text{atan2}(P_h, P_v)] = \frac{P_h}{\sqrt{P_h^2 + P_v^2}},
 $$
-
-and the fact that both the characteristic functions play no role within the
-integral limits, the following result arises:
+the integral can be evaluated to give
 $$
-  I = 2 \, \text{atan} \left( \sqrt{\frac{P_{\text{max}}^2}{x^2} - 1} \right)
+  I = 2 \, \arctan \left( \sqrt{\frac{P_{\text{max}}^2}{x^2} - 1} \right),
 $$
-
 from which:
 $$
-  f (P_v | P_h = x) = \frac{x}{P_v^2 + x^2} \cdot
-  \frac{1}{2 \, \text{atan}
+  f (P_v | P_h = x) =
+  \frac{x}{P_v^2 + x^2} \cdot
+  \frac{\chi_{[0, \pi]} \left[\text{atan2}(P_h, P_v)\right]
+    \chi_{[0, p_{\text{max}}]} \left(\sqrt{P_v^2 + P_h^2}\right)}{2 \, \arctan
            \left( \sqrt{\frac{P_{\text{max}}^2}{x^2} - 1} \right)}
 $$
 
 Finally, putting all the pieces together, the average value of $|P_v|$ can be
 computed:
 $$
-  \langle |P_v| \rangle = \int
-  \limits_{- \sqrt{P_{\text{max}}^2 - P_h}}^{\sqrt{P_{\text{max}}^2 - P_h}}
-  f (P_v | P_h = x) = [ \dots ]
-  = x \, \frac{\ln \left( \frac{P_{\text{max}}}{x} \right)}
-  {\text{atan} \left( \sqrt{ \frac{P^2_{\text{max}}}{x^2} - 1} \right)}
+  \langle |P_v| \rangle(x)
+  = \int P_v f (P_v | P_h = x) dP_v
+  = \frac{x \ln \left( \frac{P_{\text{max}}}{x} \right)}
+  {\arctan \left( \sqrt{ \frac{P^2_{\text{max}}}{x^2} - 1} \right)}
 $$ {#eq:dip}
 
-Namely:
+The result is plotted in the figure below:
 
-![Plot of the expected distribution with
+![Plot of the expected dependence of $\langle |P_v| \rangle$ with
   $P_{\text{max}} = 10$.](images/4-expected.pdf){#fig:plot}
 
 
 ## Monte Carlo simulation
 
-The same distribution should be found by generating and binning points in a
-proper way. A number of $N = 50000$ points were generated as a couple of values
-($P$, $\theta$), with $P$ evenly distributed between 0 and $P_{\text{max}}$ and
-$\theta$ given by the same procedure described in @sec:3, namely:
+This dependence should be found by running a Monte Carlo simulation and
+computing a binned average of the vertical momentum. A number of $N = 50000$
+particles were generated as pair of values ($P$, $\theta$), with $P$
+uniformly distributed between 0 and $P_{\text{max}}$ and $\theta$ given by the
+same procedure described in @sec:3, namely:
 $$
-  \theta = \text{acos}(1 - 2x)
+  \theta = \arccos(1 - 2x)
 $$
 
-with $x$ uniformly distributed between 0 and 1.  
-The data binning turned out to be a bit challenging. Once $P$ was sampled and
-$P_h$ was computed, the bin containing the latter's value must be found. If $n$
-is the number of bins in which the range $[0, P_{\text{max}}]$ is divided into,
-then the width $w$ of each bin is given by:
+where $x$ is uniformly distributed between 0 and 1.  
+The binning turned out to be quite a challenge: once a $P$ is sampled and
+$P_h$ computed, the bin containing the latter has to be found. If
+the range $[0, P_{\text{max}}]$ is divided into $n$ equal bins
+of the width
 $$
   w = \frac{P_{\text{max}}}{n}
 $$
-
-and the $i^{th}$ bin in which $P_h$ goes in is:
+then (counting from zero) $P_h$ goes into the $i$-th bin where
 $$
-  i = \text{floor} \left( \frac{P_h}{w} \right)
+  i = \left\lfloor \frac{P_h}{w} \right\rfloor
 $$
 
-where 'floor' is the function which gives the bigger integer smaller than its
-argument and the bins are counted starting from zero.  
-Then, a vector in which the $j^{\text{th}}$ entry contains both the sum $S_j$
-of all the $|P_v|$s relative to each $P_h$ fallen into the $j^{\text{th}}$ bin
-itself and the number num$_j$ of the bin entries was iteratively updated. At
-the end, the average value of $|P_v|_j$ was computed as $S_j / \text{num}_j$.  
+Then, the sum $S_j$ of all the $|P_v|$ values relative to the $P_h$ of the
+$j$-th bin itself and number num$_j$ of the bin counts are stored in an array
+and iteratively updated.  Once every bin has been updated, the average value of
+$|P_v|_j$ is computed as $S_j / \text{num}_j$.
+
 For the sake of clarity, for each sampled couple the procedure is the
 following. At first $S_j = 0 \wedge \text{num}_j = 0 \, \forall \, j$, then:
 
@@ -196,21 +201,21 @@ For $P_{\text{max}} = 10$ and $n = 50$, the following result was obtained:
 
 ![Sampled points histogram.](images/4-dip.pdf)
 
-In order to check whether the expected distribution (@eq:dip) properly matches
-the produced histogram, a chi-squared minimization was applied. Being a simple
-one-parameter fit, the $\chi^2$ was computed without a suitable GSL function
-and the error of the so obtained estimation of $P_{\text{max}}$ was given as
-the inverse of the $\chi^2$ second derivative in its minimum, according to the
-Cramér-Rao bound.
+In order to assert the compatibility of the expected function (@eq:dip)
+with the histogram, a least squares minimization was applied. Being a simple
+one-parameter fit, the $\chi^2$ was implemented manually and minimised
+without using a general LSQ routine. The error of the estimation of
+$P_{\text{max}}$ was computed as the inverse of the $\chi^2$ second derivative
+at the minimum, according to the Cramér-Rao bound.
 
 The following results were obtained:
 
-$$
-  P^{\text{oss}}_{\text{max}} = 10.005 \pm 0.018 \with \chi_r^2 = 0.071
-$$
+\begin{align*}
+  P^{\text{oss}}_{\text{max}} = 10.005 \pm 0.018 && \chi^2 &= 0.071 \\
+  && \text{P}(x > \chi^2) &= 0.79
+\end{align*}
 
-where $\chi_r^2$ is the $\chi^2$ per degree of freedom, proving a good
-convergence.  
+The $\chi^2$ and $p$-value show a very good agreement.
 In order to compare $P^{\text{oss}}_{\text{max}}$ with the expected value
 $P_{\text{max}} = 10$, the following compatibility $t$-test was applied:
 
@@ -228,8 +233,8 @@ In this case:
   - p = 0.768
 
 which allows to assert that the sampled points actually follow the predicted
-distribution. In @fig:fit, the fit function superimposed on the histogram is
+function. In @fig:fit, the fit function superimposed on the histogram is
 shown.
 
 ![Fitted sampled data. $P^{\text{oss}}_{\text{max}} = 10.005
-  \pm 0.018$, $\chi_r^2 = 0.071$.](images/4-fit.pdf){#fig:fit}
+  \pm 0.018$, $\chi^2 = 0.071$.](images/4-fit.pdf){#fig:fit}