@@ -75,22 +75,21 @@ Daniel Kirsch describes in~\cite{Kirsch} a system called Detexify which uses
time warping to classify on-line handwritten symbols and claims to achieve a
TOP-3 error of less than $\SI{10}{\percent}$ for a set of $\num{100}$~symbols.
He also published his data on \url{https://github.com/kirel/detexify-data},
-which was collected by a crowd-sourcing approach via
+which was collected by a crowdsourcing approach via
\url{http://detexify.kirelabs.org}. Those recordings as well as some recordings
which were collected by a similar approach via \url{http://write-math.com} were
used to train and evaluate different classifiers. A complete description of
all involved software, data and experiments is given in~\cite{Thoma:2014}.

\section{Steps in Handwriting Recognition}
-The following steps are used in all classifiers which are described in the
-following:
+The following steps are used in many classifiers:

\begin{enumerate}
\item \textbf{Preprocessing}: Recorded data is never perfect. Devices have
- errors and people make mistakes while using devices. To tackle these
- problems there are preprocessing algorithms to clean the data. The
- preprocessing algorithms can also remove unnecessary variations of
- the data that do not help in the classification process, but hide
+ errors and people make mistakes while using the devices. To tackle
+ these problems there are preprocessing algorithms to clean the data.
+ The preprocessing algorithms can also remove unnecessary variations
+ of the data that do not help in the classification process, but hide
what is important. Having slightly different sizes of the same symbol
is an example of such a variation. Four preprocessing algorithms that
clean or normalize recordings are explained in
@@ -117,15 +116,16 @@ following:
improve the performance of learning algorithms.
\end{enumerate}

-After these steps, we are faced with a classification learning task which consists of
-two parts:
+After these steps, we are faced with a classification learning task which
+consists of two parts:
\begin{enumerate}
\item \textbf{Learning} parameters for a given classifier. This process is
also called \textit{training}.
\item \textbf{Classifying} new recordings, sometimes called
\textit{evaluation}. This should not be confused with the evaluation
of the classification performance which is done for multiple
- topologies, preprocessing queues, and features in \Cref{ch:Evaluation}.
+ topologies, preprocessing queues, and features in
+ \Cref{ch:Evaluation}.
\end{enumerate}
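Both parts can be illustrated with a short, self-contained sketch. It uses
scikit-learn's \texttt{MLPClassifier} and random data purely as stand-ins;
it is not the software or data described in~\cite{Thoma:2014}.
\begin{verbatim}
# Toy sketch: "learning" corresponds to fit(), "classifying" to predict().
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.RandomState(0)
X_train = rng.rand(200, 160)             # 200 recordings, 160 features each
y_train = rng.randint(0, 369, 200)       # labels out of 369 symbol classes

clf = MLPClassifier(hidden_layer_sizes=(500,), max_iter=20)
clf.fit(X_train, y_train)                # learning / training

X_new = rng.rand(5, 160)                 # five new recordings
predicted = clf.predict(X_new)           # classifying / evaluation
\end{verbatim}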

The classification learning task can be solved with \glspl{MLP} if the number
@@ -141,7 +141,7 @@ and feature extraction easier, more effective or faster. It does so by resolving
errors in the input data, reducing duplicate information and removing irrelevant
information.

-Preprocessing algorithms fall in two groups: Normalization and noise
+Preprocessing algorithms fall into two groups: Normalization and noise
reduction algorithms.

A very important normalization algorithm in single-symbol recognition is
@@ -157,12 +157,12 @@ Another normalization preprocessing algorithm is resampling. As the data points
on the pen trajectory are generated asynchronously and with different
time-resolutions depending on the used hardware and software, it is desirable
to resample the recordings to have points spread equally in time for every
-recording. This was done with linear interpolation of the $(x,t)$ and $(y,t)$
+recording. This was done by linear interpolation of the $(x,t)$ and $(y,t)$
sequences and getting a fixed number of equally spaced points per stroke.
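A minimal sketch of this resampling step is given below. It assumes a stroke
is given as three equally long lists of $x$, $y$ and time values and uses
\texttt{numpy.interp} for the linear interpolation; the actual implementation
used in~\cite{Thoma:2014} may differ in its details.
\begin{verbatim}
import numpy as np

def resample_stroke(x, y, t, n_points=20):
    """Return n_points (x, y) pairs spread equally in time over the stroke."""
    t = np.asarray(t, dtype=float)
    t_new = np.linspace(t[0], t[-1], n_points)       # equally spaced in time
    x_new = np.interp(t_new, t, np.asarray(x, dtype=float))
    y_new = np.interp(t_new, t, np.asarray(y, dtype=float))
    return list(zip(x_new, y_new))
\end{verbatim}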

\textit{Connect strokes} is a noise reduction algorithm. It happens sometimes
that the hardware detects that the user lifted the pen where the user certainly
-didn't do so. This can be detected by measuring the euclidean distance between
+did not do so. This can be detected by measuring the Euclidean distance between
the end of one stroke and the beginning of the next stroke. If this distance is
below a threshold, then the strokes are connected.
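The following sketch shows the idea. It assumes a recording is a list of
strokes and every stroke is a list of $(x, y, t)$ points; the threshold is
only a placeholder value, not the one used in the experiments.
\begin{verbatim}
import math

def connect_strokes(strokes, threshold=10.0):
    """Merge consecutive strokes whose gap is smaller than threshold."""
    if not strokes:
        return []
    connected = [strokes[0]]
    for stroke in strokes[1:]:
        x1, y1 = connected[-1][-1][:2]    # end point of the previous stroke
        x2, y2 = stroke[0][:2]            # start point of the current stroke
        if math.hypot(x2 - x1, y2 - y1) < threshold:
            connected[-1] = connected[-1] + stroke   # connect the two strokes
        else:
            connected.append(stroke)
    return connected
\end{verbatim}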
@@ -207,19 +207,20 @@ activation functions can be varied. The learning algorithm is parameterized by
the learning rate $\eta \in (0, \infty)$, the momentum $\alpha \in [0, \infty)$
and the number of epochs.
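For orientation, a common form of the weight update in which both parameters
appear is
\[
    \Delta w^{(t)} = -\eta \, \nabla E\!\left(w^{(t)}\right)
                     + \alpha \, \Delta w^{(t-1)},
    \qquad
    w^{(t+1)} = w^{(t)} + \Delta w^{(t)},
\]
where $E$ is the error function; the exact formulation used for the
experiments may differ slightly.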

-The topology of \glspl{MLP} will be denoted in the following by separating
-the number of neurons per layer with colons. For example, the notation $160{:}500{:}500{:}500{:}369$
-means that the input layer gets 160~features, there are three hidden layers
-with 500~neurons per layer and one output layer with 369~neurons.
-
-\glspl{MLP} training can be executed in
-various different ways, for example with \gls{SLP}.
-In case of a \gls{MLP} with the topology $160{:}500{:}500{:}500{:}369$,
-\gls{SLP} works as follows: At first a \gls{MLP} with one hidden layer ($160{:}500{:}369$)
-is trained. Then the output layer is discarded, a new hidden layer and a new
-output layer is added and it is trained again, resulting in a $160{:}500{:}500{:}369$
-\gls{MLP}. The output layer is discarded again, a new hidden layer is added and
-a new output layer is added and the training is executed again.
+The topology of \glspl{MLP} will be denoted in the following by separating the
+number of neurons per layer with colons. For example, the notation
+$160{:}500{:}500{:}500{:}369$ means that the input layer gets 160~features,
+that there are three hidden layers with 500~neurons each and that the output
+layer has 369~neurons.
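As a small illustration of this notation (not code from the thesis), a network
can be constructed directly from such a topology string; PyTorch and the
sigmoid activation are assumptions made only for this sketch.
\begin{verbatim}
import torch.nn as nn

def mlp_from_topology(topology):
    """Build a feed-forward network from a string like '160:500:500:500:369'."""
    sizes = [int(n) for n in topology.split(":")]
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        layers += [nn.Linear(n_in, n_out), nn.Sigmoid()]
    return nn.Sequential(*layers[:-1])   # no activation after the output layer

model = mlp_from_topology("160:500:500:500:369")
\end{verbatim}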
+
+\glspl{MLP} training can be executed in various ways, for example with
+\gls{SLP}. In the case of a \gls{MLP} with the topology
+$160{:}500{:}500{:}500{:}369$, \gls{SLP} works as follows: At first a \gls{MLP}
+with one hidden layer ($160{:}500{:}369$) is trained. Then the output layer is
+discarded, a new hidden layer and a new output layer are added and the network
+is trained again, resulting in a $160{:}500{:}500{:}369$ \gls{MLP}. The output
+layer is discarded again, another hidden layer and a new output layer are
+added, and the training is executed again.
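The following sketch illustrates this growing procedure. PyTorch, the sigmoid
activation, the plain gradient descent loop and the random data are all
assumptions made only for this illustration and do not reflect the actual
training setup.
\begin{verbatim}
# Supervised layer-wise pretraining for a 160:500:500:500:369 topology:
# train 160:500:369 first, then repeatedly replace the output layer,
# add a hidden layer and train again.
import torch
import torch.nn as nn

n_in, n_hidden, n_out = 160, 500, 369

def train(model, X, y, epochs=5, lr=0.1):
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss_fn(model(X), y).backward()
        opt.step()
    return model

X = torch.randn(128, n_in)               # dummy feature vectors
y = torch.randint(0, n_out, (128,))      # dummy symbol labels

hidden = [nn.Linear(n_in, n_hidden), nn.Sigmoid()]
train(nn.Sequential(*hidden, nn.Linear(n_hidden, n_out)), X, y)  # 160:500:369

for _ in range(2):   # grow to 160:500:500:369, then to 160:500:500:500:369
    hidden += [nn.Linear(n_hidden, n_hidden), nn.Sigmoid()]
    train(nn.Sequential(*hidden, nn.Linear(n_hidden, n_out)), X, y)
\end{verbatim}
The already trained hidden layers keep their weights between the rounds, while
the output layer is created anew each time, mirroring the description above.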

Denoising auto-encoders are another way of pretraining. An
\textit{auto-encoder} is a neural network that is trained to restore its input.