will-do.tex 1.3 KB

1234567891011121314151617181920212223242526272829303132
  1. \subsection{What will I do next?}
  2. \begin{frame}{What will I do next?}
  3. \begin{itemize}
  4. \item Get classification performance with cross-validation
  5. \item Implement neural net for classification
  6. \begin{itemize}
  7. \item preprocessing: compute cubic spline for each line
  8. \begin{itemize}
  9. \item equi-spaced points or
  10. \item get equi-timed points
  11. \end{itemize}
  12. \item 5 - 20 input neurons for each line
  13. \item 1076 output neurons (one for each symbol)
  14. \end{itemize}
  15. \item Get a language model (e.g. by parsing Wikipedia)
  16. \item Use ANN with HMM (?)
  17. \end{itemize}
  18. \end{frame}
  19. \subsection{Far future}
  20. \begin{frame}{What could be done?}
  21. \begin{itemize}
  22. \item Make use of audio data in a multimodal approach\\
  23. e.g. $R$ and $\mathcal{R}$
  24. \item Currently, the Lecture Translation system doesn't recognize math.\\
  25. You get \enquote{integral of e raised to the power of x d x} instead
  26. of $\int e^x \mathrm{d} x$.
  27. \item Spoken math is ambigous: $\sqrt{a+b}$ vs. $\sqrt{a} + b$
  28. \item The language model I create could help to find probable formulas
  29. \item The platform could be used to get more input data of users
  30. \end{itemize}
  31. \end{frame}