Selaa lähdekoodia

Added the two papers SAN and DMN+ which are currently SOTA and beating the bechmarks for Visual Question Answering

Aaditya GPGPU 9 vuotta sitten
vanhempi
commit
71613ce907
1 muutettua tiedostoa jossa 5 lisäystä ja 2 poistoa
  1. 5 2
      README.md

+ 5 - 2
README.md

@@ -295,8 +295,11 @@ with Deep Convolutional Neural Network, arXiv:1512.07928 [[Paper](http://arxiv.o
 * Baidu / UCLA [[Paper]](http://arxiv.org/pdf/1505.05612) [[Dataset]]()
   * Hauyuan Gao, Junhua Mao, Jie Zhou, Zhiheng Huang, Lei Wang, Wei Xu, Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering, arXiv:1505.05612.
 * POSTECH [[Paper](http://arxiv.org/pdf/1511.05756.pdf)] [[Project Page](http://cvlab.postech.ac.kr/research/dppnet/)]
-  * Hyeonwoo Noh, Paul Hongsuck Seo, and Bohyung Han, Image Question Answering using Convolutional Neural Network 
-with Dynamic Parameter Prediction, arXiv:1511.05765
+  * Hyeonwoo Noh, Paul Hongsuck Seo, and Bohyung Han, Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction, arXiv:1511.05765
+* CMU / Microsoft Research [[Paper](http://arxiv.org/pdf/1511.02274v2.pdf)]
+  * Yang, Z., He, X., Gao, J., Deng, L., & Smola, A. (2015). Stacked Attention Networks for Image Question Answering. arXiv:1511.02274.
+* MetaMind [[Paper](http://arxiv.org/pdf/1603.01417v1.pdf)]
+  * Xiong, Caiming, Stephen Merity, and Richard Socher. "Dynamic Memory Networks for Visual and Textual Question Answering." arXiv:1603.01417 (2016).
 
 ### Other Topics
   * Visual Analogy [[Paper](https://web.eecs.umich.edu/~honglak/nips2015-analogy.pdf)]