|
@@ -83,8 +83,9 @@ Please feel free to [pull requests](https://github.com/kjw0612/awesome-deep-visi
|
|
|
* Karel Lenc, Andrea Vedaldi, R-CNN minus R, arXiv:1506.06981.
|
|
|
* End-to-end people detection in crowded scenes [[Paper]](http://arxiv.org/abs/1506.04878)
|
|
|
* Russell Stewart, Mykhaylo Andriluka, End-to-end people detection in crowded scenes, arXiv:1506.04878.
|
|
|
-* You Only Look Once: Unified, Real-Time Object Detection [[Paper]](http://arxiv.org/abs/1506.02640)
|
|
|
+* You Only Look Once: Unified, Real-Time Object Detection [[Paper]](http://arxiv.org/abs/1506.02640), [[Paper Version 2]](https://arxiv.org/abs/1612.08242), [[C Code]](https://github.com/pjreddie/darknet), [[Tensorflow Code]](https://github.com/thtrieu/darkflow)
|
|
|
* Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi, You Only Look Once: Unified, Real-Time Object Detection, arXiv:1506.02640
|
|
|
+ * Joseph Redmon, Ali Farhadi (Version 2)
|
|
|
* Inside-Outside Net [[Paper]](http://arxiv.org/abs/1512.04143)
|
|
|
* Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick, Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
|
|
|
* Deep Residual Network (Current State-of-the-Art) [[Paper]](http://arxiv.org/abs/1512.03385)
|
|
@@ -96,8 +97,6 @@ Please feel free to [pull requests](https://github.com/kjw0612/awesome-deep-visi
|
|
|
* Wei Liu1, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg, SSD: Single Shot MultiBox Detector, arXiv:1512.02325
|
|
|
* Speed/accuracy trade-offs for modern convolutional object detectors [[Paper]](https://arxiv.org/pdf/1611.10012v1.pdf)
|
|
|
* Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy, Google Research, arXiv:1611.10012
|
|
|
-* You Only Look Once: Object detection and classification in real-time [[Paper]](https://arxiv.org/abs/1612.08242), [[C Code]](https://github.com/pjreddie/darknet), [[Tensorflow Code]](https://github.com/thtrieu/darkflow)
|
|
|
- * Joseph Redmon, Ali Farhadi
|
|
|
|
|
|
### Video Classification
|
|
|
* Nicolas Ballas, Li Yao, Pal Chris, Aaron Courville, "Delving Deeper into Convolutional Networks for Learning Video Representations", ICLR 2016. [[Paper](http://arxiv.org/pdf/1511.06432v4.pdf)]
|