|
10 lat temu | |
---|---|---|
README.md | 10 lat temu |
A curated list of deep learning resources for computer vision, inspired by awesome-php and awesome-computer-vision.
Please feel free to pull requests or email jiwon@alum.mit.edu to add links.
Baidu/UCLA: Explain Images with Multimodal Recurrent Neural Networks(http://arxiv.org/abs/1410.1090) Toronto: Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models(http://arxiv.org/abs/1411.2539) Berkeley: Long-term Recurrent Convolutional Networks for Visual Recognition and Description(http://arxiv.org/abs/1411.4389) Google: Show and Tell: A Neural Image Caption Generator(http://arxiv.org/abs/1411.4555) Stanford: Deep Visual-Semantic Alignments for Generating Image Description(http://cs.stanford.edu/people/karpathy/deepimagesent/) UML/UT: Translating Videos to Natural Language Using Deep Recurrent Neural Networks(http://arxiv.org/abs/1412.4729) Microsoft/CMU: Learning a Recurrent Visual Representation for Image Caption Generation(http://arxiv.org/abs/1411.5654) Microsoft: From Captions to Visual Concepts and Back(http://arxiv.org/abs/1411.4952)