Layer normalization. arXiv preprint, 2016. ,
Neural machine translation by jointly learning to align and translate, 2014. ,
Does Multimodality Help Human and Machine for Translation and Image Captioning?, Proceedings of the First Conference on Machine Translation: Volume 2,
Shared Task Papers, pp.627-633, 2016. ,
DOI : 10.18653/v1/W16-2358
URL : https://hal.archives-ouvertes.fr/hal-01433183
Multimodal attention for neural machine translation, 2016. ,
Nmtpy: A flexible toolkit for advanced neural machine translation systems. arXiv preprint, 2017. ,
Doubly-attentive decoder for multimodal neural machine translation. arXiv preprint, 2017. ,
Incorporating global visual features into attentionbased neural machine translation. arXiv preprint, 2017. ,
Empirical evaluation of gated recurrent neural networks on sequence modeling, 2014. ,
Better hypothesis testing for statistical machine translation: Controlling for optimizer instability, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers, pp.176-181, 2011. ,
Findings of the Second Shared Task on Multimodal Machine Translation
and Multilingual Image Description, Proceedings of the Second Conference on Machine Translation, 2017. ,
DOI : 10.18653/v1/W17-4718
Multi30K: Multilingual English-German Image Descriptions, Proceedings of the 5th Workshop on Vision and Language, pp.70-74, 2016. ,
DOI : 10.18653/v1/W16-3210
Imagination improves multimodal translation, 2017. ,
Conditional gated recurrent unit with attention mechanism, 2016. ,
Understanding the difficulty of training deep feedforward neural networks, Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics. PMLR, volume 9 of Proceedings of Machine Learning Research, pp.249-256, 2010. ,
Deep Residual Learning for Image Recognition, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.770-778, 2016. ,
DOI : 10.1109/CVPR.2016.90
Attentionbased multimodal neural machine translation, Proceedings of the First Conference on Machine Translation. Association for Computational Linguistics, pp.639-645, 2016. ,
DOI : 10.18653/v1/w16-2360
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412, 2014. ,
Meteor, Proceedings of the Second Workshop on Statistical Machine Translation, StatMT '07, pp.228-231, 2007. ,
DOI : 10.3115/1626355.1626389
Attention strategies for multi-source sequenceto-sequence learning, 2017. ,
Multi-task sequence to sequence learning, 2015. ,
BLEU, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , ACL '02, pp.311-318, 2002. ,
DOI : 10.3115/1073083.1073135
On the difficulty of training recurrent neural networks, Proceedings of the 30th International Conference on International Conference on Machine Learning -Volume 28. JMLR.org, ICML'13, pp.1310-1318, 2013. ,
Faster r-cnn: Towards real-time object detection with region proposal networks, Proceedings of the 28th International Conference on Neural Information Processing Systems, pp.91-99, 2015. ,
ImageNet Large Scale Visual Recognition Challenge, International Journal of Computer Vision, vol.1010, issue.1, pp.211-252, 2015. ,
DOI : 10.1007/978-3-642-15555-0_11
Neural Machine Translation of Rare Words with Subword Units, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.1715-1725, 2016. ,
DOI : 10.18653/v1/P16-1162
Very deep convolutional networks for large-scale image recognition. arXiv preprint, 2014. ,
A Shared Task on Multimodal Machine Translation and Crosslingual
Image Description, Proceedings of the First Conference on Machine Translation: Volume 2,
Shared Task Papers, pp.543-553, 2016. ,
DOI : 10.18653/v1/W16-2346
Dropout: A simple way to prevent neural networks from overfit- ting, 2014. ,
Sequence to sequence learning with neural networks, Proceedings of the 27th International Conference on Neural Information Processing Systems, pp.3104-3112, 2014. ,
A multifaceted evaluation of neural versus phrasebased machine translation for 9 language directions, Proceedings of the 15th Conference of the European Chapter, pp.1063-1073, 2017. ,
Show, attend and tell: Neural image caption generation with visual attention, Proceedings of the 32nd International Conference on Machine Learning (ICML-15). JMLR Workshop and Conference Proceedings, pp.2048-2057, 2015. ,