Show, Attend and Tell:
Neural Image Caption Generation with Visual Attention
Kelvin Xu*, Jimmy Ba†, Ryan Kiros†, Kyunghyun Cho*, Aaron Courville*, Ruslan Salakhutdinov†, Richard Zemel†, Yoshua Bengio*
Université de Montréal* / University of Toronto†
(some figures from Hugo Larochelle)
July 8, 2015