Victor Zhou victorzhou.com

A gentle introduction to Visual Question Answering using neural networks

Show us humans a picture of someone in uniform on a mound of dirt throwing a ball, and we will quickly tell you we’re looking at baseball. But how do you make a computer come to the same conclusion?

Visual Question Answering

In this post, we’ll explore basic methods for performing Visual Question Answering (VQA) and build our own simple implementation in Python.

