In the previous segment we discussed how you can also create an embedding by attaching a general label.
But let's now discuss a general network architecture that allows you to embed two different kinds of data into the same space. In the example below we'll associate text with an image.
The image part would require some convolutional layers, but besides that we still end up with a gradient that can propagate backward, updating all the weights along the way.
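To make the idea concrete, here is a minimal sketch of such a two-input setup, written in plain Python so the backward pass is visible. Everything in it is an assumption made for illustration: the names (`project`, `forward`, `train_step`), the toy data, the starting weights, and the choice of a dot product followed by a sigmoid as the matching score. The image side is just a linear layer on flattened pixels, standing in for the convolutional layers a real image model would use.

```python
import math

EMBED_DIM = 2  # size of the shared embedding space (illustrative choice)

def project(weights, x):
    """Linear 'tower': map input x (length n) to an EMBED_DIM vector."""
    return [sum(x[i] * weights[i][j] for i in range(len(x)))
            for j in range(EMBED_DIM)]

def forward(W_text, W_image, x_text, x_image):
    """Embed both inputs; return the embeddings and the match probability."""
    e_text = project(W_text, x_text)
    e_image = project(W_image, x_image)
    score = sum(a * b for a, b in zip(e_text, e_image))
    return e_text, e_image, 1.0 / (1.0 + math.exp(-score))

def train_step(W_text, W_image, x_text, x_image, label, lr=0.2):
    """One SGD step on binary cross-entropy; updates both towers in place."""
    e_text, e_image, p = forward(W_text, W_image, x_text, x_image)
    loss = -(label * math.log(p) + (1 - label) * math.log(1 - p))
    d_score = p - label  # gradient of the loss w.r.t. the dot-product score
    # The same error signal flows back into the weights of each tower.
    for i, xi in enumerate(x_text):
        for j in range(EMBED_DIM):
            W_text[i][j] -= lr * d_score * xi * e_image[j]
    for i, xi in enumerate(x_image):
        for j in range(EMBED_DIM):
            W_image[i][j] -= lr * d_score * xi * e_text[j]
    return loss

# Toy data (hypothetical): one-hot words and flattened 2x2 "images".
cat_text, dog_text = [1, 0, 0], [0, 1, 0]
cat_image, dog_image = [1, 1, 0, 0], [0, 0, 1, 1]
pairs = [
    (cat_text, cat_image, 1),  # matching pairs get label 1
    (dog_text, dog_image, 1),
    (cat_text, dog_image, 0),  # mismatched pairs get label 0
    (dog_text, cat_image, 0),
]

# Small fixed starting weights so the run is reproducible.
W_text = [[0.5, -0.3], [-0.2, 0.4], [0.1, 0.1]]
W_image = [[0.3, 0.1], [0.2, -0.1], [-0.1, 0.3], [0.1, 0.2]]

losses = []
for epoch in range(100):
    total = sum(train_step(W_text, W_image, t, im, y) for t, im, y in pairs)
    losses.append(total / len(pairs))

print(f"mean loss: first epoch {losses[0]:.3f}, last epoch {losses[-1]:.3f}")
```

The two inputs have different sizes (3 and 4 here), yet both end up as vectors of length `EMBED_DIM`, which is what lets us compare them directly. Swapping the linear image projection for convolutional layers would change how `e_image` is computed, but the error signal would still reach both sets of weights in the same way.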