Multimodal deep learning