CoCa(Contrastive Captioners)
Pretraining method : encoder-decoder models encoder dual encoder decoder transfer learning multimodal : In CoCa using text data + image data modality: In the context of human–computer interaction, a modality is the classification of a single independent channel of sensory input/output between a computer and a human. A system is designated unimodal if it has only one modality implemented, and mul..
2022.11.03