Emergent Communication Pretraining.

2023

Description

Investigations entailed using existing methods to develop emergent languages to be utilized as a pretraining dataset for multimodal-LLMs. Tests included downstream tasks such as Visual Grounding and VQA.

By training a Speaker agent that outputs an emergent language describing images, we used the resulting language to pretrain a multimodal LLM ,OFA, with corresponding images for various multimodal learning tasks.