The discourse on climate change has become a centerpiece of public debate, thereby creating a pressing need to analyze the multitude of messages created by the participants in this communication process. In addition to text, messages on this topic are communicated through images, videos, tables and other data objects that are embedded within a document and accompany the text. This paper presents the process of building the InsightsNet Climate Change Corpus (ICCC), a multimodal corpus on the topic of climate change, using NLP tools to enrich corpus metadata, a dataset that lends itself to the exploration of the interplay between the various modalities that constitute the discourse on climate change.