A Corpus of Memes from Reddit: Acquisition, Preparation and First Case Studies

dc.contributor.author: Schmidt, Thomas
dc.contributor.author: Schiller, Fabian
dc.contributor.author: Götz, Matthias
dc.contributor.author: Wolff, Christian
dc.contributor.editor: Klein, Maike
dc.contributor.editor: Krupka, Daniel
dc.contributor.editor: Winter, Cornelia
dc.contributor.editor: Wohlgemuth, Volker
dc.date.accessioned: 2023-11-29T14:50:35Z
dc.date.available: 2023-11-29T14:50:35Z
dc.date.issued: 2023
dc.description.abstract [en]: We present a corpus of memes and their textual components acquired from the popular meme platform r/memes, a subreddit of Reddit and one of the major outlets of online meme culture. The corpus consists of the most popular memes on the platform from 2013 to 2021 and comprises 11,701 memes and 280,351 text tokens. We conduct several case studies focused on diachronic analysis to highlight the possibilities of the corpus for research in internet studies and online culture. We examine the general activity on the platform throughout the years and identify a significant increase in meme production beginning in 2017. Results of sentiment analysis show a tendency towards memes with positively classified texts. The analysis of the most frequent words per half-year spotlights the importance of certain cultural events for meme culture (e.g. the 2016 US election). An analysis of swear and sexual words using the LIWC shows an overall decrease in the usage of these words, pointing to increased moderation of the platform. The corpus is publicly available to the research community for further studies.
dc.identifier.doi: 10.18420/inf2023_89
dc.identifier.isbn: 978-3-88579-731-9
dc.identifier.pissn: 1617-5468
dc.identifier.uri: https://dl.gi.de/handle/20.500.12116/43213
dc.language.iso: en
dc.publisher: Gesellschaft für Informatik e.V.
dc.relation.ispartof: INFORMATIK 2023 - Designing Futures: Zukünfte gestalten
dc.relation.ispartofseries: Lecture Notes in Informatics (LNI) - Proceedings, Volume P-337
dc.subject: memes
dc.subject: internet studies
dc.subject: corpus
dc.subject: natural language processing
dc.subject: sentiment analysis
dc.subject: Reddit
dc.title [en]: A Corpus of Memes from Reddit: Acquisition, Preparation and First Case Studies
dc.type: Text/Conference Paper
gi.citation.endPage: 804
gi.citation.publisherPlace: Bonn
gi.citation.startPage: 795
gi.conference.date: 26-29 September 2023
gi.conference.location: Berlin
gi.conference.sessiontitle: Kultur & Design - Digital Cultures Cultural Analytics (InfDH 2023)

Files

Original bundle
Name: INFDH_2023_Schmidt_Memes.pdf
Size: 559.36 KB
Format: Adobe Portable Document Format