id	YoP	Paper Name	Description
	2023		***
48	2023	GLIGEN: Open-Set Grounded Text-to-Image Generation	***
47	2023	Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation	***
46	2023	CLIPAG: Towards Generator-Free Text-to-Image Generation	***
45	2023	Text-to-image Diffusion Models in Generative AI: A Survey	***
44	2023	DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation	***
43	2023	Imagic: Text-Based Real Image Editing With Diffusion Models	***
42	2023	Adding Conditional Control to Text-to-Image Diffusion Models	***
41	2023	ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model With Knowledge-Enhanced Mixture-of-Denoising-Experts	***
40	2023	Multi-Concept Customization of Text-to-Image Diffusion	***
39	2023	GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis	***
38	2023	SINE: SINgle Image Editing with Text-to-Image Diffusion Models	***
37	2023	Scaling Up GANs for Text-to-Image Synthesis	***
36	2023	Versatile Diffusion: Text, Images and Variations All in One Diffusion Model	***
35	2023	ReCo: Region-Controlled Text-to-Image Generation	***
34	2023	Multiscale Feature Extraction and Fusion of Image and Text in VQA	***
33	2023	BanglaAbuseMeme: A Dataset for Bengali Abusive Meme Classification	***
32	2023	MemeCap: A Dataset for Captioning and Interpreting Memes	***
31	2023	Stochastic evolution of bad memes	This paper hypothesizes that even a bad meme with low/unattractive content can have significant circulation and adoption in a network iff it has a social conformation factor. This phenomenon occurs due to the pluralistic ignorance effect of SNA. The authors have created a mathematical model of meme categorization + propagation evolution.
30	2023	Mapping Meme to Words	This paper introduces a framework, 'ISSUES' which uses 3 techniques, * Textual inversion: it maps the image into the pseudo-text/token space. * Extracts latent representation of textual and image data using CLIP. * Multimodal Fusion It uses dataset (1) HMC and (2) HarMeme
29	2023	Contextualizing Internet Memes Across Social Platforms.	This paper creates a method to map a meme to meme-KG, 'IMKG' and hypothesizes that the learned mapping function can unveil implicit-contextual-knowledge of the meme. For evaluation, the authors have used two social media platforms - (1) Reddit and (2) Dischord.
28	2023	FLYPE : Multitask Prompt Tuning for Multimodal Human Understanding of Social Media	FLYPE has proposed a composite loss function for cross-task, shared prompts and targets unseen scenario to solve.
27	2023	"Somewhere Along Your Pedigree, a Bitch Got Over the Wall!” – A Data-Driven Approach to a Typology of Implicitly Offensive Language	This paper defines a NEW TOPOLOGY and corresponding definitions for implicit hate content. Also, highlights the issues with old approaches.
26	2023	PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using Large Language Models	PromptMTopic identifies and extracts topics involved in a set of memes. Used datasets, (1) TOTALDEFMEME (2)FHM (3)MEMOTION
25	2023	Causal Intersectionality and Dual Form of Gradient Descent for Multimodal Analysis: a Case Study on Hateful Memes	It merges two distinct lines of research work, XAI and Cuasal Analysis. The authors have hypothesized that the task of hateful meme detection can be formulated in terms of Average Treatment Effect (ATE of XAI) and summarised gradient-based attention attribution score (from Causal Analysis).
24	2023	Detecting and Correcting Hate Speech in Multimodal Memes with Large Visual Language Model	It uses 0-shot prompting in LLaVA for the task of hateful meme detection and correction.
23	2023	Leveraging World Knowledge in Implicit Hate Speech Detection	Uses Entity-Linking technique to incorporate world-contextual knowledge of entities to improve the detection of EXPLICIT and IMPLICIT memes.
22	2023	LLM-Assisted Content Analysis: Using Large Language Models to Support Deductive Coding	***
21	2023	Chain of Explanation: New Prompting Method to Generate Quality Natural Language Explanation for Implicit Hate Speech	Using predefined prompts, Explanations for implicit hate content are generated in natural language.
20	2023	Is ChatGPT better than Human Annotators? Potential and Limitations of ChatGPT in Explaining Implicit Hate Speech	It tests the limitations and feasibility of ChatGPT for NLE of an implicit meme.
19	2023	An In-depth Analysis of Implicit and Subtle Hate Speech Messages	It compares different benchmarks for Hate-Speech detection and highlights the facts that conventional models are not suitable for detecting a implicit/subtle hate content.
18	2022	ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection	It is a toxic Data Creation Framework. Using LLM, it uses description-based prompting + adversarial-classifier-in-loop method to generate toxic data using a LLM.
17	2022	Generalizable Implicit Hate Speech Detection using Contrastive Learning	It uses contrastive learning to train a model for CROSS_TASK evaluation for IMPLICIT hate content detection.
16	2021	Latent Hatred: A Benchmark for Understanding Implicit Hate Speech	It defines a theoretically justified Taxonomy and a fine-grained annotated corpus for IMPLICIT hate speech.
15	2023	Tad: A Domain-Aware Framework Learning to Adapt Target Shifts of Hate Speech	It incorporates Cross-Domain + Target-Shift modelling of hate sppech content.
15	2023	Social Meme-ing: Measuring Linguistic Variation in Memes	It considers each meme as a node and each template as a semantically binding function w.r.t. the meme-text. Later it extracts all possible clusters, where each cluster contains memes with similar templates. Authors have studied these clusters to know how socio-factors have influenced the evolution of meme-text demographically.
14	2023	COGVLM: VISUAL EXPERT FOR LARGE LANGUAGE MODELS	A visual-language model. It trains and attaches a visual expert in attention + MLP layers of a language model which is responsible for extracting important similarity between text and an image. It is able to do deep-fusion.
13	2023	COVLM: COMPOSING VISUAL ENTITIES AND RELATIONSHIPS IN LARGE LANGUAGE MODELS VIA COMMUNICATIVE DECODING	A helper model that can guide an LLM to express the relationship between entities in the image and the visual text.
12	2023	GLaMM : Pixel Grounding Large Multimodal Model	First kind of LMM that can generate a textual response at multiple shades of granularity w.r.t. an input image.
11	2023	A Template Is All You Meme	It builds a KB based on 54k images. It consists of different templates with corresponding information and example images.
10	2023	Detecting and Correcting Hate Speech in Multimodal Memes with Large Visual Language Model	It uses a 0-shot prompt with LlaVa for Detecting and Correcting Hate meme.
9	2023	You Know What I Meme Enhancing People's Understanding and Awareness of Hateful Memes Using Crowdsourced Explanations	***
8	2023	What Do You MEME Generating Explanations for Visual Semantic Role Labelling in Memes	Authors have represented an HVV dataset of 3K instances and an NLE novel task + a model 'LUMEN' with a comparative study.
7	2023	Pro-Cap Leveraging a Frozen Vision-Language Model for Hateful Meme Detection	It uses 0-shot-based QA to probe an LM and generate captions for no-text memes.
6	2023	MEMEX Detecting Explanatory Evidence for Memes via Knowledge-Enriched Contextualization	Given a meme and a document, MEMEX can extract the background information.
5	2023	HateProof: Are Hateful Meme Detection Systems really Robust?	A case study over the vulnerability of the existing systems in hate detection. It provides a solution based on contrastive learning + adversarial training.
4	2023	Decoding the Underlying Meaning of Multimodal Hateful Memes.	A novel task + annotated-contextual information dataset + relevance study over hateful meme.
3	2023	Characterizing the Entities in Harmful Memes Who is the Hero, the Villain, the Victim	Understanding and Identifying Hero, Villain and Victim of a meme.
2	2023	Review of Vision-Language Models and their Performance on the Hateful Memes Challenge	A survey of different unimodal + multimodal systems w.r.t. meme classification.
1	2022	Prompting for Multimodal Hateful Meme Classification	A prompt-based model that takes (1) image text, (2) image caption, and (3) a predefined prompt to classify an input meme.