id | YoP | Paper Name | Description |
---|---|---|---|
2023 | *** | ||
48 | 2023 | GLIGEN: Open-Set Grounded Text-to-Image Generation | *** |
47 | 2023 | Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation | *** |
46 | 2023 | CLIPAG: Towards Generator-Free Text-to-Image Generation | *** |
45 | 2023 | Text-to-image Diffusion Models in Generative AI: A Survey | *** |
44 | 2023 | DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | *** |
43 | 2023 | Imagic: Text-Based Real Image Editing With Diffusion Models | *** |
42 | 2023 | Adding Conditional Control to Text-to-Image Diffusion Models | *** |
41 | 2023 | ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model With Knowledge-Enhanced Mixture-of-Denoising-Experts | *** |
40 | 2023 | Multi-Concept Customization of Text-to-Image Diffusion | *** |
39 | 2023 | GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis | *** |
38 | 2023 | SINE: SINgle Image Editing with Text-to-Image Diffusion Models | *** |
37 | 2023 | Scaling Up GANs for Text-to-Image Synthesis | *** |
36 | 2023 | Versatile Diffusion: Text, Images and Variations All in One Diffusion Model | *** |
35 | 2023 | ReCo: Region-Controlled Text-to-Image Generation | *** |
34 | 2023 | Multiscale Feature Extraction and Fusion of Image and Text in VQA | *** |
33 | 2023 | BanglaAbuseMeme: A Dataset for Bengali Abusive Meme Classification | *** |
32 | 2023 | MemeCap: A Dataset for Captioning and Interpreting Memes | *** |
31 | 2023 | Stochastic evolution of bad memes | This paper hypothesizes that even a bad meme with low/unattractive content can have significant circulation and adoption in a network iff it has a social conformation factor. This phenomenon occurs due to the pluralistic ignorance effect of SNA. The authors have created a mathematical model of meme categorization + propagation evolution. |
30 | 2023 | Mapping Meme to Words | This paper introduces a framework, 'ISSUES' which uses 3 techniques, * Textual inversion: it maps the image into the pseudo-text/token space. * Extracts latent representation of textual and image data using CLIP. * Multimodal Fusion It uses dataset (1) HMC and (2) HarMeme |
29 | 2023 | Contextualizing Internet Memes Across Social Platforms. | This paper creates a method to map a meme to meme-KG, 'IMKG' and hypothesizes that the learned mapping function can unveil implicit-contextual-knowledge of the meme. For evaluation, the authors have used two social media platforms - (1) Reddit and (2) Dischord. |
28 | 2023 | FLYPE : Multitask Prompt Tuning for Multimodal Human Understanding of Social Media | FLYPE has proposed a composite loss function for cross-task, shared prompts and targets unseen scenario to solve. |
27 | 2023 | "Somewhere Along Your Pedigree, a Bitch Got Over the Wall!” – A Data-Driven Approach to a Typology of Implicitly Offensive Language | This paper defines a NEW TOPOLOGY and corresponding definitions for implicit hate content. Also, highlights the issues with old approaches. |
26 | 2023 | PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using Large Language Models | PromptMTopic identifies and extracts topics involved in a set of memes.Used datasets, (1) TOTALDEFMEME (2)FHM (3)MEMOTION |
25 | 2023 | Causal Intersectionality and Dual Form of Gradient Descent for Multimodal Analysis: a Case Study on Hateful Memes | It merges two distinct lines of research work, XAI and Cuasal Analysis. The authors have hypothesized that the task of hateful meme detection can be formulated in terms of Average Treatment Effect (ATE of XAI) and summarised gradient-based attention attribution score (from Causal Analysis). |
24 | 2023 | Detecting and Correcting Hate Speech in Multimodal Memes with Large Visual Language Model | It uses 0-shot prompting in LLaVA for the task of hateful meme detection and correction. |
23 | 2023 | Leveraging World Knowledge in Implicit Hate Speech Detection | Uses Entity-Linking technique to incorporate world-contextual knowledge of entities to improve the detection of EXPLICIT and IMPLICIT memes. |
22 | 2023 | LLM-Assisted Content Analysis: Using Large Language Models to Support Deductive Coding | *** |
21 | 2023 | Chain of Explanation: New Prompting Method to Generate Quality Natural Language Explanation for Implicit Hate Speech | Using predefined prompts, Explanations for implicit hate content are generated in natural language. |
20 | 2023 | Is ChatGPT better than Human Annotators? Potential and Limitations of ChatGPT in Explaining Implicit Hate Speech | It tests the limitations and feasibility of ChatGPT for NLE of an implicit meme. |
19 | 2023 | An In-depth Analysis of Implicit and Subtle Hate Speech Messages | It compares different benchmarks for Hate-Speech detection and highlights the facts that conventional models are not suitable for detecting a implicit/subtle hate content. |
18 | 2022 | ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection | It is a toxic Data Creation Framework. Using LLM, it uses description-based prompting + adversarial-classifier-in-loop method to generate toxic data using a LLM. |
17 | 2022 | Generalizable Implicit Hate Speech Detection using Contrastive Learning | It uses contrastive learning to train a model for CROSS_TASK evaluation for IMPLICIT hate content detection. |
16 | 2021 | Latent Hatred: A Benchmark for Understanding Implicit Hate Speech | It defines a theoretically justified Taxonomy and a fine-grained annotated corpus for IMPLICIT hate speech. |
15 | 2023 | Tad: A Domain-Aware Framework Learning to Adapt Target Shifts of Hate Speech | It incorporates Cross-Domain + Target-Shift modelling of hate sppech content. |
15 | 2023 | Social Meme-ing: Measuring Linguistic Variation in Memes | It considers each meme as a node and each template as a semantically binding function w.r.t. the meme-text. Later it extracts all possible clusters, where each cluster contains memes with similar templates. Authors have studied these clusters to know how socio-factors have influenced the evolution of meme-text demographically. |
14 | 2023 | COGVLM: VISUAL EXPERT FOR LARGE LANGUAGE MODELS | A visual-language model. It trains and attaches a visual expert in attention + MLP layers of a language model which is responsible for extracting important similarity between text and an image. It is able to do deep-fusion. |
13 | 2023 | COVLM: COMPOSING VISUAL ENTITIES AND RELATIONSHIPS IN LARGE LANGUAGE MODELS VIA COMMUNICATIVE DECODING | A helper model that can guide an LLM to express the relationship between entities in the image and the visual text. |
12 | 2023 | GLaMM : Pixel Grounding Large Multimodal Model | First kind of LMM that can generate a textual response at multiple shades of granularity w.r.t. an input image. |
11 | 2023 | A Template Is All You Meme | It builds a KB based on 54k images. It consists of different templates with corresponding information and example images. |
10 | 2023 | Detecting and Correcting Hate Speech in Multimodal Memes with Large Visual Language Model | It uses a 0-shot prompt with LlaVa for Detecting and Correcting Hate meme. |
9 | 2023 | You Know What I Meme Enhancing People's Understanding and Awareness of Hateful Memes Using Crowdsourced Explanations | *** |
8 | 2023 | What Do You MEME Generating Explanations for Visual Semantic Role Labelling in Memes | Authors have represented an HVV dataset of 3K instances and an NLE novel task + a model 'LUMEN' with a comparative study. |
7 | 2023 | Pro-Cap Leveraging a Frozen Vision-Language Model for Hateful Meme Detection | It uses 0-shot-based QA to probe an LM and generate captions for no-text memes. |
6 | 2023 | MEMEX Detecting Explanatory Evidence for Memes via Knowledge-Enriched Contextualization | Given a meme and a document, MEMEX can extract the background information. |
5 | 2023 | HateProof: Are Hateful Meme Detection Systems really Robust? | A case study over the vulnerability of the existing systems in hate detection. It provides a solution based on contrastive learning + adversarial training. |
4 | 2023 | Decoding the Underlying Meaning of Multimodal Hateful Memes. | A novel task + annotated-contextual information dataset + relevance study over hateful meme. |
3 | 2023 | Characterizing the Entities in Harmful Memes Who is the Hero, the Villain, the Victim | Understanding and Identifying Hero, Villain and Victim of a meme. |
2 | 2023 | Review of Vision-Language Models and their Performance on the Hateful Memes Challenge | A survey of different unimodal + multimodal systems w.r.t. meme classification. |
1 | 2022 | Prompting for Multimodal Hateful Meme Classification | A prompt-based model that takes (1) image text, (2) image caption, and (3) a predefined prompt to classify an input meme. |