Dark patterns, deceptive interface designs that manipulate user behavior, have been extensively studied for their effects on human decision-making and autonomy. Yet, with the rising prominence of LLM-powered GUI agents that automate tasks from high-level intents, understanding how dark patterns affect agents is increasingly important. We present a two-phase empirical study examining how agents, human participants, and human-AI teams respond to 16 types of dark patterns across diverse scenarios. Phase 1 shows that agents often fail to recognize dark patterns and, even when aware, prioritize task completion over protective action. Phase 2 reveals divergent failure modes: humans succumb to cognitive shortcuts and habitual compliance, while agents falter from procedural blind spots. Human oversight improved avoidance but introduced costs such as attentional tunneling and cognitive load. Our findings show that neither humans nor agents are uniformly resilient and that collaboration introduces new vulnerabilities, suggesting design needs for transparency, adjustable autonomy, and effective oversight.
| Dark Pattern | GPT-4o | Claude 3.7 | DeepSeek V3 | Gemini 2.0 | Claude CUA | Operator |
|---|---|---|---|---|---|---|
| Adding Steps | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Bait and Switch | ✓ | ✓ | ✗ | ✗ | ✗ | ✓ |
| Hiding Information | ✗ | ✗ | ✗ | ✗ | ✓ | ✓ |
| Manipulating Visual Choice Architecture | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Bad Defaults | ✗ | ✗ | ✗ | ✗ | | |
| Emotional or Sensory Manipulation | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Trick Questions | ✗ | ✗ | ✗ | ✓ | | |
| Choice Overload | ✓ | ✓ | ✓ | ✗ | ✓ | ✓ |
| Social Proof | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Nagging | ✓ | ✓ | ✓ | ✓ | ✗ | ✓ |
| Forced Communication or Disclosure | ✗ | ✗ | ✗ | ✗ | | |
| Scarcity and Popularity Claims | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Forced Registration | ✓ | ✗ | ✓ | ✓ | ✓ | |
| Hidden Information | ✗ | ✗ | ✗ | ✗ | | |
| Urgency | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Shaming | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Dark Pattern Type | Avoidance Rate (Human-Only) | Avoidance Rate (Human Oversight) |
|---|---|---|
| Adding Steps | 100.0% | 100.0% |
| Bait and Switch | 70.0% | 91.7% |
| Hiding Information | 70.0% | 100.0% |
| Manipulating Visual Choice Architecture | 90.9% | 100.0% |
| Bad Defaults | 33.3% | 80.0% |
| Emotional or Sensory Manipulation | 83.3% | 100.0% |
| Trick Questions | 45.5% | 81.8% |
| Choice Overload | 100.0% | 100.0% |
| Social Proof | 100.0% | 90.9% |
| Nagging | 83.3% | 100.0% |
| Forced Communication or Disclosure | 25.0% | 50.0% |
| Scarcity and Popularity Claims | 90.9% | 100.0% |
| Forced Registration | 40.0% | 25.0% |
| Hidden Information | 20.0% | 33.3% |
| Urgency | 100.0% | 100.0% |
| Shaming | 100.0% | 100.0% |
@misc{tang2025darkpatternsmeetgui,
title={Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight},
author={Jingyu Tang and Chaoran Chen and Jiawen Li and Zhiping Zhang and Bingcan Guo and Ibrahim Khalilov and Simret Araya Gebreegziabher and Bingsheng Yao and Dakuo Wang and Yanfang Ye and Tianshi Li and Ziang Xiao and Yaxing Yao and Toby Jia-Jun Li},
year={2025},
eprint={2509.10723},
archivePrefix={arXiv},
primaryClass={cs.HC},
url={https://arxiv.org/abs/2509.10723},
}