Make "triggered by user activation" match browser behavior #1903

jyasskin · 2016-10-14T23:46:47Z

"Triggered by user activation" currently behaves differently from the way browsers handle user gestures. For example, Chrome only propagates a gesture through a single setTimeout, and doesn't propagate through XHR or Promise.then() at all.

We're discussing this in https://crbug.com/404161.

The text was updated successfully, but these errors were encountered:

domenic · 2016-10-15T00:20:35Z

I guess this a more accurate description of the problem statement than #1358, so let me close that one in favor of this.

RByers · 2016-10-17T15:21:28Z

Note that some work is happening in blink to try to make the "user gesture" notion tied to a Document so that, for example, sensitive operations like navigation can be performed only after the user has intereacted with that frame specifically. It's probably too early to know for sure that this prove useful, but at least Domenic and Nate should probably chat to make sure we're not heading in different directions here.

annevk · 2016-10-18T13:24:49Z

How does setTimeout propagation happen? You just annotate the task queued with whatever the current task is annotated with? And I guess it has to be some kind of special value that is checked first so you don't continue to propagate it.

jyasskin · 2016-10-21T19:17:45Z

@annevk Pretty much: https://cs.chromium.org/chromium/src/third_party/WebKit/Source/core/frame/DOMTimer.cpp?q=DOMTimer::DOMTimer&sq=package:chromium&l=82&dr=CSs

upsuper · 2016-11-12T09:52:32Z

What about key events? It seems to me Chrome and WebKit accept at least keypress event as a user gesture, but Firefox doesn't. Is there any concern about allowing that from the spec side, or it is just the spec being a bit behind?

annevk · 2016-11-12T10:58:40Z

@upsuper I think that would be possible. Just keypress or also keydown/keyup? (I guess it's the task that triggers them, but nobody has defined those in terms of tasks yet...)

upsuper · 2016-11-12T11:02:30Z

@annevk Also keydown and keyup.

domenic · 2016-12-07T08:35:54Z

https://jsbin.com/jawuqeq/1/edit?html,console,output is evidence that Chrome's heuristics, of propagating the user gesture, are perhaps not necessary for web compat. To see this note how clicking the "Run with JS" button causes the popup to be shown in Chrome, but not in any other browser. Presumably this is because Chrome is propagating the click of that button through to the popup-creation, whereas other browser are not.

domenic · 2017-01-26T05:19:41Z

There is also discussion of this in WICG/interventions#12 and some brief discussion with @bzbarsky in #2292.

At this point it would be extremely helpful if we could get info from other browsers on how their algorithms work. @jyasskin has outlined a proposal based on Blink's heuristics at https://docs.google.com/document/d/1csmMqHr60zded2tdjczr6HrL18iOPR1PlnM2FqCDLOQ/edit#heading=h.j0mdilrjre5y but I'm hesitant to try to codify that in the spec without any knowledge of how close or far away it is from other engines.

bzbarsky · 2017-01-26T05:45:28Z

// cc @smaug----

At first glance in Gecko we have a concept of "handling user input" which is based on certain events (e.g. key events, but not for some keys), which is propagated through some async things (form submission is the obvious one; not sure there's anything else really). Oh, and it times out after a pref-configurable time (defaulting to 1s), so if your event listeners take longer than that, you're no longer considered to be handling user input. This is used in various placed including at least media element code, certain controls on when navigations can happen, fullscreen-allowed checks, certain execCommand (cut/copy) bits, probably others.

There's also a separate concept of "popup blocker state". This is somewhat based on the "handling user input" state but with some additional restrictions and maybe some loosenings (e.g. some trusted keyboard events that may affect popup blocker state may not affect "handling user input state" at first glance). The popup blocker state propagates through more async stuff, including javascript: evaluation, form submission, link clicks (which for example do their targeting of a load async in Gecko), setTimeout/setInterval (but only one level deep, and only if the delay of the timeout/interval is less than a pref-configurable value defaulting to 1s).

Oh, and the popup blocker state is not a boolean. It can actually take on 4 values, which affect whether this particular popup can open, as well as whether future popups will be able to get opened, and whether certain other non-window-open operations that use the popup state can happen.

The popup state is used for window opening, obviously, but also allowing/disallowing focus() calls. Not sure where else.

RByers · 2017-02-24T19:03:32Z

One notion that's important to chromium which it sounds like Gecko has too is the restriction which permits only a single window.open call to succeed for each instance of user input. I.e. dragging a mouse can open a popup from a mousedown, mousemove or mouseup listener, but only the first such window.open call will succeed across all of these (so an abusive site can't open a whole slew of popups for a single mouse drag operation). Perhaps we could start by working that into the spec somehow?

For the other details, I'm willing to try simplifying blink's behavior here to better match the spec, especially if there's another implementation who seems to be getting away (from a web compat perspective) with a simpler / more rational model. It's expensive (but not impossible) to figure out the best trade-off by trial and error, but it should be relatively cheap to switch to something simpler that's already successful in some other engine. IIRC Edge folks said they had a much simpler model which seemed to be working OK for them.

bzbarsky · 2017-02-24T19:08:07Z

Gecko's behavior there is different from Chrome's. In Gecko, this:

document.body.onclick = function() { window.open(); window.open(); }

will open two windows on click. There are various other heuristics Gecko does around popup blocking, but I'm pretty sure we don't track it across events. That said, we don't allow popups from mousemove at all. And popups from click are treated differently in various ways from popups from mouseup/mousedown/dblclick.

Oh, and in Firefox a lot of this stuff (e.g. which events allow popups at all) is user-configurable.

RByers · 2017-02-24T19:30:13Z

will open two windows on click.

Oh, interesting. I don't think our security team would be OK with that - we've seen malicious sites which get a user to click somewhere, and then open 100s of pop-up windows when they do. We're even hoping to fix the case where a tap on a touchscreen currently can open two pop-ups (one for touchstart, one for click), although since that's bounded to two so it's considered a pretty low severity bug.

bzbarsky · 2017-02-24T20:18:56Z

and then open 100s of pop-up windows when they do

We do limit the number you can open on click. The limit is just larger than 1. And a bit dynamic depending on past behavior.

I wasn't suggesting you implement the Gecko behavior here; just saying that there is a wide range of what people consider ok popup blocker behavior....

RByers · 2017-02-24T20:24:52Z

Yep, understood - thanks for the details. I'm just trying to figure out how best to converge our implementations and leaning towards changing chromium to match others when possible :-)

RByers · 2017-02-27T17:29:05Z

Filed this chromium bug to track efforts to better align our implementation with the spec (one way or another).

mystor · 2017-08-25T15:12:25Z

I imagine that for propagating the "triggered by user activation" attribute across callbacks, we'd want to do something like the following:

Propagate into promise callbacks triggered within an event "triggered by user activation"
Consider (some?) IDB callbacks to be "triggered by user activation" if they were requested within an event "triggered by user activation"
Consider setTimeout callbacks to be "triggered by user activation" if they were scheduled with, say, <20ms as their timeout value from another callback "triggered by user activation"?
Consider XHR callbacks to be "triggered by user activation" if the request was made within the a callback "triggered by user activation" (perhaps with a short-ish timeout (1s?) so servers can't feed us bytes slowly to get an arbitrary timeout with XHR callbacks).
Potentially other places which I didn't immediately think of.

domenic · 2017-08-25T16:03:58Z

Hey @mystor,

In Chrome we're investigating a maximally-simple solution, per this design doc. I guess it hasn't been mentioned on this thread yet.

The TLDR version: a user activation sets a bit, indefinitely, which will be consumed next time some user-activation-requiring action happens. We're prototyping that now as we hope it will be a great simplification to our current architecture, but we wanted to make sure it was web compatible before proposing it too concretely.

I'd love to get your thoughts on that kind of setup, assuming that we find it to be web compatible.

mystor · 2017-08-25T16:19:54Z

@domenic I worry about the indefiniteness of this flag, especially for certain APIs. For example, it would be problematic if a document received a user input event, waited a few minutes for the user to stop interacting with their computer, and then went fullscreen trying to mimic the browser/OS UI for phishing.

I imagine we could sort the APIs into security-sensitive and annoying ones, and put the security-sensitive ones behind a timer (e.g. the fullscreen API). I think this is fairly similar to what we have in gecko right now, where we don't have a timer for any of our APIs except for the fullscreen one (which must be triggered within 1s of the user input event. The merely annoying ones potentially could avoid/have a much longer timeout (e.g. clipboard APIs, popup?, autoplay).

cc @smaug---- and @ehsan who have had opinions about this flag in gecko in the past.

bzbarsky · 2017-08-29T21:00:39Z

I have the same worry as @mystor. This is why Gecko right now unsets the "in user activation" flag after 1s even if we're still handling the user input event (e.g. one of the listeners did a 1s busy-loop). The idea is that from the user's point of view there should be a clear connection between the action they took and whatever it is the page is trying to do.

mustaqahmed · 2017-10-13T14:59:11Z

In our proposed design, we wanted to have all APIs consume user activation, to minimize the effect of indefinite lifespan. I agree that it's debatable but since a single user activation can be used at most once, doesn't it minimize the "indefiniteness" worry?

By the way, both Chrome & Edge currently consume the activation with popups, but neither Firefox nor Safari do. And this is just one of many differences for popups we have today.

mustaqahmed · 2018-06-20T15:18:23Z

A quick note, somewhat related: I have created a table of "activation defining events" for Chrome, Edge, Firefox and Safari, each for both mobile and desktop, which shows more inconsistencies (at least more than what I was hoping to find):

Activation-triggering events across browsers

I will create a separate github issue to discuss the event-set, to keep the modeling discussion (this issue) separate. We have separate Chrome issues for them already: fixing event-set vs User Activation v2 model.

smaug---- · 2018-06-21T11:35:47Z

So the test is about window.open (meaning it is about popup blocking in Gecko) ?

bzbarsky · 2018-06-26T02:26:05Z

@mustaqahmed Firefox has at least three different concepts of activation that have different rules applying to them. window.open uses one of those concepts, but other things use the other ones...

mustaqahmed · 2018-07-16T17:53:31Z

@smaug----: Sorry missed your question in between my vacation. Yes, I tested only popups.

@bzbarsky: In FF, I had noticed a difference between window.open() and navigator.vibrate() but I ruled it out as a bug. Did you mean FF has three different event sets for different "classes" of APIs? If so, is it by design (vs because of isolated/incremental implementation)?

bzbarsky · 2018-07-16T18:37:02Z

Did you mean FF has three different event sets for different "classes" of APIs?

Not just event sets but also propagation rules (e.g. across the event loop, across time within a single event, etc).

If so, is it by design (vs because of isolated/incremental implementation)?

Well, the reason we have multiple different definitions is that the heuristics used by some use cases were judged wrong for other use cases... I'm not sure how to classify that in your classification.

But I would like to point out that the threat model of "a popup opens when the user did not expect it to" is quite different from "a site gets access to the camera when the user did not expect it to" or whatnot. So some things, especially new APIs, may want a more stringent definition of "activation" while others (popup blocking was the poster child here) may need a looser one for backwards compat.

mustaqahmed · 2018-07-17T17:10:39Z

But I would like to point out that the threat model of "a popup opens when the user did not expect it to" is quite different from "a site gets access to the camera when the user did not expect it to" or whatnot. So some things, especially new APIs, may want a more stringent definition of "activation" while others (popup blocking was the poster child here) may need a looser one for backwards compat.

I totally agree that we need to handle user-API-specific requirements, we have those in Chromium too. What I wanted to emphasize is that the details were developed over a long period of time w/o a guiding spec, so we can expect to have some rough edges (e.g. a vague or unintended behavior) that would need some polishing. E.g. here is one such fix we did in Chromium, and I won't be surprised if FF has seen similar issues.

In any case, we would like to have a well-defined set of API-specific variations for the Web, with not-too-many possibilities, right? Otherwise, developers and users would be confused.

This discussion sets the ground for our first major update about User Activation v2 in Chrome, stay tuned (while I update a few things based on @bzbarsky's comment above).

mustaqahmed · 2018-07-17T20:43:19Z

User Activation v2 is now available in Chrome for testing. Please give it a try and let us know your feedback. You can enable the feature in M67+ through chrome://flags/#user-activation-v2 or the command-line flag --enable-features=UserActivationV2.

Here is a short explainer with demos: mustaqahmed.github.io/user-activation-v2.

Plans for this quarter:

Dev/Canary field trial in M69.
An initial PR for the spec in a few weeks.
A Chrome intent to ship in a month.

Replaced most of Section 6.3 with the new user activation model, preserved only a part (events triggering user activation) into a new Section 6.4 which needs to be addressed through a separate issue. Fixes whatwg#1903.

smaug---- · 2018-10-24T21:24:02Z

The more I think this, the more I start to like the approach where HasConsumableUserActivation is propagated through promises and setTimeout etc. and not stored on document or window. The global HasConsumableUserActivation really doesn't work with cases when there is for example the button to accept cookies and then page also has a video.
Sure, even when propagating through callbacks, one could always just start the playback when cookie button is activated, but at least that would require explicit somewhat user hostile behavior from the web page.

User activation v2 is simple to implement, but I'm not sure it gives the best user experience.

RByers · 2018-10-25T08:10:36Z

I agree explicit propagation through all async APIs is able to provide a better user experience in general.

But despite chromium having such a system for years (ableit with some bugs/limitations which were hard to address), there has been little investment from other engines in implementing something similarly powerful. So we've seen a lot of frustration from developers in not being able to rely on or generally reason about the behavior (both due to edge cases in Chrome, and do to huge differences cross-browser), and as such the promised user experience benefits haven't really materialized across the web (things are just as often broken for users).

Therefore we decided in chromium to focus on testing the hypothesis that the simpler model could provide a typical user experience that's generally as good in practice, while being possible to achieve interoperability around.

I personally don't have any interest in continuing to invest in the more complex causality-tracking solution unless either at least two other engines have successfully shipped such a model, or the simpler model has proven to have negative UX consequences which are unacceptable to Chrome.

smaug---- · 2018-10-25T19:33:00Z

Well, I think the issue here is that there hasn't been any kind of proper spec around this. And also, the web has changed a lot over the years. Gecko's popup blocking code for example is mostly from FF1.0 era (written by jst ;) ). The web isn't the same it was in 2004.

Chrome had a partial implementation to propagate the flag, but how could others have followed that without a spec?

mustaqahmed · 2019-03-26T21:08:39Z

User Activation v2 shipped in Chrome 72, and it successfully went through the whole Stable release without any major issues, yayy! 🎂

We encountered a few minor issues, added only one small tweak to the original model which we expect to remove (partially/completely) in future. Here is a summary:

We found a few use-cases (like this) where a frame wants to expose its activation to a specific frame. We are working on a separate follow-up proposal to allow this (the spec discussion User Activation: ability to transfer activation to another frame #4364).
For a few sites, same-origin non-ancestor (child or sibling) frames expect to use the user activation. We temporarily allowed same-origin visibility of user activation to handle these cases. We may be able to remove this entirely in future using the proposal above, or may restrict this to same-origin-child-only visibility. In any case, same-origin activation visibility seems like a natural extension to our original (ancestor-only) visibility model.

This replaces the previous concept of "triggered by user activation" with a more nuanced model, introducing new concepts and a more detailed processing model. This aligns with Chromium's "user activation v2" work, which gained consensus during the discussion at TPAC [1]. (The previous specification did not align with any implementation.) This serves mostly as a foundational improvement. Interop work remains to be done on individual APIs and how they use user activation, as well as the exact set of events which count as user activation. Those issues are tracked in [2] and elsewhere. Fixes #1903. Fixes #3859. [1]: https://docs.google.com/document/d/1gHxQMdXGX4UjjoPXi0c1vhwYregQzWNEHqgbydopCoo/edit#heading=h.9hw95kikso76 [2]: https://github.com/whatwg/html/labels/topic%3A%20user%20activation

domenic mentioned this issue Oct 15, 2016

Events list that trigger "allowed to show a popup" seems too small #1358

Closed

domenic added the compat Standard is not web compatible or proprietary feature needs standardizing label Oct 15, 2016

domenic mentioned this issue Oct 27, 2016

Add some mechanism to know that a message event was triggered by user activation #1983

Closed

domenic mentioned this issue Jan 26, 2017

Add allow-top-navigation-by-user-activation sandbox token #2292

Merged

cynthia mentioned this issue Mar 15, 2017

allow-top-navigation-by-user-activation w3ctag/design-reviews#154

Closed

5 tasks

zcorpan added interop Implementations are not interoperable with each other and removed compat Standard is not web compatible or proprietary feature needs standardizing labels Mar 28, 2017

domenic mentioned this issue Apr 10, 2017

pop-up blocker algorithm doesn't define behavior for touch #599

Closed

csharrison mentioned this issue Oct 13, 2017

Need concept of "consuming" user activation #3122

Closed

marcoscaceres mentioned this issue Mar 20, 2018

user activation must survive postMessage() to call show() web-platform-tests/wpt#10090

Merged

mustaqahmed mentioned this issue May 1, 2018

Treat trusted contextmenu events as user activation. #3659

Merged

gsnedders mentioned this issue May 28, 2018

[testdriver] Enable manual interaction web-platform-tests/wpt#11173

Merged

csharrison mentioned this issue Jul 10, 2018

The UA should be able to disallow speak() from autoplaying WICG/speech-api#27

Open

mustaqahmed mentioned this issue Jul 24, 2018

Need a consistent set of events that defines user activation #3849

Closed

This was referenced Jul 25, 2018

Added a new section on User Activation v2. #3851

Merged

Add ability to test with popup blocker enabled web-platform-tests/wpt#3867

Open

mustaqahmed mentioned this issue Aug 15, 2018

User activation: different concepts for different usages #3859

Closed

mustaqahmed mentioned this issue Oct 17, 2018

Delegating transient user activation WICG/gesture-delegation#14

Open

mustaqahmed mentioned this issue Feb 13, 2019

User Activation: ability to transfer activation to another frame #4364

Closed

mustaqahmed mentioned this issue Mar 29, 2019

User Activation Delegation through postMessages w3ctag/design-reviews#347

Closed

5 tasks

annevk added the topic: user activation label Jun 28, 2019

mustaqahmed mentioned this issue Jun 28, 2019

Algorithms triggered by user activation and window.postMessage / MessageChannel / BroadcastChannel #4741

Closed

domenic closed this as completed in #3851 Dec 4, 2019

mustaqahmed mentioned this issue Dec 5, 2019

[Meta issue] Fix external references to HTML user activation behavior #5129

Open

14 tasks

robertknight mentioned this issue Mar 31, 2022

Simplify login popup creation flow hypothesis/client#4374

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make "triggered by user activation" match browser behavior #1903

Make "triggered by user activation" match browser behavior #1903

jyasskin commented Oct 14, 2016 •

edited

domenic commented Oct 15, 2016

RByers commented Oct 17, 2016

annevk commented Oct 18, 2016

jyasskin commented Oct 21, 2016

upsuper commented Nov 12, 2016

annevk commented Nov 12, 2016

upsuper commented Nov 12, 2016

domenic commented Dec 7, 2016

domenic commented Jan 26, 2017

bzbarsky commented Jan 26, 2017

RByers commented Feb 24, 2017 •

edited

bzbarsky commented Feb 24, 2017

RByers commented Feb 24, 2017

bzbarsky commented Feb 24, 2017

RByers commented Feb 24, 2017

RByers commented Feb 27, 2017

mystor commented Aug 25, 2017

domenic commented Aug 25, 2017

mystor commented Aug 25, 2017

bzbarsky commented Aug 29, 2017

mustaqahmed commented Oct 13, 2017

mustaqahmed commented Jun 20, 2018 •

edited

smaug---- commented Jun 21, 2018 •

edited

bzbarsky commented Jun 26, 2018

mustaqahmed commented Jul 16, 2018

bzbarsky commented Jul 16, 2018

mustaqahmed commented Jul 17, 2018

mustaqahmed commented Jul 17, 2018

smaug---- commented Oct 24, 2018

RByers commented Oct 25, 2018

smaug---- commented Oct 25, 2018

mustaqahmed commented Mar 26, 2019 •

edited

Make "triggered by user activation" match browser behavior #1903

Make "triggered by user activation" match browser behavior #1903

Comments

jyasskin commented Oct 14, 2016 • edited

domenic commented Oct 15, 2016

RByers commented Oct 17, 2016

annevk commented Oct 18, 2016

jyasskin commented Oct 21, 2016

upsuper commented Nov 12, 2016

annevk commented Nov 12, 2016

upsuper commented Nov 12, 2016

domenic commented Dec 7, 2016

domenic commented Jan 26, 2017

bzbarsky commented Jan 26, 2017

RByers commented Feb 24, 2017 • edited

bzbarsky commented Feb 24, 2017

RByers commented Feb 24, 2017

bzbarsky commented Feb 24, 2017

RByers commented Feb 24, 2017

RByers commented Feb 27, 2017

mystor commented Aug 25, 2017

domenic commented Aug 25, 2017

mystor commented Aug 25, 2017

bzbarsky commented Aug 29, 2017

mustaqahmed commented Oct 13, 2017

mustaqahmed commented Jun 20, 2018 • edited

smaug---- commented Jun 21, 2018 • edited

bzbarsky commented Jun 26, 2018

mustaqahmed commented Jul 16, 2018

bzbarsky commented Jul 16, 2018

mustaqahmed commented Jul 17, 2018

mustaqahmed commented Jul 17, 2018

smaug---- commented Oct 24, 2018

RByers commented Oct 25, 2018

smaug---- commented Oct 25, 2018

mustaqahmed commented Mar 26, 2019 • edited

jyasskin commented Oct 14, 2016 •

edited

RByers commented Feb 24, 2017 •

edited

mustaqahmed commented Jun 20, 2018 •

edited

smaug---- commented Jun 21, 2018 •

edited

mustaqahmed commented Mar 26, 2019 •

edited