core(image-elements): ignore invalid images where possible #14295

connorjclark · 2022-08-16T00:04:14Z

This is partial progress towards never surfacing image elements in audits if they are associated with a bogus image (such as: src='' being surfaced in an image audit).

This PR does two things:

1 - ImageRecords

This computed artifact now filters out image elements that are probably not a usable image resource. It uses mimeType to do this, but also has an affordance for avif/webp being sent with a borked mimeType. In the future, we should reach into blink/tracing to get more accurate information here.

A follow up PR would add a new audit that details these likely-problematic images, though that should wait for better instrumentation.

We currently don't have network records available to use in snapshot mode, so this PR does nothing for them.

2 - Using ImageRecords more

There are two audits that presently don't support snapshot mode and weren't already utilizing ImageRecords:

lcp-lazy-loaded
preload-lcp-image

Now they use ImageRecords instead of ImageElements directly.

adamraine · 2022-08-16T18:40:38Z

core/test/computed/image-records-test.js

 function mockElement(partial = {}) {
  return {
-    src: 'https://example.com/img.png',
+    src: partial.src ?? 'https://example.com/img.png',


Isn't this what's happening already? Where partial.src overrides this value if provided?

brendankenny · 2022-08-17T21:06:30Z

This might be going too aggressive, though :/ About 1.6% of sites in HTTP Archive (July 2022) currently have an image LCP which would no longer be detected after this. It is catching a bunch of accidental css files (at least going by filenames), but there also just seem to be a ton of people serving images as application/images, application/png, or good old application/xml.

paulirish · 2022-08-17T23:09:02Z

core/computed/image-records.js

      imageRecords.push({
        ...element,
-        mimeType: mimeType ? mimeType : URL.guessMimeType(element.src),
+        mimeType: networkRecord.mimeType,


brendan said:

This might be going too aggressive, though :/ About 1.6% of sites in HTTP Archive (July 2022) currently have an image LCP which would no longer be detected after this. It is catching a bunch of accidental css files (at least going by filenames), but there also just seem to be a ton of people serving images as application/images, application/png, or good old application/xml.

and then I was like... "i thought we were gonna used resourceType but then see #13338 (comment) where you showed an HTML page getting described as resourceType: Image

i tried to reproduce this locally and failed. but then tried <img src="page.html">. Yeah very interesting.

These resourceTypes are defined in the blink loader and set (for non-scripts) here…

I guess this type is still in the "loader" before it's determined that it's an invalid asset for the intended use.

Ah well. So you're right about resourceType not being useful for this situation.

humph.

connorjclark · 2022-08-18T19:14:48Z

but there also just seem to be a ton of people serving images as application/images, application/png, or good old application/xml.

Can you produce a list of these faulty mime-types (or is what's listed the vast majority)? As a first pass, without blink instrumentation, we could just add all these bogus things to an allow-list.

That said, I'll dig around in Chrome and see where this instrumentation could be added.

connorjclark · 2022-08-18T23:25:56Z

caseq informed me that the CDT frontend used to display incongruences in image resources by comparing the resourceType (which is the intent/context) with the mimeType (which is post-sniffing). There's also the raw mime type header available, so we can do both to detect known bads.

Need to test what the "post-sniff" CDP mimeType is for a valid image served as application/images–ideally it somehow detects it's all A-OK and shows the real content mimetype, otherwise we must fallback to the known-bads allowlist.

brendankenny · 2022-08-18T23:29:50Z

Can you produce a list of these faulty mime-types (or is what's listed the vast majority)?

Yes, but I believe the top mimetype that fails this check is text/html :/ '' is also relatively popular.

connorjclark · 2022-08-18T23:33:25Z

Need to test what the "post-sniff" CDP mimeType is for a valid image served as application/images–ideally it somehow detects it's all A-OK and shows the real content mimetype, otherwise we must fallback to the known-bads allowlist.

Result (for .jpg):

if Content-Type header is blank, CDP mimeType is image/jpeg (sniffed), no raw header
if Content-Type header is application/images (or whatever), CDP mimeType matches the raw header

in both cases the image displays, and the type is Image.

connorjclark · 2022-08-18T23:36:34Z

We could move forward with this PR but also introduce a new audit for "hey, all these resources used as images need to better identify themselves" in best practices...

If we're really talking about a small subset of misconfigured websites, that seems fine? esp. to avoid the super weird/common-ish <img src=""> that some god-awful PHP spat out (or however else these things happen)

connorjclark · 2022-08-18T23:55:39Z

related: https://chromium-review.googlesource.com/c/devtools/devtools-frontend/+/2732264

connorjclark · 2024-03-08T00:19:40Z

@brendankenny thoughts on #14295 (comment) ?

core(image-elements): ignore invalid images where possible

46fa48a

connorjclark requested a review from a team as a code owner August 16, 2022 00:04

connorjclark requested review from adamraine and removed request for a team August 16, 2022 00:04

connorjclark added 2 commits August 15, 2022 17:10

jsons

ec1f922

unit

5d0d4d5

vercel bot deployed to Preview August 16, 2022 00:17 View deployment

adamraine approved these changes Aug 17, 2022

View reviewed changes

paulirish reviewed Aug 17, 2022

View reviewed changes

connorjclark added the needs-discussion label Jul 21, 2023

connorjclark added waiting4committer and removed needs-discussion labels Mar 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

core(image-elements): ignore invalid images where possible #14295

core(image-elements): ignore invalid images where possible #14295

Uh oh!

connorjclark commented Aug 16, 2022 •

edited

Loading

Uh oh!

adamraine Aug 16, 2022

Uh oh!

brendankenny commented Aug 17, 2022 •

edited

Loading

Uh oh!

paulirish Aug 17, 2022

Uh oh!

connorjclark commented Aug 18, 2022 •

edited

Loading

Uh oh!

connorjclark commented Aug 18, 2022 •

edited

Loading

Uh oh!

brendankenny commented Aug 18, 2022

Uh oh!

connorjclark commented Aug 18, 2022

Uh oh!

connorjclark commented Aug 18, 2022 •

edited

Loading

Uh oh!

connorjclark commented Aug 18, 2022

Uh oh!

connorjclark commented Mar 8, 2024

Uh oh!

Uh oh!

core(image-elements): ignore invalid images where possible #14295

Are you sure you want to change the base?

core(image-elements): ignore invalid images where possible #14295

Uh oh!

Conversation

connorjclark commented Aug 16, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

1 - ImageRecords

2 - Using ImageRecords more

Uh oh!

adamraine Aug 16, 2022

Choose a reason for hiding this comment

Uh oh!

brendankenny commented Aug 17, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

paulirish Aug 17, 2022

Choose a reason for hiding this comment

Uh oh!

connorjclark commented Aug 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

connorjclark commented Aug 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brendankenny commented Aug 18, 2022

Uh oh!

connorjclark commented Aug 18, 2022

Uh oh!

connorjclark commented Aug 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

connorjclark commented Aug 18, 2022

Uh oh!

connorjclark commented Mar 8, 2024

Uh oh!

Uh oh!

connorjclark commented Aug 16, 2022 •

edited

Loading

brendankenny commented Aug 17, 2022 •

edited

Loading

connorjclark commented Aug 18, 2022 •

edited

Loading

connorjclark commented Aug 18, 2022 •

edited

Loading

connorjclark commented Aug 18, 2022 •

edited

Loading