Introduce pkl-doc model version 2 #1169

bioball · 2025-08-08T18:20:41Z

Implementation of apple/pkl-evolution#20

Currently, in order to update a pkl-doc documentation site, almost the entire existing site is read in order to update metadata like known versions, known subtypes, and more.

For example, adding a new version of a package requires that the existing runtime data of all existing versions be updated. Eventually, this causes the required storage size to balloon as the number of versions grows.

This addresses these limitations by:

1. Updating the runtime data structure to move "known versions" metadata to the package level (the same JSON file is used for all versions).
2. Eliminating known subtype and known usage information at a cross-package level.
3. Generating the search index by consuming the previously generated search index.
4. Generating the main page by consuming the search index.
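
To make the first item concrete, here is a rough sketch of why moving "known versions" to the package level helps. The file paths and function names are made up for illustration and are not pkl-doc's actual layout; the sketch only counts how many files would need rewriting when a new version is published, under the assumption that the old model keeps one copy of the list per version.

```kotlin
// Hypothetical paths for illustration only; pkl-doc's real file layout may differ.

/** Model version 1 (assumed): each version directory holds its own copy of the known-versions list. */
fun knownVersionsFilesV1(pkg: String, existing: List<String>, newVersion: String): List<String> =
    (existing + newVersion).map { version -> "data/$pkg/$version/known-versions.js" }

/** Model version 2 (assumed): one package-level JSON file shared by all versions. */
fun knownVersionsFilesV2(pkg: String): List<String> =
    listOf("data/$pkg/known-versions.json")

fun main() {
    val existing = listOf("1.0.0", "1.1.0", "1.2.0")
    // Publishing 2.0.0 touches one file per version in the per-version layout...
    println(knownVersionsFilesV1("mypkg", existing, "2.0.0").size) // prints 4
    // ...but only a single shared file in the package-level layout.
    println(knownVersionsFilesV2("mypkg").size) // prints 1
}
```

In this simplified model, the number of rewritten files grows with every published version in the first layout and stays constant in the second.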

Because this changes how runtime data is stored, an existing docsite needs to be migrated.

This also introduces a new migration command, pkl-doc --migrate, which transforms an older version of the website into a newer version.

NOTE: most of the additions are due to new input/output tests. I split those off into a separate commit. To review this PR, just take a look at the first commit.

The generated output now needs to be served via an HTTP server (because of ES6 module imports).
You can review the generated site using a command like `python3 -m http.server -d pkl-doc/src/test/files/DocGeneratorTest/output/run-1/`.


bioball · 2025-08-08T18:47:56Z

pkl-doc/src/main/kotlin/org/pkl/doc/DocGenerator.kt


      for (docPackage in docPackages) {
-        if (docPackage.isUnlisted) continue
-


Unlisted filtering logic is moved up to line 104
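
For readers skimming the diff, the shape of this change is roughly the following. This is a simplified sketch with a stand-in `DocPackage` type and a hypothetical `generatePages` function, not the actual `DocGenerator` code: the unlisted check runs once up front instead of as a `continue` inside the loop.

```kotlin
// Stand-in type for illustration; the real DocPackage has many more members.
data class DocPackage(val name: String, val isUnlisted: Boolean)

// Hypothetical function: returns the names of the packages that would get pages.
fun generatePages(docPackages: List<DocPackage>): List<String> {
    // Filter once, before any per-package work happens.
    val listedPackages = docPackages.filterNot { it.isUnlisted }
    val generated = mutableListOf<String>()
    for (docPackage in listedPackages) {
        // No `if (docPackage.isUnlisted) continue` needed here anymore.
        generated += docPackage.name
    }
    return generated
}

fun main() {
    val packages = listOf(DocPackage("a", false), DocPackage("b", true), DocPackage("c", false))
    println(generatePages(packages)) // prints [a, c]
}
```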

bioball · 2025-08-11T22:05:03Z

pkl-doc/src/main/kotlin/org/pkl/doc/PageGenerator.kt

+            id = "search-icon"
+            classes = setOf("material-icons")
+            +"search"
+          }


Unrelated: address a warning that an input is missing a label

bioball · 2025-08-11T22:09:19Z

pkl-doc/src/main/kotlin/org/pkl/doc/RuntimeDataGenerator.kt

-          name("classes").value(classes(item))
-        }
-      }
-    }


The JsonWriter and JsonReader APIs are too low-level; since we are also parsing runtime data, it's much nicer to use kotlinx json.

We don't want to use kotlinx json in pkl-core (which is partially why JsonWriter/JsonReader exist), but we don't have any issues using this library inside the pkl-doc tool.

bioball · 2025-08-11T22:11:25Z

pkl-doc/src/main/kotlin/org/pkl/doc/SearchIndexGenerator.kt

+      stream.write(POSTFIX.toByteArray(StandardCharsets.UTF_8))
+      stream.flush()
+    }
+  }


We are converting the runtime data files from .js to .json. I chose to avoid doing that for the search index because it's not actually a problem right now, and takes more work (more migration needed).

bioball · 2025-08-11T22:13:45Z

pkl-doc/src/main/kotlin/org/pkl/doc/PageGenerator.kt

+    lang = "en-US"
+
    head {
+      meta { charset = "UTF-8" }


Unrelated: address warnings emitted by linters

bioball · 2025-08-21T16:17:23Z

pkl-doc/src/main/kotlin/org/pkl/doc/RuntimeDataGenerator.kt

+  val writtenFiles = mutableSetOf<Path>()
+
+  private suspend fun writeData(packages: List<PackageData>) {
+    coroutineScope {


A `coroutineScope` suspends until all jobs spawned by `launch` have completed, and each of these `launch` jobs is executed in parallel.
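
A minimal standalone sketch of that behavior, assuming the kotlinx.coroutines library is on the classpath. `writeOne` and `writeAll` are hypothetical stand-ins and not the actual `writeData` implementation; the use of `Dispatchers.IO` and a concurrent set is also an assumption made for the example.

```kotlin
import java.util.concurrent.ConcurrentHashMap
import kotlinx.coroutines.Dispatchers
import kotlinx.coroutines.coroutineScope
import kotlinx.coroutines.delay
import kotlinx.coroutines.launch
import kotlinx.coroutines.runBlocking

// Hypothetical stand-in for writing one package's runtime data file.
suspend fun writeOne(name: String, written: MutableSet<String>) {
    delay(10) // simulate I/O
    written += name
}

suspend fun writeAll(names: List<String>): Set<String> {
    // Thread-safe set, because the launched jobs may run on different threads.
    val written = ConcurrentHashMap.newKeySet<String>()
    coroutineScope {
        for (name in names) {
            // Each launch starts a child job; the jobs run concurrently on the IO dispatcher.
            launch(Dispatchers.IO) { writeOne(name, written) }
        }
    }
    // Reaching this line means every launched child job has finished.
    return written
}

fun main() = runBlocking {
    val written = writeAll(listOf("a", "b", "c"))
    println(written.size) // prints 3
}
```

Because `coroutineScope` only returns once its children are done, the caller can safely read the collected set right after the block.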

HT154

LGTM overall. One question, but no blockers.

HT154 · 2025-09-05T01:00:34Z

pkl-doc/src/main/kotlin/org/pkl/doc/PackageDataGenerator.kt

+        isModuleClass -> "$module/index.json".pathEncoded
+        else -> "$module/$type.json".pathEncoded


Would pkldoc have a bad time with a `class index` in a module? If so, it's probably worth catching that and throwing an error or rewriting it to something else.

Yeah, good catch, I think it'll conflict. I won't address that in this PR; this is an existing issue and I'd rather not block this PR on it.
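
For reference, the conflict can be sketched as follows. `dataPath` is a simplified reconstruction of the rule quoted above (without `pathEncoded`), and `findCollisions` is a hypothetical check, not something this PR adds.

```kotlin
// Simplified reconstruction of the quoted path rule; `type == null` stands for the module class.
fun dataPath(module: String, type: String?): String =
    if (type == null) "$module/index.json" else "$module/$type.json"

// Hypothetical check: which class names would overwrite the module's own data file?
fun findCollisions(module: String, classNames: List<String>): List<String> {
    val modulePath = dataPath(module, null)
    return classNames.filter { dataPath(module, it) == modulePath }
}

fun main() {
    println(dataPath("foo/bar", null))    // prints foo/bar/index.json
    println(dataPath("foo/bar", "index")) // prints foo/bar/index.json
    println(findCollisions("foo/bar", listOf("Person", "index"))) // prints [index]
}
```

A check along these lines could either throw an error or map the class to a different file name, as suggested above.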


bioball force-pushed the pkldoc-improvements branch from 5cafc7b to 0b43b89 Compare August 8, 2025 18:23

bioball mentioned this pull request Aug 8, 2025

SPICE-0018: pkldoc I/O Improvements apple/pkl-evolution#20

Open

bioball force-pushed the pkldoc-improvements branch 3 times, most recently from 2f16ea9 to 2a89899 Compare August 11, 2025 22:15

bioball force-pushed the pkldoc-improvements branch 3 times, most recently from 4f9b476 to 827e6c7 Compare August 21, 2025 13:49

bioball force-pushed the pkldoc-improvements branch from 827e6c7 to b965728 Compare September 3, 2025 13:42

bioball added 2 commits September 3, 2025 11:31

Re-generate test output

b00a035

bioball force-pushed the pkldoc-improvements branch from b965728 to b00a035 Compare September 3, 2025 18:31

bioball commented Sep 3, 2025

HT154 approved these changes Sep 5, 2025

Don't use jimfs when testing DocMigratorTest

2a7941b

`copyToRecursively` does not work correctly when copying from Windows paths to jimfs.

bioball force-pushed the pkldoc-improvements branch from eedab7b to 2a7941b Compare September 26, 2025 13:42

Fix test on Windows

061d18f

bioball force-pushed the pkldoc-improvements branch from 9a80791 to 061d18f Compare September 29, 2025 18:17

bioball added 2 commits September 29, 2025 11:51

Fix DocMigratorTest and other misc

0bc6946

Adjust labels for known subtypes/usages within package

b09eba8

bioball merged commit 5d90cf8 into apple:main Sep 29, 2025
4 checks passed

bioball deleted the pkldoc-improvements branch September 29, 2025 23:10

sin-ack mentioned this pull request Oct 8, 2025

Known usages/subtypes missing in pkl-doc model version 2 #1229

Closed
