Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

implement route scoping solution #836

Merged
merged 25 commits into from
Aug 2, 2024
Merged
Show file tree
Hide file tree
Changes from 23 commits
Commits
Show all changes
25 commits
Select commit Hold shift + click to select a range
cf40e08
implement route scoping solution
dario-piotrowicz Jul 18, 2024
1318938
remove unused variable
dario-piotrowicz Jul 30, 2024
33e7842
remove appServerActions feature from app14.0.0 fixture
dario-piotrowicz Jul 30, 2024
ef3df46
fixup! implement route scoping solution
dario-piotrowicz Jul 31, 2024
f4428c2
fixup! implement route scoping solution
dario-piotrowicz Jul 31, 2024
eec1d6f
fixup! implement route scoping solution
dario-piotrowicz Jul 31, 2024
435c53a
fixup! implement route scoping solution
dario-piotrowicz Jul 31, 2024
a09e7d8
fixup! implement route scoping solution
dario-piotrowicz Jul 31, 2024
c1f5650
fixup! implement route scoping solution
dario-piotrowicz Jul 31, 2024
9a65f29
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
8a36861
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
d82a558
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
c064ecc
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
ceee48f
avoid string concatenation for warm function imports
dario-piotrowicz Aug 1, 2024
d6c149e
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
b13338a
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
a98cf6c
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
2ccf8cc
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
ed5a721
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
6c0951f
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
3a79e89
fixup! implement route scoping solution
dario-piotrowicz Aug 1, 2024
e2ff7d6
add missing ts-nocheck comment to e2e custom-entrypoint
dario-piotrowicz Aug 1, 2024
b37bce2
fixup! implement route scoping solution
dario-piotrowicz Aug 2, 2024
87976d5
fixup! implement route scoping solution
dario-piotrowicz Aug 2, 2024
ca1ab06
fixup! implement route scoping solution
dario-piotrowicz Aug 2, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
30 changes: 30 additions & 0 deletions .changeset/thirty-birds-build.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
---
'@cloudflare/next-on-pages': patch
---

fix: implement route specific global scoping strategy

currently routes all share the same global scope, this can be problematic and cause
race conditions and failures

One example of this is the following code that is present in route function files:

```ts
self.webpackChunk_N_E = ...
```

and

```ts
self.webpackChunk_N_E.push(...)
```

this indicates that an in-memory global collection of the webpack chunks is shared by all routes,
this combined with the fact that chunks can have their own module state this can easily cause routes to conflict with each other at runtime.

So, in order to solve the above issue, all route functions are wrapped in a function which accepts as parameters, thus overrides, the `self`, `globalThis` and `global` symbols. The symbols
will be resolved with proxies that redirect setters to route-scoped in-memory maps and
getters to the above mentioned map's values and fallback to the original symbol values otherwise
(i.e. `globalThis` will be overridden by a proxy that, when setting values, sets them in a separate
location and, when getting values, gets them from said location if present there or from the real
`globalThis` otherwise)
Original file line number Diff line number Diff line change
Expand Up @@ -101,13 +101,20 @@ async function processFunctionIdentifiers(

if (importPath) {
// Dedupe and update collected imports.
const { updatedContents } = await processImportIdentifier(
{ type, identifier, start, end, importPath, info: identifierInfo },
{ fileContents, entrypoint, newFnLocation, fnConfig: fnInfo.config },
opts,
);
const { updatedContents, newImportToPrepend } =
await processImportIdentifier(
{ type, identifier, start, end, importPath, info: identifierInfo },
{
fileContents,
entrypoint,
newFnLocation,
fnConfig: fnInfo.config,
},
opts,
);

fileContents = updatedContents;
importsToPrepend.push(newImportToPrepend);
} else if (identifierInfo.consumers.length > 1) {
// Only dedupe code blocks if there are multiple consumers.
const { updatedContents, newFilePath, newImport, wasmImports } =
Expand Down Expand Up @@ -143,9 +150,10 @@ async function processFunctionIdentifiers(

// Build the identifier files before building the function's file.
await Promise.all(
[...identifierPathsToBuild].map(async path =>
buildFile(await readFile(path, 'utf8'), path),
),
[...identifierPathsToBuild].map(async path => {
const fileContents = await functionifyFileContent(path);
return buildFile(fileContents, path);
}),
);

// If wasm identifier is used in code block, prepend the import to the code block's file.
Expand All @@ -168,6 +176,27 @@ async function processFunctionIdentifiers(
await Promise.all(functionBuildPromises);
}

/**
* Given a standard ESM file (without imports) it converts it to a function call that returns
* an object with the various exports set as its fields
*
* The function allows us to override global symbols such as `self`, `globalThis` and `global`
* (which are used as the function's parameter names)
*
* @param path the path of the ESM file
* @returns the converted file content
*/
async function functionifyFileContent(path: string) {
const originalFileContents = await readFile(path, 'utf8');
return `
${
/* Note: we need to make sure that the named exports object is defined since that is used inside the file */ ''
}
const ${namedExportsObjectName} = {};
export const ${getNamedExportsFunctionName} = ((self, globalThis, global) => { ${originalFileContents} return ${namedExportsObjectName}; });
`;
}

/**
* Builds a new file for an Edge function.
*
Expand All @@ -184,7 +213,7 @@ async function buildFunctionFile(
{ importsToPrepend }: { importsToPrepend: NewImportInfo[] },
{ workerJsDir, nopDistDir }: ProcessVercelFunctionsOpts,
): Promise<{ buildPromise: Promise<void> }> {
let functionImports = '';
const functionImports: string[] = [];

// Group the identifier imports by the keys for each path.
const groupedImports = importsToPrepend.reduce((acc, { key, path }) => {
Expand All @@ -193,21 +222,43 @@ async function buildFunctionFile(
return acc;
}, new Map<string, string>());

groupedImports.forEach((keys, path) => {
const relativeImportPath = getRelativePathToAncestor({
from: newFnLocation,
relativeTo: nopDistDir,
});
let chunkMapIdx = 0;
const chunksExportsMap = new Map<string, Set<string>>();

const relativeImportPath = getRelativePathToAncestor({
from: newFnLocation,
relativeTo: nopDistDir,
});

groupedImports.forEach((exports, path) => {
const importPath = normalizePath(
join(relativeImportPath, addLeadingSlash(path)),
);

functionImports += `import { ${keys} } from '${importPath}';\n`;
if (path.endsWith('.wasm')) {
// if we're dealing with a wasm file there is a single default export to deal with
const defaultExport = exports;
// we don't need/want to apply any code transformation for wasm imports
functionImports.push(`import ${defaultExport} from "${path}"`);
dario-piotrowicz marked this conversation as resolved.
Show resolved Hide resolved
return;
}

const getNamedExportsFunctionWithId = `${getNamedExportsFunctionName}_${chunkMapIdx++}`;
const exportKeys = exports.split(',');
chunksExportsMap.set(getNamedExportsFunctionWithId, new Set(exportKeys));
functionImports.push(
`import { ${getNamedExportsFunctionName} as ${getNamedExportsFunctionWithId} } from '${importPath}'`,
);
});

fnInfo.outputPath = relative(workerJsDir, newFnPath);

const finalFileContents = `${functionImports}${fileContents}`;
const finalFileContents = iifefyFunctionFile(
fileContents,
functionImports,
fnInfo,
chunksExportsMap,
);
const buildPromise = buildFile(finalFileContents, newFnPath, {
relativeTo: nopDistDir,
}).then(async () => {
Expand All @@ -225,6 +276,60 @@ type BuildFunctionFileOpts = {
newFnPath: string;
};

/**
* Given the content of a function file it converts/wraps it into an iife that overrides the function's contents with an iffe call that
* overrides global symbols with route-specific proxies (for more details see: templates/_worker.js/routesIsolation.ts)
*
* @param fileContents the function file's contents
* @param functionImports the imports that need to be added to the file
* @param fnInfo the function's information
* @param chunksExportsMap a map containing getters and chunks identifiers being used by the function
* @returns the updated/iifefied file content
*/
function iifefyFunctionFile(
fileContents: string,
functionImports: string[],
fnInfo: FunctionInfo,
chunksExportsMap: Map<string, Set<string>>,
): string {
const wrappedContent = `
export default ((self, globalThis, global) => {
${fileContents
// it looks like there can be direct references to _ENTRIES (i.e. `_ENTRIES` instead of `globalThis._ENTRIES` etc...)
// we have to update all such references otherwise our proxying won't take effect on those
// (note: older versions of the Vercel CLI (v31 and older) used to declare _ENTRIES as "let _ENTRIES = {};", so we do
// need to make sure that we don't add `globalThis.` in these cases (if we were to drop support for those older versions
// the below line to: ".replace(/([^.])_ENTRIES/g, '$1globalThis._ENTRIES')")
.replace(/(?<!(let)\s*)([^.]|^)_ENTRIES/g, '$2globalThis._ENTRIES')
// the default export needs to become the return value of the iife, which is then re-exported as default
.replace(/export\s+default\s+/g, 'return ')}
})(proxy, proxy, proxy);
`;

const proxyCall = `const proxy = globalThis.__nextOnPagesRoutesIsolation.getProxyFor('${
fnInfo.route?.path ?? ''
}');`;

const chunksExtraction = [...chunksExportsMap.entries()].flatMap(
([getNamedExportsFunctionWithId, keys]) => {
const namedExportsObjectWithId = `__exportsOf${getNamedExportsFunctionWithId}`;
return [
`const ${namedExportsObjectWithId} = ${getNamedExportsFunctionWithId}(proxy, proxy, proxy);`,
...[...keys.keys()].map(
key => `const ${key} = ${namedExportsObjectWithId}["${key}"]`,
),
];
},
);

return [
...functionImports,
proxyCall,
...chunksExtraction,
wrappedContent,
].join(';');
dario-piotrowicz marked this conversation as resolved.
Show resolved Hide resolved
}

/**
* Prepends Wasm imports to a code block's built file.
*
Expand All @@ -246,19 +351,22 @@ async function prependWasmImportsToCodeBlocks(
relativeTo: nopDistDir,
});

let functionImports = '';
const functionImports: string[] = [];

for (const identifier of wasmImports) {
// eslint-disable-next-line @typescript-eslint/no-non-null-assertion
const { newDest } = identifierMaps.wasm.get(identifier)!;
const wasmImportPath = normalizePath(
join(relativeImportPath, newDest as string),
);
functionImports += `import ${identifier} from "${wasmImportPath}";\n`;
functionImports.push(`import ${identifier} from "${wasmImportPath}"`);
}

const oldContents = await readFile(filePath);
await writeFile(filePath, `${functionImports}${oldContents}`);
await writeFile(
filePath,
`${functionImports.join(';')};${oldContents}`,
);
},
),
);
Expand All @@ -268,12 +376,13 @@ async function prependWasmImportsToCodeBlocks(
* Processes an import path identifier.
*
* - Moves the imported file to the new location if it doesn't exist.
* - Updates the file contents to import from the new location.
* - Updates the file contents to remove the import.
* - Returns the information for the new import.
*
* @param ident The import path identifier to process.
* @param processOpts Contents of the function's file, the function's entrypoint, and the new path.
* @param opts Options for processing the function.
* @returns The updated file contents.
* @returns The updated file contents alongside the new import information.
*/
async function processImportIdentifier(
ident: RawIdentifierWithImport<IdentifierType> & { info: IdentifierInfo },
Expand All @@ -284,7 +393,7 @@ async function processImportIdentifier(
fnConfig,
}: ProcessImportIdentifierOpts,
{ nopDistDir, workerJsDir }: ProcessVercelFunctionsOpts,
): Promise<{ updatedContents: string }> {
): Promise<{ updatedContents: string; newImportToPrepend: NewImportInfo }> {
const { type, identifier, start, end, importPath, info } = ident;
let updatedContents = fileContents;

Expand Down Expand Up @@ -320,14 +429,20 @@ async function processImportIdentifier(
});
const newImportPath = normalizePath(join(relativeImportPath, info.newDest));

const newVal = `import ${identifier} from "${newImportPath}";`;
// let's remove the original import since it will be re-added later when appropriate
updatedContents = replaceLastSubstringInstance(
updatedContents,
codeBlock,
newVal,
'',
);

return { updatedContents };
return {
updatedContents,
newImportToPrepend: {
key: identifier,
path: newImportPath,
},
};
}

type ProcessImportIdentifierOpts = {
Expand Down Expand Up @@ -383,7 +498,7 @@ async function processCodeBlockIdentifier(
.forEach(key => wasmImports.push(key));

const buffer = Buffer.from(
`export const ${identifierKey} = ${codeBlock}\n`,
`${namedExportsObjectName}["${identifierKey}"] = ${codeBlock}\n`,
'utf8',
);

Expand Down Expand Up @@ -554,3 +669,43 @@ async function processBundledAssets(
}
}
}

/**
* When performing the various code tweaking we never introduce standard named ESM exports, since those would
* be invalid anyways since each js file content gets wrapped into a function anyways.
*
* Note: route function files don't have named exports to this is only needed for the other files such
* as the manifest ones and the webpack chunks ones
*
* Instead of standard named exports we simply set named exports onto an object which gets then returned by the
* file wrapper function.
*
*
* Example:
* when introducing a new export, instead of introducing:
* ```
* export const a = ...
* ```
* we introduce something like:
* ```
* NAMED_EXPORTS_OBJECT["a"] = ...
* ```
* and make sure that such <NAMED_EXPORTS_OBJECT> object is always declared and returned by the function wrapping the file's contents.
*
* This is the name of the object used for such exports.
*/
const namedExportsObjectName = '__namedExportsObject';

/**
* The strategy of exporting exports via an returned object relies on the fact that there is a function that returns such object.
*
* Example:
* ```
* import { __getNamedExports } from '...';
* const obj = __getNamedExports(...);
* // obj contains the various exports as standard object properties
* ```
*
* This is the name of such function.
*/
const getNamedExportsFunctionName = '__getNamedExports';
2 changes: 2 additions & 0 deletions packages/next-on-pages/templates/_worker.js/index.ts
Original file line number Diff line number Diff line change
@@ -1,5 +1,6 @@
import { SUSPENSE_CACHE_URL } from '../cache';
import { handleRequest } from './handleRequest';
import { setupRoutesIsolation } from './routesIsolation';
import {
adjustRequestForVercel,
handleImageResizingRequest,
Expand All @@ -22,6 +23,7 @@ declare const __ALSes_PROMISE__: Promise<null | {

export default {
async fetch(request, env, ctx) {
setupRoutesIsolation();
patchFetch();

const asyncLocalStorages = await __ALSes_PROMISE__;
Expand Down
Loading
Loading