Move ambiguous object field name detection into DotExpandingXContentParser #82359

romseygeek · 2022-01-10T11:51:11Z

Detecting when a field name contains double dots, or starts with a dot, is currently
done by the splitAndValidatePath method on DocumentParser. However it makes
more sense to do this as part of the DotExpandingXContentParser which actually
does the work of converting field names containing dots to XContent objects.

…alidation

elasticmachine · 2022-01-10T11:51:15Z

Pinging @elastic/es-search (Team:Search)

romseygeek · 2022-01-10T11:52:19Z

This leaves splitAndValidatePaths in DocumentMapper for now as it is still used by the dynamic mapper construction code, but we can then remove it as part of #81449 .

Note that this is just a refactor and doesn't try and fix #28948

javanna · 2022-01-10T12:35:35Z

server/src/main/java/org/elasticsearch/index/mapper/DocumentParser.java

@@ -461,7 +461,6 @@ private static void innerParseObject(DocumentParserContext context, ObjectMapper
        while (token != XContentParser.Token.END_OBJECT) {
            if (token == XContentParser.Token.FIELD_NAME) {
                currentFieldName = context.parser().currentName();
-                splitAndValidatePath(currentFieldName);


is it ok to omit this call now? What was it doing and where is that method called now? Seems like the new method incorporates some of its logic?

The new method is pretty much a straight copy. It's called in the DotExpandingXContentParser, which InternalDocumentParserContext creates to wrap the source document's xcontent parser. The existing call sites were places where we previously split up field names, but that all happens in the expanding parser now so it makes more sense to handle things there.

ok then I guess I am unclear on why the original method stays around, are there still places where we call it?

yep it's still called from createDynamicUpdate(). This gets completely refactored in #81449 so it will go away entirely there.

I see, thanks! Can we in the meantime share the code between the two then, or are there subtle differences?

They are slightly different because the x-content lib doesn't have access to Strings, but in implementation they are the same. Given that they're private static though, and that the copy in DocumentParser will be removed immediately in a follow up, I'd prefer to keep two versions for the moment?

romseygeek · 2022-01-10T13:18:22Z

@elasticmachine run elasticsearch-ci/docs

romseygeek · 2022-01-10T14:27:48Z

@elasticmachine update branch

javanna · 2022-01-19T13:19:56Z

libs/x-content/src/main/java/org/elasticsearch/xcontent/DotExpandingXContentParser.java

+        for (String part : parts) {
+            // check if the field name contains only whitespace
+            if (part.isEmpty()) {
+                throw new IllegalArgumentException("object field cannot contain only whitespace: ['" + fullFieldPath + "']");


@romseygeek is the mention of "object" in this error and the next one accurate? Don't we split any path regardless of the token we are reading from the parser?

Yes I think that's a leftover, and it was probably inaccurate even before this change. It should be 'field name'

javanna · 2022-01-19T13:26:24Z

libs/x-content/src/main/java/org/elasticsearch/xcontent/DotExpandingXContentParser.java

@@ -48,7 +48,7 @@ public Token nextToken() throws IOException {

        private void expandDots() throws IOException {
            String field = delegate().currentName();
-            String[] subpaths = field.split("\\.");
+            String[] subpaths = splitAndValidatePath(field);
            if (subpaths.length == 0) {


@romseygeek this check is redundant now?

It is, good catch

romseygeek added 3 commits January 10, 2022 11:33

Move ambiguous object path detection into DotExpandingXContentParser

2d775fd

Merge remote-tracking branch 'origin/master' into mapper/field-name-v…

eb277d3

…alidation

precommit

82ef34c

romseygeek added :Search Foundations/Mapping Index mappings, including merging and defining field types >refactoring v8.1.0 labels Jan 10, 2022

romseygeek requested review from nik9000 and javanna January 10, 2022 11:51

romseygeek self-assigned this Jan 10, 2022

elasticmachine added the Team:Search Meta label for search team label Jan 10, 2022

javanna reviewed Jan 10, 2022

View reviewed changes

javanna approved these changes Jan 10, 2022

View reviewed changes

Merge branch 'master' into mapper/field-name-validation

019f633

romseygeek merged commit 0ca3db6 into elastic:master Jan 10, 2022

romseygeek deleted the mapper/field-name-validation branch January 10, 2022 15:29

romseygeek mentioned this pull request Jan 10, 2022

Construct dynamic updates directly via object builders #81449

Merged

javanna reviewed Jan 19, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move ambiguous object field name detection into DotExpandingXContentParser #82359

Move ambiguous object field name detection into DotExpandingXContentParser #82359

romseygeek commented Jan 10, 2022

elasticmachine commented Jan 10, 2022

romseygeek commented Jan 10, 2022

javanna Jan 10, 2022

romseygeek Jan 10, 2022

javanna Jan 10, 2022

romseygeek Jan 10, 2022

javanna Jan 10, 2022

romseygeek Jan 10, 2022

romseygeek commented Jan 10, 2022

romseygeek commented Jan 10, 2022

javanna Jan 19, 2022

romseygeek Jan 19, 2022

javanna Jan 19, 2022

romseygeek Jan 19, 2022

Move ambiguous object field name detection into DotExpandingXContentParser #82359

Move ambiguous object field name detection into DotExpandingXContentParser #82359

Conversation

romseygeek commented Jan 10, 2022

elasticmachine commented Jan 10, 2022

romseygeek commented Jan 10, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

romseygeek commented Jan 10, 2022

romseygeek commented Jan 10, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment