Null JSON values in context and entity attribute JSON data #37

john-h-kastner-aws · 2023-11-02T21:25:59Z

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

mwhicks1 · 2023-11-02T21:36:07Z

text/0037-null-context-entity-data.md

+This is our current behavior.
+As stated above, it limits what data can be used to construct a context, forcing users to remove it in their own preprocessing passes. 
+Cedar should be easy and ergonomic to use, so we should hesitate to offload work users without good reason, but, if the correct interpretation of `null` is not always the same, it may be the correct choice here.
+
+If we take this alternative, we should keep in mind that many users are likely to find themselves writing the same filter to remove `null` from their data, so we should consider providing this filter ourselves, even if it is not an integral part of the Cedar data format.


This doesn't sound that hard, actually. There are lots of libraries for easy JSON parsing/transformation, no? I think that it might be confusing that Cedar drops nulls for attributes, but errors when they are provided in sets. Better to let users drop null if it's acceptable to them, wherever null might appear?

And the drawbacks you've listed seem non-trivial ...

Yes, based on the drawbacks I'm also no longer fully convinced that dropping null attribute only is the right solution. Dropping null everywhere actually seems better, but the example with all is very misleading.

I don't think there's too much value in us providing the filter ourselves. Our filter would presumably just be a wrapper around some existing JSON parsing function. Also, the context/entity data may be coming from other languages (not Rust) and it may make more sense to do the filtering there.

We have the data as JSONValue objects, so we're in a position to transform it easily and correctly. My concern is that someone might have JSON data as a string, so we're requiring them to parse and update it, or try to do some ad-hoc string operations.

khieta

I'd like to see the scope of this RFC expanded to Cedar's JSON representations for schemas and policies. Although those are conceptually a little different (they're "code"/"metadata" rather than "data") I think it's useful to have the discussion all in one place.

khieta · 2023-11-08T16:37:38Z

text/0037-null-context-entity-data.md

+### Unexpected behavior
+
+Ignoring `null` may not be the desired behavior for all users.
+It seems reasonable that a `null` attribute value can be safely ignored, but a user may expect that `context has foo` evaluates to `true` if `foo` is present in their JSON context data.
+It will instead evaluate to `false` if `foo` is present but has a `null` value.
+
+In support of this point, observe how JavaScript treats objects containing `null`:
+```js
+Object.hasOwn({foo: null}, 'foo') // true
+```


This seems to be a pretty big drawback -- it's making me lean more towards disallowing null.

What errors does a user see when they pass in a null value somewhere? Could we address the issue in this RFC by just making clearer error messages to explain why null values aren't allowed?

Agreed. This is a major draw back.

The current error is data did not match any variant of untagged enum CedarValueJson, so, yes, this need to be improved, but from the complaints we've heard about this, users do understand that null is the problem.

khieta · 2023-11-08T16:41:08Z

text/0037-null-context-entity-data.md

+This is our current behavior.
+As stated above, it limits what data can be used to construct a context, forcing users to remove it in their own preprocessing passes. 
+Cedar should be easy and ergonomic to use, so we should hesitate to offload work users without good reason, but, if the correct interpretation of `null` is not always the same, it may be the correct choice here.
+
+If we take this alternative, we should keep in mind that many users are likely to find themselves writing the same filter to remove `null` from their data, so we should consider providing this filter ourselves, even if it is not an integral part of the Cedar data format.


I don't think there's too much value in us providing the filter ourselves. Our filter would presumably just be a wrapper around some existing JSON parsing function. Also, the context/entity data may be coming from other languages (not Rust) and it may make more sense to do the filtering there.

text/0037-null-context-entity-data.md

cdisselkoen · 2023-11-13T14:27:24Z

text/0037-null-context-entity-data.md

+These are JSON objects, so this RFC as written up to this point implies ignoring `null` on attributes appearing here.
+We will instead continue to treat `null` as an error in this context.


Curious what the reasoning is either way -- why should we accept or not accept "__entity": null and similar.

Expanded on this bit in the text. Treating it as {} would implicitly make it a record instead of an entity. We could parse null as an entity (unspecified?), but I think that's beyond the scope of the problem this RFC wants to solve.

mwhicks1 · 2023-12-02T14:21:55Z

text/0037-null-context-entity-data.md

+This is a good alternative if unless believe that the majority of users want the same non-error parsing of `null`.
+In that case we would be requiring that many users independently re-implement the same preprocessing pass.
+Even in this case, supporting one non-error alternative (e.g., this rfc, or ignore `null` everywhere) would still force users wanting a different option to write their own preprocessing pass while making the behavior when a `null` is included in their data more difficult to diagnose.
+Rather than see an error when parsing, an unexpected parsing of `null` values could result in unexpected authorization decision.
+


I find this argument compelling for staying with the status quo, so I propose to reject this RFC.

In particular: The "ignore attributes set to null" behavior is easily implemented as a pre-pass by customers themselves, but the alternative "error on occurrences of null" anywhere becomes impossible if we change the status quo. We would need a lot of customers to prefer the proposed behavior to override customers who don't want it. Moreover, they would have to be comfortable with the half-measure of only ignoring null in top-level attributes and nowhere else; otherwise they would still need to write their own code to deal with null. I have not seen evidence of a clamor from customers for the proposed behavior change.

A possible alternative would be to provide some sort of utility function that does the JSON null-scrubbing optionally, which customers could invoke prior to converting the JSON to a context or entity, if they wished. But there is no need for an RFC for that.

john-h-kastner-aws · 2023-12-11T16:56:30Z

After discussion on this PR, I am convinced that the drawbacks as listed in the RFC outweigh the benefit of accepting more JSON values, so I am moving this RFC into the final comment period with intent to reject.

cdisselkoen · 2023-12-11T18:07:12Z

If we do reject this RFC as proposed, do we need any documentation updates to make Cedar's behavior on nulls clearer and suggest callers filter nulls as appropriate for their application?

john-h-kastner-aws · 2023-12-12T14:08:37Z

If we do reject this RFC as proposed, do we need any documentation updates to make Cedar's behavior on nulls clearer and suggest callers filter nulls as appropriate for their application?

Yes. Some concrete action items we'll open as issues on relevant repositories if this is rejected:

Update documentation for entity and context JSON formats to explicitly call out null as forbidden.
Update documentation to suggest that users with null in their data may want to preprocess to remove it, but also mention the issues this has around has and contains.
Update error message on null values to clearly state that null is the problem

john-h-kastner-aws · 2023-12-20T20:40:04Z

Closing this RFC as rejected at the end of the final comment period.

RFC 37

32a00cc

mwhicks1 reviewed Nov 2, 2023

View reviewed changes

khieta added the pending This RFC is pending; for definitions see the README label Nov 7, 2023

khieta mentioned this pull request Nov 8, 2023

Change the semantics of appliesTo fields in the schema #35

Closed

khieta reviewed Nov 8, 2023

View reviewed changes

cdisselkoen reviewed Nov 13, 2023

View reviewed changes

Update RFC, expanding on why we might say with the current behavior

cf3572a

mwhicks1 reviewed Dec 2, 2023

View reviewed changes

proof read and fix

fce7aa2

john-h-kastner-aws added final-comment-period This RFC is in its final comment period; for definitions see the README and removed pending This RFC is pending; for definitions see the README labels Dec 11, 2023

fcp

59ba855

john-h-kastner-aws closed this Dec 20, 2023

This was referenced Dec 20, 2023

[Cedar Doc Issue] Document behavior on null in entity json data cedar-policy/cedar-docs#73

Open

Improve error when null occurs in entity json data cedar-policy/cedar#530

Closed

john-h-kastner-aws removed the final-comment-period This RFC is in its final comment period; for definitions see the README label Jan 2, 2024

khieta added the rejected This RFC is rejected (or dropped); for definitions see the README label Mar 5, 2024

		These are JSON objects, so this RFC as written up to this point implies ignoring `null` on attributes appearing here.
		We will instead continue to treat `null` as an error in this context.

Null JSON values in context and entity attribute JSON data #37

Null JSON values in context and entity attribute JSON data #37

Uh oh!

Conversation

john-h-kastner-aws commented Nov 2, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

khieta left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

john-h-kastner-aws commented Dec 11, 2023

Uh oh!

cdisselkoen commented Dec 11, 2023

Uh oh!

john-h-kastner-aws commented Dec 12, 2023

Uh oh!

john-h-kastner-aws commented Dec 20, 2023

Uh oh!

Uh oh!