Experimental: implement OpenPGP message grammar validation by larabr · Pull Request #18 · ProtonMail/openpgpjs

larabr · 2025-05-09T16:35:59Z

This solution gives us the option to probe whether the grammar is too disruptive in a real-world setting, by testing it in the Proton ecosystem.

To access the types internally

twiss · 2025-05-13T15:17:31Z

+      // Grammar validation cannot be run before message integrity has been established,
+      // to avoid leaking info about the unauthenticated message structure.
+      const releaseUnauthenticatedStream = util.isStream(encrypted) && config.allowUnauthenticatedStream;
+      grammarValidator = getMessageGrammarValidator({ delayReporting: releaseUnauthenticatedStream });


I don't think this condition is correct: even if we're not streaming, or `config.allowUnauthenticatedStream` is false, we should still hold off on reporting grammar errors until after the MDC check has been done. So I think this should just be:

Suggested change

-      // Grammar validation cannot be run before message integrity has been established,
-      // to avoid leaking info about the unauthenticated message structure.
-      const releaseUnauthenticatedStream = util.isStream(encrypted) && config.allowUnauthenticatedStream;
-      grammarValidator = getMessageGrammarValidator({ delayReporting: releaseUnauthenticatedStream });
+      // Grammar validation cannot be run before message integrity has been established,
+      // to avoid leaking info about the unauthenticated message structure.
+      grammarValidator = getMessageGrammarValidator({ delayReporting: true });

If we are not streaming unauthenticated data, all the packet bytes are authenticated before being passed to the parser, because there is a readToEnd here: https://github.com/ProtonMail/openpgpjs/blob/main/src/packet/sym_encrypted_integrity_protected_data.js#L213; so the grammar checker can throw as soon as it detects an error.
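To make the two reporting modes in this exchange concrete, here is a minimal sketch of a validator that either throws immediately or holds the error back until integrity has been confirmed. It is not the openpgpjs implementation: `makeGrammarValidator`, `check`, `finalize` and the `isAcceptable` predicate are hypothetical names chosen for illustration, and only the `delayReporting` option name comes from the diff above.

```javascript
// Hypothetical sketch, not the openpgpjs implementation.
class GrammarError extends Error {}

// `isAcceptable` is an assumed predicate over the packet tags seen so far.
function makeGrammarValidator({ delayReporting }, isAcceptable) {
  let pendingError = null;
  return {
    // Call after each parsed packet with the list of tags seen so far.
    check(tagsSoFar) {
      if (pendingError || isAcceptable(tagsSoFar)) return;
      const error = new GrammarError(`Unexpected packet sequence: ${tagsSoFar.join(', ')}`);
      if (!delayReporting) throw error; // input already authenticated: fail fast
      pendingError = error; // unauthenticated stream: stay silent for now
    },
    // Call once the integrity check has passed; surfaces any deferred error.
    finalize() {
      if (pendingError) throw pendingError;
    }
  };
}

// Usage: reject a second Literal Data packet (packet type ID 11).
const atMostOneLiteral = tags => tags.filter(tag => tag === 11).length <= 1;
const delayed = makeGrammarValidator({ delayReporting: true }, atMostOneLiteral);
delayed.check([11, 11]); // records the violation without throwing
```

With `delayReporting: false` the same `check` call would throw right away, which corresponds to the non-streaming case described in the comment above.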

OK yes, fair enough.
But then still, since we have all the data available in that case, what's the advantage of checking after every packet? We're basically checking the grammar repeatedly after every packet, even though we already have all of them. So I think it's faster to just check it once at the end, especially in the common case of the message being valid.
(But, I guess that's less critical.)

Because we can avoid parsing the rest of the stream if there is an issue. It's a cheap check

Yes sure, but again, it doesn't really make sense to optimize for the "if there's an issue" case at the cost of the happy path

It does if the cost for the happy path is zero. It's checking the contents of an array of numbers that has like 5 elements at most 😅

I'm not entirely convinced that it's free, there's a bunch of allocations in there and the regression test complains. But anyway, we can refactor this later if needed.

Yeah the test failed for an actual regression 👼 pushed a fix
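The trade-off discussed in this thread (checking after every packet versus once at the end) can be illustrated with a small simulation. This is only a sketch under stated assumptions: `packetSource`, `readPackets` and `singleLiteralOnly` are hypothetical helpers, parsing is simulated by a generator that counts how many packets were pulled, and the real parser and its costs are not modelled.

```javascript
// Hypothetical simulation: each "packet" is just { tag }, and "parsing"
// a packet means pulling it from the generator.
function* packetSource(tags, counter) {
  for (const tag of tags) {
    counter.parsed += 1;
    yield { tag };
  }
}

// Assumed prefix check: at most one Literal Data packet (type ID 11).
const singleLiteralOnly = tags => tags.filter(tag => tag === 11).length <= 1;

function readPackets(source, isAcceptable, { checkEachPacket }) {
  const seen = [];
  for (const packet of source) {
    seen.push(packet.tag);
    // Per-packet mode: stop pulling packets at the first violation.
    if (checkEachPacket && !isAcceptable(seen)) {
      throw new Error(`Grammar error after packet ${seen.length}`);
    }
  }
  // End-of-list mode: a single check once everything has been parsed.
  if (!isAcceptable(seen)) throw new Error('Grammar error');
  return seen;
}

function countParsed(tags, options) {
  const counter = { parsed: 0 };
  try {
    readPackets(packetSource(tags, counter), singleLiteralOnly, options);
  } catch (e) {
    // The error is expected for invalid input; only the count matters here.
  }
  return counter.parsed;
}
```

For an invalid sequence such as `[11, 11, 2, 2, 2]`, `countParsed` with `checkEachPacket: true` stops after the second packet, while the end-of-list variant pulls all five before failing. For a valid message both variants parse every packet, and the only extra work in per-packet mode is the repeated check on a short tag list.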

…mar`) It enforces a message structure as defined in https://www.rfc-editor.org/rfc/rfc9580.html#section-10.3 (but slightly more permissive, with Padding packets allowed in all cases).

Since it is unclear whether this change might affect the handling of some messages in the wild, generated by odd use cases or non-conformant implementations, we also add the option to disable the grammar check via `config.enforceGrammar`. This solution gives us the option to probe whether the grammar is too disruptive in a real-world setting, by testing it in the Proton ecosystem.

GrammarErrors are only sensitive in the context of unauthenticated decrypted streams.

Data is known to be authenticated at the end of the Packetlist stream parsing, even for messages with MDC check.
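For readers unfamiliar with the grammar referenced above, the following is a rough sketch of checking a flat list of packet type IDs against the message composition rules of RFC 9580, section 10.3. It is an illustration only and not the code added by this PR: `isValidMessage` and `parseMessage` are hypothetical names, Padding packets are skipped everywhere (mirroring the "allowed in all cases" note), and skipping Marker packets is an additional simplifying assumption.

```javascript
// Packet type IDs as assigned in RFC 9580.
const Tag = Object.freeze({
  PKESK: 1, Signature: 2, SKESK: 3, OnePassSignature: 4, CompressedData: 8,
  SED: 9, Marker: 10, LiteralData: 11, SEIPD: 18, Padding: 21
});

const isESK = tag => tag === Tag.PKESK || tag === Tag.SKESK;
const isEncryptedData = tag => tag === Tag.SED || tag === Tag.SEIPD;

// Recursive descent over the tag list: returns the index just past one
// OpenPGP Message starting at `i`, or -1 if none can be parsed there.
function parseMessage(tags, i) {
  const tag = tags[i];
  if (tag === Tag.LiteralData || tag === Tag.CompressedData) return i + 1;
  if (isESK(tag) || isEncryptedData(tag)) {
    while (isESK(tags[i])) i += 1; // optional ESK sequence
    return isEncryptedData(tags[i]) ? i + 1 : -1;
  }
  if (tag === Tag.Signature) return parseMessage(tags, i + 1); // Signature, Message
  if (tag === Tag.OnePassSignature) {
    const end = parseMessage(tags, i + 1); // One-Pass Signature, Message, Signature
    return end !== -1 && tags[end] === Tag.Signature ? end + 1 : -1;
  }
  return -1;
}

// Hypothetical top-level check: exactly one message, nothing trailing.
function isValidMessage(tags) {
  const relevant = tags.filter(tag => tag !== Tag.Padding && tag !== Tag.Marker);
  return parseMessage(relevant, 0) === relevant.length;
}
```

Under these assumptions a one-pass signed literal message (`[4, 11, 2]`) and an encrypted message preceded by session-key packets (`[1, 3, 18]`) are accepted, while two consecutive Literal Data packets (`[11, 11]`) are rejected.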

larabr · 2025-05-14T11:07:10Z

+import enums from './src/enums';
+import config, { type Config, type PartialConfig } from './src/config';
+
+export { enums, config, PartialConfig };


export { enums, config, **Config**, PartialConfig };

(TODO fix for upstream release)

larabr marked this pull request as draft May 10, 2025 01:30

larabr added 2 commits May 12, 2025 23:43

Internal: move enums TS declaration to standalone file

aa44123

To access the types internally

Internal: move config TS declaration to standalone file

7de31f9

To access the types internally

larabr force-pushed the proton-message-grammar-check branch from 90ef0d8 to c7973fd Compare May 13, 2025 12:19

larabr marked this pull request as ready for review May 13, 2025 12:20

larabr force-pushed the proton-message-grammar-check branch from 3b2219a to 1a68010 Compare May 13, 2025 14:43

twiss reviewed May 13, 2025

larabr added 2 commits May 13, 2025 19:03

Simplify grammar logic for unauthenticated case handling

200a1c7

Data is known to be authenticated at the end of the Packetlist stream parsing, even for messages with MDC check.

larabr force-pushed the proton-message-grammar-check branch from 1a68010 to 200a1c7 Compare May 13, 2025 17:03

twiss approved these changes May 14, 2025

Fix grammar check failure for unparseable packets

38159b7

larabr merged commit 414df43 into ProtonMail:main May 14, 2025
12 of 13 checks passed

larabr commented May 14, 2025

larabr mentioned this pull request May 14, 2025

Integrate experimental OpenPGP message grammar validation ProtonMail/pmcrypto#215

Merged

larabr merged 5 commits into ProtonMail:main from larabr:proton-message-grammar-check

larabr commented May 9, 2025

twiss May 13, 2025

larabr May 13, 2025

twiss May 13, 2025

larabr May 13, 2025

twiss May 13, 2025

larabr May 14, 2025

twiss May 14, 2025

larabr May 14, 2025

larabr May 14, 2025

2 participants
