[red-knot] Add control flow for `try`/`except` blocks (v2) #13633

AlexWaygood · 2024-10-04T18:19:02Z

Summary

This PR adds control flow for try/except/else/finally blocks to red-knot. It's a replacement PR for #13338, which had some fundamental issues in its approach, in particular with regard to finally blocks.

The semantics of try/except blocks are very complicated! I've written up a long document outlining all the various jumps control flow could take, which can be found here. I won't try to summarise that document in this PR description. But I will give a brief description of some of the ways I've attempted to model these semantics in this PR:

Abstractions for handling try/except blocks have been added to a new builder submodule, builder/exception_handlers.rs:

TryNodeContext keeps track of state for a single try/except/else/finally block. Exactly what state we need to keep track of varies according to whether the node has a finally branch, and according to which branch of the StmtTry node we're currently visiting.
TryNodeContextStack is a stack of TryNodeContext instances. For any given scope, try blocks can be arbitrarily nested; this means that we must keep a stack of TryNodeContexts for each scope we visit.
TryNodeContextStackManager is a stack of TryNodeContextStacks. Whenever we enter a nested scope, a new TryNodeContextStack is initialised by the TryNodeContextStackManager and appended to the stack of stacks. Whenever we exit that scope, the TryNodeContextStack is popped off the stack of stacks.
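As a rough illustration of the stack-of-stacks bookkeeping described above, here is a minimal Python sketch. The real code is Rust and lives in builder/exception_handlers.rs; only the three class names come from this description. Every method and field name below (enter_scope, exit_scope, current_stack, push_context, depth, has_finally, snapshots) is an assumption made for the sake of the example, not the PR's actual API.

```python
from dataclasses import dataclass, field


@dataclass
class TryNodeContext:
    """State for a single try statement (hypothetical fields, illustration only)."""

    has_finally: bool
    snapshots: list = field(default_factory=list)


class TryNodeContextStack:
    """One stack per scope: nested try statements push and pop contexts."""

    def __init__(self):
        self._contexts = []

    def push_context(self, has_finally):
        self._contexts.append(TryNodeContext(has_finally))

    def pop_context(self):
        return self._contexts.pop()

    def depth(self):
        return len(self._contexts)


class TryNodeContextStackManager:
    """A stack of per-scope stacks: entering a scope pushes a fresh, empty stack."""

    def __init__(self):
        self._stacks = []

    def enter_scope(self):
        self._stacks.append(TryNodeContextStack())

    def exit_scope(self):
        return self._stacks.pop()

    def current_stack(self):
        return self._stacks[-1]


manager = TryNodeContextStackManager()
manager.enter_scope()  # e.g. the module scope
manager.current_stack().push_context(has_finally=True)  # an enclosing try statement
manager.enter_scope()  # e.g. a function defined inside the try body
inner_depth = manager.current_stack().depth()  # the nested scope starts with no contexts
manager.exit_scope()
outer_depth = manager.current_stack().depth()  # the enclosing try is still tracked
```

The point of the second level of stacking is that a try statement in an enclosing scope should not influence control flow inside a nested function or class body, so each scope starts from an empty stack.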

The diff for this PR is quite large, but it is mostly tests. There aren't actually that many tests, but they unfortunately need to be quite verbose. We may add a more sophisticated understanding of exception handlers in the future (where we would understand that, e.g., `x = 1` can never raise an exception), and I wanted the tests to be robust to that so they won't have to be rewritten when it happens. To that end, wherever a raised exception could cause a jump in control flow, the tests assign from a function call rather than from a literal. This stays robust to future improvements, since we will always consider a function call capable of raising arbitrary exceptions. (It also helps readability of the tests, since a reader obviously knows that `x = 1` can never raise an exception.)
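To make the test-design point concrete, here is a small runnable sketch of the pattern, not one of the PR's actual tests. The helper name is borrowed from the review discussion; its `fail` parameter and the `run` wrapper are assumptions added so the two control-flow paths can be observed at runtime. The real tests use reveal_type instead of returning values.

```python
def could_raise_returns_str(fail: bool = False) -> str:
    """Stand-in for a call that a type checker must assume can raise."""
    if fail:
        raise TypeError("simulated failure")
    return "foo"


def run(fail: bool):
    x = 1  # a literal assignment: a smarter checker could prove this never raises
    try:
        # Assigning from a call means control flow may leave the try body
        # before `x` is rebound, no matter how clever the checker becomes.
        x = could_raise_returns_str(fail)
    except TypeError:
        # Reached only if the call raised, so `x` is still the int 1 here.
        return ("except", x)
    else:
        # Reached only if the call returned, so `x` is the str here.
        return ("else", x)
```

Calling `run(True)` takes the except path with `x` still bound to `1`, while `run(False)` takes the else path with `x` bound to `"foo"`.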

Test Plan

All tests have been added to infer.rs. They all use reveal_type to assert that the type of a variable changes as we move through the various try/except/else/finally branches.

AlexWaygood · 2024-10-04T18:33:42Z

Codspeed reports a 2% regression in the red_knot[cold] benchmark. Unless there's either something I'm doing that's completely the wrong approach performance-wise or there are some easy wins we can see that aren't too complicated, I'd prefer not to worry about that too much and try to optimize it in followup PRs. Getting the semantics correct was hard enough 😅

github-actions · 2024-10-04T18:36:56Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

crates/red_knot_python_semantic/src/types/infer.rs

carljm

Haven't fully reviewed yet, just one kind of fundamental thing that jumped out at me on first look, would like to get your thoughts on that.

carljm · 2024-10-05T14:59:28Z

crates/red_knot_python_semantic/src/semantic_index/builder.rs

+                // These definitions were erased by `self.flow_restore`ing to the post-`else` state.
+                // We can't simply `self.flow_merge()` with any snapshots taken during the `finally` block, however,
+                // as there are more potential definition states inside the `finally` block than there are
+                // from a point after the `finally` block's completion.
+                // Instead, we must manually re-add these definitions to the `use-def` map
+                if let Some(finally_definitions) = self.try_node_context_stack().pop_context() {
+                    for DefinitionRecord {
+                        symbol,
+                        definition,
+                        category,
+                    } in finally_definitions
+                    {
+                        self.current_use_def_map_mut()
+                            .record_definition(symbol, definition, category);
+                    }
+                }


Ah, this is tricky indeed. I hadn't fully understood the awkward consequences of the way finally blocks work for our CFG. Thanks for taking the time to think this through!

Unfortunately I don't think this approach (of storing and then re-applying Definitions in the finally block) is going to give us the right results. Consider a case like this:

x = 1

try:
    x = could_raise_returns_str()
finally:
    y = x

reveal_type(y)

The correct revealed type for y is str, because in any case where code flow continues after the finally, that means the try block actually completed without an exception. But this PR currently gives the revealed type as Literal[1] | str. By storing and reapplying the Definition for y, we get the type of the RHS from the scenario where we might have an exception.
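The runtime behaviour behind this argument can be checked with a short sketch. The `fail` parameter, the `observe` wrapper, and the outer try/except are assumptions added purely so the demo can report what happened; they are not part of the example above.

```python
def could_raise_returns_str(fail: bool) -> str:
    """Stand-in helper: raises when asked to, otherwise returns a str."""
    if fail:
        raise RuntimeError("simulated failure")
    return "foo"


def observe(fail: bool) -> dict:
    seen = {}
    x = 1
    try:
        try:
            x = could_raise_returns_str(fail)
        finally:
            # Runs on both paths, so `x` may be the int or the str here.
            y = x
            seen["inside_finally"] = type(y).__name__
        # Only reached when the try body finished without raising.
        seen["after_finally"] = type(y).__name__
    except RuntimeError:
        pass  # outer handler exists only so the demo can return its findings
    return seen
```

When the call raises, `y` is bound to an int inside the finally block but the line after the block never runs; when it does not raise, `y` is a str in both places. That is why `str` alone is the right type after the finally, while `Literal[1] | str` is only right inside it.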

I think the only way to handle this correctly is to, in some form, duplicate or double visit the finally block. We effectively need to type it twice, once under the assumption that any code it protects might have raised, and again under the assumption that it didn't.

This will be a significant bit of work, as it troubles some core assumptions we have about visiting every expression exactly once. I don't think we should do it in this PR.

But I also don't think we should do this store-and-reapply-definitions thing, either, for two reasons. One is that I think it's just generally important for correctness that we maintain the control-flow-graph abstraction and don't work around it with tricks like this. The other is just about the tradeoff in semantics for Python code. Until/unless we get to a correct double-visit fix, I think the best tradeoff is to accept some false negatives while checking the finally block itself, but ensure we get the types correct after the finally block. In other words, for now I think we should just visit the finally block under the no-exceptions assumption.

What do you think?

AlexWaygood · 2024-10-05

Thanks for the great example that shows the flaws in this approach! Ugh, I really thought I'd covered everything this time 🫠 This was, as I'm sure you guessed, the bit of this PR that I was least sure about.

> But I also don't think we should do this store-and-reapply-definitions thing, either, for two reasons. One is that I think it's just generally important for correctness that we maintain the control-flow-graph abstraction and don't work around it with tricks like this. The other is just about the tradeoff in semantics for Python code. Until/unless we get to a correct double-visit fix, I think the best tradeoff is to accept some false negatives while checking the finally block itself, but ensure we get the types correct after the finally block. In other words, for now I think we should just visit the finally block under the no-exceptions assumption.
>
> What do you think?

I think this makes me a little sad after I spent so much time thinking about finally blocks 😆

I think it is pretty important that we fix this eventually. In the long run, this will lead to false positives as well as false negatives. For example, when we start emitting diagnostics for unreachable code, we will emit spurious errors on the if branch inside the finally block in this snippet, as we will incorrectly infer it as being unreachable:

x = 42

try:
    x = could_raise_returns_int()
except:
    could_raise()
    x = "foo"
else:
    could_raise()
    x = "foo"
finally:
    if isinstance(x, int):
        ...  # we'd probably detect this as unreachable
        # unless we consider the fact that we might have jumped to the `finally`
        # branch from halfway through an `except` or `else` branch
    else:
        ...
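A runnable variant of the snippet above shows that the `if` branch really is reachable. The `fail` flags, the `branch_taken` wrapper, and the outer handler are assumptions added for the demo so that each jump can be triggered on demand.

```python
def could_raise(fail: bool) -> None:
    if fail:
        raise ValueError("simulated failure")


def could_raise_returns_int(fail: bool) -> int:
    could_raise(fail)
    return 7


def branch_taken(try_fails: bool, handler_fails: bool, else_fails: bool) -> str:
    taken = "none"
    x = 42
    try:
        try:
            x = could_raise_returns_int(try_fails)
        except:  # bare except kept to mirror the snippet above
            could_raise(handler_fails)
            x = "foo"
        else:
            could_raise(else_fails)
            x = "foo"
        finally:
            # Entered from halfway through `except` or `else`, `x` is still an int.
            taken = "if" if isinstance(x, int) else "else"
    except ValueError:
        pass  # swallow the propagated error so the result can be returned
    return taken
```

If neither the handler nor the else branch raises, `x` has been rebound to a str by the time the finally block runs; if either raises partway through, the finally block sees the int.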

Another way I thought of trying to fix this "awkwardness" was to utilise the fact that we know that try/except blocks with finally branches desugar to nested try/except blocks. We could attempt to "synthesize" a nested StmtTry node if we see that a StmtTry node has a non-empty finally suite. (Not actually create a synthetic StmtTry node, but visit the StmtTry node exactly as if it were a nested StmtTry inside another StmtTry.) I actually started off trying to do that, but quickly stopped as this PR's current approach seemed like a simpler solution. (And I was also not sure how this would work with the assertions we have that you mentioned above, about only ever visiting every expression once.) Given the issue you just pointed out in your example, it seems like that probably is the only good way of doing it, though; there doesn't seem to be any way of taking shortcuts while respecting Python's semantics properly.
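The desugaring mentioned above can be sketched at runtime: a try statement with a finally branch behaves like a try/finally wrapped around the try/except/else. This is only a behavioural comparison in plain Python, with made-up function names; it says nothing about how the semantic index builder would have to visit the nodes.

```python
def flat(action, log):
    try:
        log.append("try")
        action()
    except KeyError:
        log.append("except")
    else:
        log.append("else")
    finally:
        log.append("finally")


def nested(action, log):
    # Same statement, written as try/except/else nested inside try/finally.
    try:
        try:
            log.append("try")
            action()
        except KeyError:
            log.append("except")
        else:
            log.append("else")
    finally:
        log.append("finally")


def trace(form, action):
    log = []
    try:
        form(action, log)
    except ValueError:
        log.append("propagated")  # the exception escaped the whole statement
    return log


def ok():
    pass


def handled():
    raise KeyError("caught by the except branch")


def unhandled():
    raise ValueError("not caught by any except branch")


results = {a.__name__: (trace(flat, a), trace(nested, a)) for a in (ok, handled, unhandled)}
```

For each of the three actions, both forms produce the same sequence of events, including the case where the exception is not handled and the finally block runs before it propagates.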

carljm · 2024-10-05

Yeah, I think you're right that we will want to fix this.

AlexWaygood · 2024-10-05

I've written up your edge case in my document describing control-flow semantics for exception handlers. It's very specific! I believe it only applies to StmtTry nodes that:

- have finally blocks, and either:
  - do not have any except branches, or
  - have except branches that all lead to immediate termination of the scope following the finally block, through a raise, a return, or similar.

The specificity of the edge case doesn't mean that it's unimportant to consider, however.

carljm · 2024-10-05

I think it also applies to try blocks with except handlers, it's just that the issue shifts to considering the possibility of an exception in the exception handler, rather than an exception in the try block?

And try/finally without except handlers is not an uncommon case.
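A sketch of the shifted version of the problem, where the handler itself may raise. All names here (`maybe_raise`, `observe`, the flags) are hypothetical and exist only for the demo; the outer handler is there so the function can report what it saw.

```python
def maybe_raise(value, fail: bool):
    """Hypothetical helper: returns `value` unless asked to raise."""
    if fail:
        raise RuntimeError("simulated failure")
    return value


def observe(try_fails: bool, handler_fails: bool) -> dict:
    seen = {"in_finally": None, "after_finally": None}
    x = 1
    try:
        try:
            x = maybe_raise("text", try_fails)
        except RuntimeError:
            # If this call raises too, `x` is still the int when `finally` runs.
            x = maybe_raise(b"bytes", handler_fails)
        finally:
            y = x
            seen["in_finally"] = type(y).__name__
        # Reached only if the try body or the handler ran to completion.
        seen["after_finally"] = type(y).__name__
    except RuntimeError:
        pass
    return seen
```

Inside the finally block `y` can be an int, a str, or bytes, but after it only str or bytes is possible, since the int case always ends with the exception propagating out of the statement.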

AlexWaygood · 2024-10-05

> I think it also applies to try blocks with except handlers, it's just that the issue shifts to considering the possibility of an exception in the exception handler, rather than an exception in the try block?

Ah, great point.

> And try/finally without except handlers is not an uncommon case.

I said specific, not uncommon! I agree that try/finally without except is pretty common, so I definitely agree this is an important case to consider.

MichaReiser · 2024-10-05T15:09:14Z

Would it be possible, and would you feel comfortable, to make the internal document public and mention it in the PR summary?

I hope I get to review this on Monday, or no later than Tuesday.

AlexWaygood · 2024-10-05T15:18:21Z

> Would it be possible, and would you feel comfortable, to make the internal document public and mention it in the PR summary?

Done!

AlexWaygood added the red-knot Multi-file analysis & type inference label Oct 4, 2024

AlexWaygood requested review from carljm and MichaReiser as code owners October 4, 2024 18:19

AlexWaygood mentioned this pull request Oct 4, 2024

[red-knot] Add control flow for try/except blocks #13338

Closed

T-256 reviewed Oct 4, 2024


carljm reviewed Oct 5, 2024


AlexWaygood mentioned this pull request Oct 5, 2024

[red-knot] Improve tests relating to type inference for exception handlers #13643

Merged

AlexWaygood added 19 commits October 5, 2024 18:04

Add basic infrastructure required for tracking try/except blocks

3878fb2

make it work

57bf730

Add passing tests for simple try/except blocks

8a947a7

Add passing test for multiple except branches

e52504c

Add passing test for except with else

808a256

Add passing test for multiple excepts with else

1aeb840

Add passing test for finally branch with no excepts

dbd1758

Add passing tests for blocks involving an except and a finally

ddd70aa

Improve some tests

88a3865

Add tests for finally with multiple excepts

d95a627

Improve some tests

0dccf29

Add tests for finally with multiple excepts and an else

31be48f

Add a test for nested try/except blocks

d2c9f30

Add a test for try/excepts in nested scopes

fbb06b2

Better comments

67f4f2a

Fix benchmark assertion

6c31e04

fix lints

37542c8

Improve naming consistency

4c0ed89

microoptimisations for fun

aac8753

AlexWaygood force-pushed the except-handler-2 branch from ccd271b to aac8753 Compare October 5, 2024 17:06
