new lint: `chars_enumerate_for_byte_indices` #13435

y21 · 2024-09-21T20:42:33Z

This adds a new lint that checks for uses of the position yielded by `.chars().enumerate()` in a context where a byte index is required, and suggests using `.char_indices()` instead.
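To illustrate the kind of code this is about, here is a minimal sketch. The function names `prefix_before` and `prefix_before_char_count` are made up for this example and are not taken from the PR or its tests. The second function uses `str::get` rather than `&s[..idx]` so that a bad index shows up as `None` instead of a panic.

```rust
/// Returns the part of `s` before the first `needle`, using `.char_indices()`
/// so that the position is a byte offset and is safe to slice with.
fn prefix_before(s: &str, needle: char) -> &str {
    for (byte_idx, c) in s.char_indices() {
        if c == needle {
            return &s[..byte_idx];
        }
    }
    s
}

/// Same intent, but with `.chars().enumerate()`: the position counts chars,
/// not bytes. `str::get` is used instead of `&s[..idx]` so that a bad index
/// yields `None` rather than a panic.
fn prefix_before_char_count(s: &str, needle: char) -> Option<&str> {
    for (char_idx, c) in s.chars().enumerate() {
        if c == needle {
            return s.get(..char_idx);
        }
    }
    Some(s)
}

fn main() {
    // ASCII only: one byte per char, so both versions agree.
    assert_eq!(prefix_before("hello!", '!'), "hello");
    assert_eq!(prefix_before_char_count("hello!", '!'), Some("hello"));

    // 'é' is two bytes in UTF-8, so the char count lags the byte offset
    // and the slice silently comes out one char short.
    assert_eq!(prefix_before("héllo!", '!'), "héllo");
    assert_eq!(prefix_before_char_count("héllo!", '!'), Some("héll"));

    // Here the char count lands inside 'é'; `&s[..1]` would panic.
    assert_eq!(prefix_before_char_count("é!", '!'), None);
}
```

The last two cases show the two failure modes: a wrong but valid slice, and an index that is not on a char boundary at all.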

I'm planning to extend this lint to also detect uses of the position in iterator chains, e.g. `s.chars().enumerate().for_each(|(i, _)| s.split_at(i));`, but that's for another time.
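For the iterator-chain form, a rough sketch of what the `.char_indices()` version might look like is below. Both helper names are hypothetical and only serve the example. `invalid_enumerate_positions` uses `str::is_char_boundary` to count how many positions from `.chars().enumerate()` would make `split_at` panic.

```rust
/// Splits `s` at the start of every char, using byte offsets from
/// `.char_indices()`, which are always valid arguments for `split_at`.
fn splits_at_each_char(s: &str) -> Vec<(&str, &str)> {
    s.char_indices().map(|(i, _)| s.split_at(i)).collect()
}

/// Counts the positions yielded by `.chars().enumerate()` that are not on a
/// char boundary of `s`, i.e. positions where `s.split_at(i)` would panic.
fn invalid_enumerate_positions(s: &str) -> usize {
    s.chars()
        .enumerate()
        .filter(|&(i, _)| !s.is_char_boundary(i))
        .count()
}

fn main() {
    // 'é' occupies bytes 0..2, so 'a' starts at byte offset 2.
    assert_eq!(splits_at_each_char("éa"), vec![("", "éa"), ("é", "a")]);

    // `.chars().enumerate()` yields positions 0 and 1; byte 1 is inside 'é'.
    assert_eq!(invalid_enumerate_positions("éa"), 1);
    assert_eq!(invalid_enumerate_positions("ab"), 0);
}
```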

changelog: new lint: chars_enumerate_for_byte_indices

rustbot · 2024-09-21T20:42:38Z

r? @Centri3

rustbot has assigned @Centri3.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

y21

Some notes for reviewers

y21 · 2024-09-21T20:46:11Z

clippy_lints/src/loops/chars_enumerate_for_byte_indices.rs

+        && cx.typeck_results().expr_ty_adjusted(recv).peel_refs().is_str()
+        && chars_segment.ident.name.as_str() == "chars"


The latest nightly has `str::chars` as a diagnostic item, but our pinned nightly isn't there yet. We might be able to use that if the next sync happens soon enough.

y21 · 2024-09-21T20:47:27Z

tests/ui/chars_enumerate_for_byte_indices.rs

+        // can't use #[expect] here because the .fixed file will still have the attribute and create an
+        // unfulfilled expectation, but make sure lint level attributes work on the use expression:
+        #[allow(clippy::chars_enumerate_for_byte_indices)]
+        let _ = prim[..idx];


This is a fun one. I wonder if that's something that could be fixed in uitest, e.g. by removing `#[expect]` attributes in the `.fixed` file 🤔

y21 · 2024-09-21T20:49:47Z

clippy_lints/src/loops/mod.rs

+    /// ```
+    #[clippy::version = "1.83.0"]
+    pub CHARS_ENUMERATE_FOR_BYTE_INDICES,
+    correctness,


The pattern is technically fine if you know what your strings are (like the description mentions) so it's not always 'outright wrong' like the usual correctness lints, but the fix is also really simple and always applicable so 🤷‍♂️
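As a small illustration of why the pattern can be harmless for some inputs, the sketch below compares the positions from both iterators. It assumes the "known strings" case means ASCII-only input, which is a guess and not stated in this thread. The helper name `positions_agree` is made up.

```rust
/// Returns true if `.chars().enumerate()` and `.char_indices()` yield the
/// same sequence of positions for `s`.
fn positions_agree(s: &str) -> bool {
    s.chars()
        .enumerate()
        .map(|(i, _)| i)
        .eq(s.char_indices().map(|(i, _)| i))
}

fn main() {
    // Every ASCII char is one byte, so char counts equal byte offsets.
    assert!("hello".is_ascii());
    assert!(positions_agree("hello"));

    // A multi-byte char shifts every byte offset that comes after it.
    assert!(!positions_agree("héllo"));
}
```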

Boshen · 2024-09-26T08:45:55Z

Thank you for working on this.

The linter we are working on (oxlint) has encountered dozens of crashes because of this pattern, and there was no way to forbid such usage.

new lint: chars_enumerate_for_byte_indices

97c4bfb

rustbot assigned Centri3 Sep 21, 2024

rustbot added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties label Sep 21, 2024

Boshen mentioned this pull request Sep 26, 2024

Ban index methods on std::str::Chars oxc-project/oxc#6071

Open
