Add defaults to str accessor methods #1305

hamdanal · 2025-08-03T08:34:13Z

Work on #1292 (Do we want to add trivial defaults to the stubs)
Tests added: Please use assert_type() to assert the type of any return value

Added simple default values to the str accessor.

I made a few more changes for things I noticed while adding default values:

Added a dtype parameter to str.decode
Fixed the type of the str.get key: it is documented to accept hashable objects, not just int
Made the table parameter of str.translate covariant, as the method does not modify the input object
Marked the str.wrap parameters as keyword-only (at runtime they are passed through **kwargs) and fixed their types
Simplified some redundant overloads and added missing tests for some edge cases
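To illustrate what "adding trivial defaults" means here, the following is a minimal sketch rather than the actual stub code. The two classes are hypothetical stand-ins for the accessor stub before and after the change, the return types are simplified to None, and the defaults shown for pad (side="left", fillchar=" ") are the ones pandas documents at runtime. Writing `= ...` as a default is the stub-file convention; it also runs in a plain .py file, although a type checker would only accept it in a .pyi.

```python
import inspect
from typing import Literal


class StrAccessorBefore:
    """Hypothetical 'before' stub: defaults are elided with an ellipsis."""

    def pad(
        self,
        width: int,
        side: Literal["left", "right", "both"] = ...,
        fillchar: str = ...,
    ) -> None: ...


class StrAccessorAfter:
    """Hypothetical 'after' stub: simple defaults are spelled out."""

    def pad(
        self,
        width: int,
        side: Literal["left", "right", "both"] = "left",
        fillchar: str = " ",
    ) -> None: ...


def visible_defaults(func) -> dict:
    """Collect the default values a reader (or an IDE tooltip) would see."""
    params = inspect.signature(func).parameters
    return {
        name: p.default
        for name, p in params.items()
        if p.default is not inspect.Parameter.empty
    }


before_defaults = visible_defaults(StrAccessorBefore.pad)
after_defaults = visible_defaults(StrAccessorAfter.pad)
```

With the elided form, tooling can only report that a default exists; with the explicit form, the signature itself documents the runtime behavior.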

loicdiridollou · 2025-08-03T13:50:48Z

tests/test_string_accessors.py

    idx_list = pd.Index([["apple", "banana"], ["cherry", "date"], ["one", "eggplant"]])
    _check(assert_type(idx_list.str.join("-"), "pd.Index[str]"))

+    # wrap doesn't accept positional arguments other than width
+    with pytest.raises(TypeError):
+        idx.str.wrap(80, False)  # type: ignore[misc] # pyright: ignore[reportCallIssue]


I would usually recommend you use TYPE_CHECKING_INVALID_USAGE instead of wrapping it in a pytest context manager: the goal is to test the type checking, not the error raised (same for the other places).

Sure, happy to change it, but isn't pytest.raises better because it tests that the thing we claim raises actually does raise?

It depends on what you want to test. The type checker has nothing to do with pytest; what we want is to make sure that pyright and mypy complain, which is why you put # type: ignore[misc] # pyright: ignore[reportCallIssue].
The fact that it raises a TypeError is something that pandas controls, not the stubs. That is why we wrap it in TYPE_CHECKING_INVALID_USAGE: the code is not run when testing, and we only check how mypy and pyright behave.

One other note: if pandas one day decides to change the TypeError into, for example, a RuntimeError, you would have to change the test even though the behavior of the types has not fundamentally changed. That is why it is often easier for future maintenance to wrap it in TYPE_CHECKING_INVALID_USAGE instead of pytest.raises. Maybe @Dr-Irv can add some details if I missed something.
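The pattern described above can be sketched as follows. This is a self-contained illustration, not the project's test code: the flag is redefined locally under the assumption that it is a module-level constant that is false at runtime, and `wrap` is a hypothetical stand-in for a stubbed method with keyword-only options. The ignore codes are copied from the diff under review and may differ for other signatures.

```python
from typing import Final

# Assumption: a module-level constant that is always false at runtime,
# redefined here so the sketch is self-contained.
TYPE_CHECKING_INVALID_USAGE: Final = False


def wrap(text: str, width: int, *, expand_tabs: bool = True) -> str:
    """Hypothetical stand-in for a method whose options are keyword-only."""
    return text.expandtabs() if expand_tabs else text


def test_wrap_rejects_positional_options() -> None:
    if TYPE_CHECKING_INVALID_USAGE:
        # Never executed. mypy and pyright still analyse this call, and the
        # ignore comments record the errors they are expected to report.
        wrap("abc", 80, False)  # type: ignore[misc] # pyright: ignore[reportCallIssue]


# Running the test executes nothing from the guarded branch, so it cannot
# fail on the exception type chosen by the runtime library.
test_wrap_rejects_positional_options()
```

The check happens entirely in the type-checker run: if the stub stops rejecting the call, the ignore comments become unused, which a checker configured to flag unused ignores would report.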

> It depends what you want to test, the type checker has nothing to do with pytest, what we want is to make sure that pyright and mypy complain thus you put # type: ignore[misc] # pyright: ignore[reportCallIssue].

I am well aware of that; that's why I added the ignore comments in the test.

> The fact that it raises a TypeError is something that pandas controls, not the stubs. That is why we wrap it in TYPE_CHECKING_INVALID_USAGE so it is not run when testing but we only check how mypy and pyright behave.

Sure, but what we actually want is for the type checkers to warn on something that is not supported at runtime. That is why this project also runs pytest: to test the runtime types as well as the static types. Otherwise you'll get type errors on something that is totally valid.

> One other note is that if one day pandas decides to change the TypeError into for example a RuntimeError, you would have to change the test but fundamentally the behavior of the types has not changed. That is why it is often easier for future maintenance to wrap it with TYPE_CHECKING_INVALID_USAGE instead of a pytest.raises

If this is your concern, I think using pytest.raises(Exception) is the best balance: generic enough that it doesn't depend on which error pandas might raise, but it still checks that what the test claims should raise at runtime actually raises.
For example, suppose that in the future pandas starts allowing positional arguments here. With pytest.raises you'll immediately get a failing test, so you know you should fix your stub definition, while with your suggestion you get no such clue and the stubs become out of date with the runtime.

This is all to explain why pytest.raises is better here imo but I don't feel strongly about it so I'll change it to use TYPE_CHECKING_INVALID_USAGE (which I didn't know about so thanks for the hint) similar to the other parts of the project.
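The drift argument made in this comment can be sketched with the standard library alone. Everything here is an illustrative assumption: `wrap_today` and `wrap_future` are hypothetical stand-ins for the runtime function before and after a change that starts accepting a positional flag, and `raises_at_runtime` is a hand-rolled helper that mimics what a `pytest.raises(Exception)` block would check.

```python
import textwrap
from typing import Any, Callable


def wrap_today(text: str, width: int, **kwargs: Any) -> str:
    """Hypothetical current runtime: extra options only arrive via **kwargs."""
    return textwrap.fill(text, width, **kwargs)


def wrap_future(text: str, width: int, expand_tabs: bool = True) -> str:
    """Hypothetical future runtime that accepts the flag positionally."""
    return textwrap.fill(text, width, expand_tabs=expand_tabs)


def raises_at_runtime(func: Callable[..., Any], *args: Any) -> bool:
    """Return True if calling func(*args) raises any Exception."""
    try:
        func(*args)
    except Exception:
        return True
    return False


# Today the positional flag is rejected, so a runtime check passes.
still_rejected = raises_at_runtime(wrap_today, "some text", 80, False)

# After the hypothetical change the same call succeeds, so a runtime check
# would start failing and signal that the keyword-only stub is out of date.
now_accepted = not raises_at_runtime(wrap_future, "some text", 80, False)
```

A guard that never executes the call cannot observe the second case, which is the trade-off being weighed: robustness to changes in the exception type versus early notice when the runtime signature changes.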

Add defaults to str accessor methods

66c2292

loicdiridollou reviewed Aug 3, 2025

do not use pytest.raise

1ae4b9e
