Top 10 String Methods in Pandas
String methods in Pandas are a set of tools for manipulating text (strings) in your data. Pandas, a Python library for data manipulation, provides them so that you do not have to loop over each piece of text and change it by hand. With these methods you can:
- Convert all text to lowercase or uppercase
- Remove extra spaces or unwanted characters
- Extract parts of text based on patterns
- Find and replace certain words or characters
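The four operations listed above can be sketched in a few lines. This is a minimal illustration only: the cities Series and its values are made up for the example and are not part of the data used later in this article.

```python
import pandas as pd

# Hypothetical messy city names, invented for this illustration.
cities = pd.Series(["  New York ", "new-york", "BOSTON  "])

lowered = cities.str.lower()                                 # convert case
trimmed = lowered.str.strip()                                # remove extra spaces
replaced = trimmed.str.replace("-", " ", regex=False)        # find and replace
first_word = replaced.str.extract(r"^(\w+)", expand=False)   # extract by pattern

print(replaced.tolist())    # ['new york', 'new york', 'boston']
print(first_word.tolist())  # ['new', 'new', 'boston']
```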
String Methods in Pandas
Each method is called through the .str accessor of a Series (for example, sports.str.upper()) and is applied to every element, so a single call can process a whole column of thousands or even millions of text entries without an explicit loop.
Method | Description |
---|---|
upper() | Converts each string to uppercase |
lower() | Converts each string to lowercase |
isupper() | Checks whether all cased characters in each string are uppercase |
islower() | Checks whether all cased characters in each string are lowercase |
len() | Returns the length of each string |
startswith() | Returns True if the string starts with the given prefix |
split() | Splits each string around a delimiter |
find() | Returns the lowest index at which the substring is found, or -1 if it is absent |
strip() | Removes leading and trailing whitespace from each string |
replace() | Replaces occurrences of a substring or pattern with another string |
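As a rough sketch of how these methods apply to a whole column, the example below calls one of them on a DataFrame column. The DataFrame and its player column are hypothetical, and the missing entry is included only to show that missing values stay missing rather than raising an error.

```python
import pandas as pd

# Hypothetical DataFrame; the None entry stands in for a missing name.
df = pd.DataFrame({"player": ["Virat", None, "STOKES"]})

# One call transforms every row of the column.
df["player_upper"] = df["player"].str.upper()

print(df["player_upper"].iloc[0])           # VIRAT
print(df["player_upper"].isna().tolist())   # [False, True, False]
```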
Let's explore each method in detail with an example.
We will use the Series below for illustration.
import pandas as pd
sports = pd.Series(['Virat', 'azam', 'fiNch', 'ShakiB', 'STOKES', 'KAne'])
print(sports)
1.) Series.str.upper()
Converts each string to uppercase. This method is useful when normalizing text data for consistency (e.g., converting names or categories to uppercase).
print("Upper Case:")
print(sports.str.upper())
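To illustrate the normalization use case, here is a small sketch that counts distinct labels before and after uppercasing. The teams Series is hypothetical and exists only for this example.

```python
import pandas as pd

# Hypothetical category labels that differ only in case.
teams = pd.Series(["India", "INDIA", "india", "Pakistan"])

print(teams.nunique())              # 4 distinct raw labels
print(teams.str.upper().nunique())  # 2 distinct labels after normalizing
```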
2.) Series.str.lower()
Converts all characters in each string to lowercase, which ensures consistency in text data.
print("Lower Case:")
print(sports.str.lower())
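One common use of lowercasing is case-insensitive matching. The sketch below, which rebuilds the same sports Series so that it runs on its own, looks up a name regardless of how it was capitalized.

```python
import pandas as pd

sports = pd.Series(['Virat', 'azam', 'fiNch', 'ShakiB', 'STOKES', 'KAne'])

# Compare against a lowercase target so the original casing does not matter.
mask = sports.str.lower() == "stokes"

print(sports[mask].tolist())  # ['STOKES']
```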
3.) Series.str.isupper()
Returns a boolean for each string: True if all cased characters in the string are uppercase and the string contains at least one cased character, otherwise False.
print("Checks whether string is in Upper Case:")
print(sports.str.isupper())
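The check applies to the string as a whole and ignores characters that have no case, which the following sketch tries to show. The codes Series is made up for this illustration.

```python
import pandas as pd

# Hypothetical values chosen to show the edge cases.
codes = pd.Series(["ABC", "AB1", "Abc", "123"])

# "AB1" passes because digits have no case; "123" fails because
# it contains no cased character at all.
print(codes.str.isupper().tolist())  # [True, True, False, False]
```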
4.) Series.str.islower()
Returns a boolean for each string: True if all cased characters in the string are lowercase and the string contains at least one cased character, otherwise False.
print("Checks whether string is in Lower Case:")
print(sports.str.islower())
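As a sketch of how the two case checks can be combined, the example below picks out the mixed-case names, that is, the ones that are neither fully lowercase nor fully uppercase. It rebuilds the same sports Series so that it runs on its own.

```python
import pandas as pd

sports = pd.Series(['Virat', 'azam', 'fiNch', 'ShakiB', 'STOKES', 'KAne'])

all_lower = sports.str.islower()
all_upper = sports.str.isupper()

# Mixed case: fails both checks.
mixed = sports[~all_lower & ~all_upper]

print(all_lower.tolist())  # [False, True, False, False, False, False]
print(mixed.tolist())      # ['Virat', 'fiNch', 'ShakiB', 'KAne']
```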
5.) Series.str.len()
Returns the number of characters in each string.
print("Length of strings:")
print(sports.str.len())
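The lengths can also drive a filter. A minimal sketch, using the same sports Series and an arbitrary threshold of five characters chosen for the example:

```python
import pandas as pd

sports = pd.Series(['Virat', 'azam', 'fiNch', 'ShakiB', 'STOKES', 'KAne'])

lengths = sports.str.len()
long_names = sports[lengths > 5]  # threshold picked only for illustration

print(lengths.tolist())     # [5, 4, 5, 6, 6, 4]
print(long_names.tolist())  # ['ShakiB', 'STOKES']
```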
6.) Series.str.startswith()
Returns a boolean for each string indicating whether it starts with the given character sequence. The comparison is case-sensitive.
print("Checks whether string starts with certain substring:")
print(sports.str.startswith('a'))
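Because the prefix match is case-sensitive, one possible way to match regardless of case is to lowercase first. A small sketch on the same sports Series:

```python
import pandas as pd

sports = pd.Series(['Virat', 'azam', 'fiNch', 'ShakiB', 'STOKES', 'KAne'])

# Case-sensitive: no name begins with a lowercase 's'.
exact = sports.str.startswith('s')

# Lowercase first to ignore the original casing.
ignore_case = sports.str.lower().str.startswith('s')

print(exact.tolist())        # [False, False, False, False, False, False]
print(ignore_case.tolist())  # [False, False, False, True, True, False]
```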
7.) Series.str.split()
Splits each string around a given delimiter and returns a list of the pieces for each element.
print("Splits string by character 'a':")
print(sports.str.split('a'))
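The sketch below shows what splitting on 'a' produces for the sports Series, and then how expand=True spreads the pieces into separate columns. The full_names Series is hypothetical and used only for the second part.

```python
import pandas as pd

sports = pd.Series(['Virat', 'azam', 'fiNch', 'ShakiB', 'STOKES', 'KAne'])

# Each element becomes a list; a leading delimiter yields an empty first piece,
# and the uppercase 'A' in 'KAne' is not treated as a delimiter.
pieces = sports.str.split('a')
print(pieces.tolist())
# [['Vir', 't'], ['', 'z', 'm'], ['fiNch'], ['Sh', 'kiB'], ['STOKES'], ['KAne']]

# Hypothetical full names, split into one column per piece.
full_names = pd.Series(["Virat Kohli", "Ben Stokes"])
parts = full_names.str.split(" ", expand=True)
print(parts[0].tolist())  # ['Virat', 'Ben']
print(parts[1].tolist())  # ['Kohli', 'Stokes']
```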
8.) Series.str.find()
Returns, for each string, the lowest index at which the given character sequence occurs, or -1 if it does not occur.
print("Find the index of the searched character or substring:")
print(sports.str.find('a'))
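For reference, this is a sketch of what the search for 'a' yields on the sports Series. Indices are zero-based, and -1 marks strings with no lowercase 'a'.

```python
import pandas as pd

sports = pd.Series(['Virat', 'azam', 'fiNch', 'ShakiB', 'STOKES', 'KAne'])

positions = sports.str.find('a')
print(positions.tolist())  # [3, 0, -1, 2, -1, -1]

# -1 can be used to keep only the strings that contain the substring.
print(sports[positions >= 0].tolist())  # ['Virat', 'azam', 'ShakiB']
```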
9.) Series.str.strip()
Removes leading and trailing whitespace from each string. The sports Series has no surrounding whitespace, so its values come back unchanged here.
print("Remove extra space from the starting and the end of the string:")
print(sports.str.strip())
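To see an actual change, the sketch below uses a hypothetical padded Series, since the names used in this article carry no extra spaces.

```python
import pandas as pd

# Hypothetical names with stray spaces, invented for this illustration.
padded = pd.Series(["  Virat", "azam  ", " fiNch "])

stripped = padded.str.strip()

print(padded.str.len().tolist())    # [7, 6, 7]
print(stripped.tolist())            # ['Virat', 'azam', 'fiNch']
print(stripped.str.len().tolist())  # [5, 4, 5]
```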
10.) Series.str.replace()
Replaces every occurrence of a substring or pattern in each string with another string. Replacing with an empty string, as in this example, removes an unwanted character sequence from all the strings.
print("Replace a particular substring by desired pattern:")
print(sports.str.replace('a', ''))
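Whether the pattern is read as literal text or as a regular expression is controlled by the regex parameter. The sketch below passes it explicitly in both cases, on the assumption that being explicit avoids depending on the default of the installed Pandas version.

```python
import pandas as pd

sports = pd.Series(['Virat', 'azam', 'fiNch', 'ShakiB', 'STOKES', 'KAne'])

# Literal replacement: removes lowercase 'a' only.
literal = sports.str.replace('a', '', regex=False)

# Regular expression: the character class removes both 'a' and 'A'.
pattern = sports.str.replace('[aA]', '', regex=True)

print(literal.tolist())  # ['Virt', 'zm', 'fiNch', 'ShkiB', 'STOKES', 'KAne']
print(pattern.tolist())  # ['Virt', 'zm', 'fiNch', 'ShkiB', 'STOKES', 'Kne']
```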