7.9. Checking Strings

It is often helpful to examine a character and test whether it is uppercase, lowercase, or whether it is a character or a digit.

We can use bracket notation and string methods inside a boolean expression to check if a string meets a certain condition.

7.9.1. Checking Type

The string methods only work on string objects. Let’s say my_var = 'Python'.

my_var.upper() works just fine. However, if we were to reassign a number to the variable, like my_var = 3.14159, then my_var.upper() throws an error.

Let’s see how we can use the type() function to check for a string.

Example

Run the program with each of the following values for my_var:

  1. '1234' vs. 1234

  2. True or False vs. "True" or "False"

  3. 0.333 vs. '0.333'

Follow Up:

  1. Add an elif statement and code block to check for the int data type.

  2. Repeat this process for the float and bool data types.

  3. Rerun the program with the different my_var values.

We can easily modify line 3 to check if my_var is an int, float, or bool data type. We could also use an if/elif/else block to check for the different data types. (TRY IT!)

Note

Of course, we could avoid the conditional block completely with:

print("The value {0} is a {1} data type.".format(my_var, type(my_var))

7.9.2. Comparing Strings

We can use the comparison operators ==, !=, <, >, <=, >= on strings.

Example

1
2
3
4
5
6
7
my_pet = 'dog'
your_pet = 'cat'
other_pet = 'crow'

print(my_pet == your_pet)
print(your_pet[0] == other_pet[0])
print(my_pet < your_pet)

Console Output

False
True
False
  1. Line 5 evaluates if the strings 'dog' and 'cat' are identical and then prints the boolean result (False).

  2. Line 6 evaluates if the first character in 'cat' and 'crow' are the same (True).

    • Note that the first character comparison for 'cat' and 'Crow' would return False due to the different cases.

    • To make the expression case-insensitive, we need to convert each character to the same case (e.g. your_pet[0].lower() == other_pet[0].lower()).

  3. Line 7 evaluates if 'dog' is less than 'cat'. This comparison is a bit more complicated. What is really being compared is each character’s Unicode value. All characters, even letters, have Unicode numbers. The Unicode value of a letter corresponds with its alphabetical order. So a string that comes earlier in the alphabet is considered less than a string that comes later. Since 'dog' follows 'cat' alphabetically, the expression 'dog' < 'cat' evaluates to False.

Note

Case matters when comparing characters! CAPITAL letters have smaller Unicode values than lowercase letters.

'Zebra' < 'zebra' is True,

'Zebra' < 'apple' is True, and

'zebra' < 'apple' is False.

7.9.2.1. Checking with in and not in

If we want to find out if a certain character is in a string, we could iterate through the string and compare each character to the one we want.

1
2
3
4
5
6
title = 'The Hunger Games'
search_character = 'e'

for char in title:
   if char == search_character:
      print("'{0}' is in '{1}'.".format(search_character, title))

However, this is inefficient, since the loop continues even after we find search_character. As coded, the program prints the output once each time search_character is found.

A better approach is to use the in operator (or its opposite, not in) to return the same information. The in operator tests if one string is a substring of another.

1
2
3
4
5
title = 'The Hunger Games'
search_character = 'e'

if search_character in title:
   print("'{0}' is in '{1}'.".format(search_character, title))

Try It!

This example uses the in operator to decide when to increase the value of vowel_count.

  1. The program does not quite work yet. There are 9 vowels in 'Armadillos or anteaters', but the code does not count the capital A.

  2. Fix the code to be case-insensitive. Both capital and lowercase vowels should increase vowel_count.

  3. Refactor the code to report the number of consonants (non-vowels) in the string. (Hint: Use the not in operator).

7.9.3. Checking Case

Let’s explore how we can check the case for a character, slice, or an entire string. Fortunately, Python provides methods that check the case of a string, and they deal with non-letter characters properly.

Example

1
2
3
4
5
6
7
character = 'a'
word = "yep!"
non_letters = '$10.75'

print(character.isupper())
print(word.islower())
print(non_letters.isupper())

Console Output

False
True
False

The isupper() method returns True if all the letters in a string are uppercase. If the string contains a single lowercase letter, or no letters at all, the method returns False. The islower() method behaves in a similar way, but it checks for lowercase letters.

7.9.4. Check Your Understanding

Evaluate whether the following expressions are True or False:

Question

"dog" < "doghouse"
  1. True
  2. False

Question

"dog" < "Dog"
  1. True
  2. False

Question

"dog" < "Doghouse"
  1. True
  2. False

Question

"app" in "Happy"
  1. True
  2. False

Question

For which of the following would text.upper() == text return True?

  1. text = 'Stop Yelling!'
  2. text = 'STOP YELLING!'
  3. text = 'stop yelling!'
  4. text = 'STOP YELLINg!'
  5. All return True
  6. None return True