Issue in the #100DaysOfCode talkpython course Days 28-30 regex

Question

Issue in the #100DaysOfCode talkpython course Days 28-30 regex

Closed this issue 7 years ago · 2 comments

Hi guys
In the recording for 'findall' section, Bob mentions in one of the exercises, '\w+' as one or more characters. Shouldn't that be one or more 'words'?
Recording time: 1:17

Cheers

Answer 1 · 2018-05-18T14:31:25.000Z

You got me doubting for a sec there, but it is indeed a character class:

https://docs.python.org/3.8/library/re.html

\w
For Unicode (str) patterns:
Matches Unicode word characters; this includes most characters that can be part of a word in any language, as well as numbers and the underscore. If the ASCII flag is used, only [a-zA-Z0-9_] is matched.

Or letting the code speak:

import re
re.match(r'\w', 'a')
<_sre.SRE_Match object; span=(0, 1), match='a’>

Thanks for keeping a critical eye :)

Answer 2 · 2018-05-19T10:18:58.000Z

Thank for the clarification Bob.

Cheers