Python Debugging Workshop

🐧📕🙀

In 1944 a children’s book club sent a volume about penguins to a 10-year-old girl, enclosing a card seeking her opinion. She wrote, “This book gives me more information about penguins than I care to have.”

American diplomat Hugh Gibson called it the finest piece of literary criticism he had ever read.

Transcript

Introduction

Hello everyone! My name is Tim and today I will talk about Python debugging.

First of all, how many of you systematically use a debugger in your daily workflow?

<½ Awesome! It means that a lot of you will leave with a new tool under the belt.
>½ Nice! Then you already know quite a few things that I tell, and it will be a refresher, with a few tips and tricks on top.

Some developers say they don't need a debugger in a scripting language, since they can just look at the source. On one hand it's true and there is a saying

"If you need a debugger, the error had happened much earlier."

Meaning that someone had complicated the logic so much that reading the source code does not help.

On the other hand, however, a complex application, is like a transit system. Certainly, you have your "source code": the transit map and schedules. But would you bet your lunch money on even approximate location of any given train?

That is exactly the problem, that the debuggers address. They allow us to look inside the black box and examine the runtime state of the code.

While preparing this workshop I have looked at a number of debugging tutorials and they all follow the same structure.

They...

warn against using print statements
recap Pdb documentation
show examples of using debugger commands

I tried to make this workshop a bit differently: on top of the three points above we will look at different Python debuggers and ways to invoke them depending on the context: whether you develop locally or run code in a remote shell. I will also share pragmatic tips from my day-to-day.

Print Considered Harmful

Before we start, I am legally required to talk about print-debugging and what's wrong with it.

First of all, print is limitted in what it can do and is often used simply to check whether the code runs at all. Using print for that is an overkill, there is a better way. Python has a 3-character Python built-in for that.

`1/0`

Also known as the Redneck Breakpoint

Unlike the timid print it does not get lost in the console output, it crashes your program loudly and proudly. If you need more than one, use 2/0, 3/0, etc.

Secondly, one print is never enough, once you pop, you cannot stop. You put one, it doesn't work, then you put another one, and yet another one. And in order to see the changes you need to restart the application every time, which may be quite slow, for example in Docker. We want a short feedback loop to test our bug-origin theories as fast as we can without breaking the flow of thought.

The third argument against prints is that they accidentally get into production code. If you don't believe me, try this 3 commands:

# in the current codebase
git grep "print("

# in commit messages
git log --grep print

# across all history
git log -S 'print('

By the way, "it will get to production" applies not only to print statements, but to anything that touches your code but should not go in production. I call this extension of the Murphy's law "The Curse of The Bearded Crab". I used to work for an entertainment events aggregator, and we put silly fake events in our staging environment. And guess what, one day, a misconfigured import, uploaded "the Concert of The Bearded Crab" to the main page. So, think of the Bearded Crab as an evil deity who preys on your lost focus.

Finally, a debugger allows to explore not only your application code, but to also step inside 3rd party dependencies. It is a good way to explore how other people's code works especially in heavy-duty frameworks such as Django, celery, or sklearn.

So, the next time your fingers type print, please stop and put a debugger breakpoint instead.

Debugging as Text Adventure

Ok, enough with print-shaming.

How many people in this room have played text adventure games such as MUDs, Roguelikes, or Interactive Fiction?

Then you will find console-debugging similar to playing such a game:

They are both...

controlled with just a few commands:
- "north, south, open, examine" in games
- and "next, step, break, continue" in debugger
the commands can be abbreviated to single letters:
- In interactive fiction it is common to have n for (n)orth, o for (o)pen, and x for e(x)amine
- Python debugger has the same: n for (n)ext, s for (s)tep, b for (b)reak, and c for (c)continue
Both can be harsh:
- In Nethack, the most known roguelike, when you die you cannot load game, you lose all progress, but can identify your possessions at the moment of death.
- When an unhandled exception hits you in a debugger, you cannot undo, cannot continue, but enter a post-mortem mode to look at the program state at the moment of exception.

Funnily enough, when we look at our codebases, the similarity with dungeons becomes only stronger. See for yourself: we build them over years, layer upon layer, with parts of them getting abandoned, here and there one finds scarce outdated comments from people who are no longer around. The process of recovering the lost knowledge is akin to archeology, but instead of a pickaxe you dig up history with Git commands. Git-archeology among other topics is featured in my other workshop. </shameless_plug>

A typical code-dungeon looks like this:

a single point of entry
each function is a corridor
the more lines in a function the longer the corridor
calling a function within a function takes you one level deeper down the call stack
returning from a function takes you one level up, to the line where you entered
luckily there are no GOTO statements in Python, so each function has a single point of entry,
though, it may have multiple return points, for example a condition check may return the result earlier. And even if you don't return explicitly, Python returns an implicit None.

On this diagram the numbers are relative to each function. In real code all functions may be defined in the same file and their starting line number can be anywhere, but lines in a function are always consecutive.

For your entertainment I have prepared a small game "The Quest for Golden Python". Each level in it focuses on a different aspect of using a debugger.

Installation

Does everyone have Python 3.7+ installed? It's not critical as long as you have Python3.

In order to follow along please clone this repository from

https://github.com/tipishev/python_debugging_workshop
or here is a short link https://git.io/JeEhw

wait till everyone clones the repo

examine the requirements before installing

git clone git@github.com:tipishev/python_debugging_workshop.git
cd python_debugging_workshop
virtualenv -p python3.7 venv
. venv/bin/activate
pip install -r requirements.pip
cd dungeon

give 3-5 minutes to setup

Before we start, let's edit play.py. Here we create an instance of Player with a name and an empty starting inventory. As you can see, the Player class itself is very simple.

All locations in the game are functions to which we pass the player instance and hopefully they return player back. The game starts with enterng main_corridor from which all the other corridors branch. The goal of the game is to obtain the Golden Python, learn debugging, and possibly save the world, I don't know.

Quest 0: Scare the Rat

So, if everyone is ready, let's run the game.

python play.py

We immediately see an error, that happened in the main_corridor. The player was eaten by a rat. Judging by the error message we need to have at least something in our inventory. Let's take a big broomstick and make sure it's a big one, to scare the rat away. And run the the game again.

./play.py

Walking

Having a broom helped, but now we hit a different error and can use a debugger. There are a few ways to start a debugger, depending on how much control you have over the source code. Here we are in full control, so let's put a hadcoded breakpoint, right before entering the main corridor.

type import pdb; pdb.set_trace() right above player = main_corridor(player)

You don't have to worry about ImportError: pdb is part of the standard library.

./play.py

Debuggers don't remove bugs. They only show them in slow motion. – Unknown

Now, when we run play.py we are greeted by the Pdb prompt. Pdb becomes our Dungeon Master, stops the program on the line after the breakpoint and politely asks us what to do next.

First of all, we can just quit the debugger with q.

do that

..or tell it to continue happily running towards the error with c.

do that

Or we can look around us, with l, which stands not for look as in text adventures but for list.

The real fun begins when we start navigating our code dungeon.

There are 2 types of movement:

horizontal – within a single function or frame
vertical – up and down the call stack

Let's look at moving within a single function:

The first command is called next, it simply takes us to the next line. Here and in following examples, blue shows the position before the command, the orange – the position after. So, typing next on line 42 takes us to line 43.

If we are on a line with a function call, we have 2 choices.

We can choose next and the nested function executes behind the scenes and we continue to line 45. Think of next as "stay local and avoid any foreign functions".

Or we can type step and go vertically one level down to line 17, inside the nested function and continue horizontal movement there.

We can also fast travel with until {line_number} instead of typing next or n. This command is useful for skipping loops. If lines 18 to 20 contained a loop, both next or step would make us go through each iteration.

If a return happens before the {line_number}, debugger stops there. For example, if I accidentally typed until 210, the debugger would stop on line 23 and wait for further commands.

If we specifically want to stop before returning to the calling function , we can type return or r.

It gets useful when we

lose interest in the function and just want to see what it returns
get stuck in a loop
or step in the nested function by accident, instead of typing next

Let's try these commands in the debugger.

But before we do that, let's get a lantern. By default Pdb shows just the current line, so we may have to use l a lot. Instead, we can use an iPdb, an iPython-wrapper around Pdb. For that we just add a couple of is to our hardcoded breakpoint.

Notice the improvements:

we now have color vision with syntax highlight
we see 3 lines of context instead of just one

Now let's step in the walking_corridor and get our first amulet.

go through walking corridor: warn about !, c, n, etc. serves right, don't use one-letter variables, list, args are more treacherous

Let's add the 'amulet of walking' to our inventory, so that we don't go there again.

Stacking

Since you see how the game is structured we can move set_trace() directly in the main corridor. And while we are at it, let's upgrade our lantern with context=5

Let's go to the next level and see how to navigate vertically in the call stack.

go through stacking corridor; mention (a)rgs and how they are local to each frame where shows who started the whole circus

Now, let's add the "amulet of stacking" to our evergrowing collection and and see what fails next.

How many of you use Python version less that 3.7?

raise hand, too

But when we switch to Python 3.7, we can start using a builtin keyword breakpoint().

It was introduced in PEP 553 because..

The linter complains about multiple statements on one line
It's short and easy to remember
Even JavaScript has a debugger statement

Let's put it by the entrance to the corridor of looking.

It defaults to pdb.set_trace demonstrate To configure it use export PYTHONBREAKPOINT=...

"" (unset) for default Pdb
0 for ignoring breakpoints, can be the last line of defence in Production environment.
dotted path to a debugger callable
- ipdb.set_trace for iPdb
- pudb.set_trace for Pudb (we'll look at it later)

breakpoint passes args and kwargs to the handler, so we can put breakpoint(context=7) will make iPdb show 7 lines of context.

demonstrate

To summarize showing context, here is an illustration of context in Pdb and iPdb.

If we use the dungeon metaphor, context is how much of surroundings you can see.

Let's switch back to Pdb by unsetting PYTHONBREAKPOINT and starting the script.

export PYTHONBREAKPOINT=""
./play.py

By defaul, PDB shows just the current line, so often we would type l after n or s.

We can turn this into a one liner with double-semicolon:

n;;l
s;;l

We can also set aliases:

alias nl n;;l
alias sl s;;l

Finally, for maximum convenience, save the aliases to .pdbrc to load them at debugger start.

iPdb gives us both color vision and more context, especially when you setu it up explicitly with context keyword.

Pragmatically speaking, same way I don't use the default Python shell, I don't use Pdb if something better is available, although, Pdb has a killer feature: it is always installed. So, if you ssh to a server for the first time, you can run Pdb with no additional setup.

Looking

So, let's go through looking level.

do the looking level

So far, we've been dropping the hardcoded breakpoints. This is fine most of the time, but still gives the Bearded Crab a chance to sneak your breakpoints in the release. The ninja-way is to use the debugger without changing the source code. Here is one way to do it.

python -m ipdb play.py

This way we start from the very first line and keep going. We can avoid this journey by passing a breakpoint.

python -m ipdb -c "b levels/examination.py:71" play.py

Examination

do the examination level

Running in Shell

You may also want to use a debugger inside an interactive shell, like django shell or iPython. When I just started using a debugger I write a wrapper function around the code I wanted to debug like this:

def run():
    import ipdb; ipdb.set_trace()
    jumping_corridor(player)

But later I found that Pdb has 3 functions for exactly that:

run
runeval
runcall

The last one I find the most usefull. Pass into it a callable, and it's arguments and the the call will happen under debugger's control. Let's try that.

Jumping

from play import main_corridor, player
ipdb.runcall(main_corridor, player)

do the level

In real world jumping helps to skip
- network calls
- expensive computation before debug-section

combined e.g. records = decode_xml(requests.get('http://example.com/huge.xml'))

go back if you forgot to check something

Postmortem

While we talk about running code inside a shell, there's a super-useful debugger feature "postmortem" abbreviated to pm(). It finds the last exception that happened iin the shell and allows to explore the state right before it happened.

Has anyone seen a 2011 movie Source Code? It's a movie about a guy who could live through the last 8 minutes of a person in a train accident, but could not change anything. Postmortem is a bit like this. You cannot follow the control, just observe the state at the moment of crash.

"I wish I was there when it happened!"
multiple ways to start
- python -m pdb|ipdb|pudb play.py arg1 arg2 and crash
- import *db; *db.pm() in interactive shell. Will use sys.last_traceback for examination
- %debug in iPython
when a long-running script crashes in your shell, type %debug and walk up the stack and see what happened. It may give insights on what was the error about. Not so long ago, I was fetching events from our partner and ran postmortem on a failed script, it showed me the exact URL on which the connection was reset, so I could continue from that exact spot.

Breaking

Breakpoints help to avoid stepping through heavily-decorated functions, like celery-tasks
breakpoints allow a test-journey:
- put breakpoint, check, put next breakpoint continue, all within a single run
one-time breakpoints
watching variables with post-run commands
break
- filename:lineno | function
- condition
condition may be useful when a bug happens only on certain conditions, like looping over a bunch of orders, and only orders from a single country process incorrectly.
it's better to avoid modifying the code with breakpoint/set_trace, less chance of committing to production
python -m ipdb -c "b levels/main.py:13" -c "b levels/main.py:14" play.py

Sage Search

as soon as in the loop on line 70

person.speak()
l 46,47
player.translate(person.speak())
continue
try again..

ipdb> commands 6
(com) print(player.translate(person.speak()))
(com) end



display person.__class__.__name__
# c a few times
# undisplay

commands 7
print('person.__class__.__name__)
end

Aliases and .pdbrc

pdb config file .pdbrc or ~/.pdbrc, local overrides global, as in git or laws
mostly useful for aliases
alias can take all arguments with %*
import pprint; pprint(self.__dict__) vs pp vars(self)
even if you cannot write to .pdbrc, keep your local config up-to-date and paste the aliases file in remote Pdb-shells, after all it's just a list of legal debugger commands.

PUDB

modern, "v.2019"
works best on a wide screen
5 splits:
- code
- shell
- variables (and watches)
- stack
- breakpoints

Source Code

vi-keys
breakpoint with b
breakpoints are preserved between runs
H to go back to the bottom frame, executing function
m to open modules
ctrl-p to open preferences

Shell

clear button

Variables

n to watch an expression

Stack

stack, not much to say

Breakpoints

edit breakpoint

Exercise

check your terminal size

tput lines
# e.g. 174

tput lines
# e.g. 51

In the code:

from sklearn.metrics import r2_score

y_true = [3, -0.5, 2, 7]
y_pred = [2.5, 0.0, 2, 8]

from pudb.remote import set_trace
set_trace(term_size=(174, 51))

r2_score(y_true, y_pred)

No mouse, only arrows, shortcuts, and vi-keys.

Old commands from PDB work:

(n)ext
(s)tep
(c)ontinue
(d)own
(u)p

Ctrl-l to redraw the screen.

Side panels: vi-keys work, as well as PgUp/PgDn, Home/End.

Resizing:

+ / -
= / _
[ / ]

Debuggers Genealogy

Bdb

the core of all python debuggers
defines Breakpoint class (file, line)
trace_dispatch(frame, event, arg)
- line
- call
- return
- exception
- c_call
- c_return
- c_exception
breakpoints manipulation:
- set_break
- clear_break
- clear_bpbynumber
- clear_all_file_breaks
- clear_all_breaks
- get_{bpbynumber, break, breaks, file_breaks}

CMD aside

part of the standard library
creates a simple command-line interace (CLI) interpreter
client code spectrum:
- simple script
- command with arguments
- Command line environment
- Graphical wrapper?
documentation page has a cute example TurtleShell
Cmd2 has more features, common pattern with standard implementations

Debuggers comparisson

Talk in spectrum Availability -> Features
PDB vs iPDB vs PDB++ vs PUDB vs IDE

Pdb

available everywhere
simple

very basic, rough on the edges shows bdb.Quit
no color

iPdb

can start as %debug from iPython or debug function(args, **kwargs)
tab-completion
colors

requires iPython
no extended functionality as in Pdb++

Pudb:

Graphical
Shows the whole source
Shows source, vars, stack, breakpoints, console on the same screen

∓ uses Vi-style navigation
works better on bigger screens
no jumps inducer/pudb#129

watch-statements

Pdb++ aka Pdbpp

hijacks Pdb name
avoids one-letter trap by preferring context variables to debugger commands, can override with !!command
Disable pdb.set_trace(): any subsequent call to it will be ignored
@pdb.break_on_setattr(attrname, condition=always)
highly configurable

IDE/Visual debuggers

better display of variables and source
cannot run in terminal
extra setup for remote or container debugging
some cost money
~~loss of oldschool-cred~~

Strace

While we talk about looking inside the program runtime with debuggers, there is a Linux utility called strace if you have a long-running script but no idea what it is doing, launch

# for longer lines
sudo strace -s 65535 -v -p {PID}

And it will show what kernel read/write operations are happening, it can leak the URLs the script accesses, the strings it reads and writes.

Preventing Bugs

If you want more effective programmers, you will discover that they should not waste their time debugging, they should not introduce the bugs to start with. – Edsger W. Dijkstra

Broken windows theory

flake8
autopep8
isort

Conclusion

Now you know what to do when you encounter a bug
- don't shy away from shell debuggers, it's like a text adventure after all
- insert breakpoint instead of print
- if it's live use runcall or set_trace in a wrapper
- have some idea of what popular debuggers are out there
Call to action
- install ipdb, pudb, and pdbpp in all environments where you are allowed to run a Python shell
- add useful aliases to a .pdbrc file, to not type pp vars(self) all the time
- the next time you time your code breaks, put import ipdb; ipdb.set_tace(context=10)
- if you get an exception in a shell run *db.pm()