/debugging-stories

A collection of debugging stories. PRs welcome (sorry for the backlog) :-)

Debugging stories are fun! This is a collection of links to various debugging stories.

Atop segfaults

Breakpoint takes 15ms

Bug that hides from breakpoints

C64 Variable Screen Position crash

Chrome development branch crashes with x87 FPU stack overflow

Crash Bandicoot memory card corruption

Crashes only on Wednesdays

Custom Rust OS crashes

Debian package data corruption, but only on sparc64

Debugging Insights (collection of stories on separate pages, submitted to Dave Agans, the author of "Debugging: The 9 Indispensable Rules for Finding Even the Most Elusive Software and Hardware Problems")

Digital divider with wrong output

Email only goes 500 miles

Impossible random number collision happened in practice

Incorrect sign masking operation

iOS file corruption

FIRST robotics dashboard segfaults due to finalizer

FriendNet / WoW message corruption

FPGA / CPU timing bug

Gdb set follow-fork-mode child doesn't work

Grep segfaults

GWT test passes during normal runs but fails during coverage runs

Java 8 webapp performance problems

JVM has 6-7 second GC pauses

Login only works when sitting down (and other stories). From Jon Bentley's Programming Pearls, section 5.10

Mandrill stops sending messages

Matrix transformation is 10x faster in a test harness than in the game

Memory-corruption / threading bug

MS Money crashes. Includes reverse engineered binary patch fix!

Multicast to 237.0.0.1 acts craters performance

Network link instability due to DB caching policies

Node.js memory leak

OpenOffice + Ubuntu|2009-04-28 - "My wife has complained that OpenOffice will never print on Tuesdays" (bug is actually with the file utility

OS crashes on boot if machine was used recently

PDP-11 crashes only when live cattle are being shipped from northern Ukraine and western Russia

Perl can't read from TMPDIR? (separate analysis of the same issue here)

Polish S doesn't appear on medium

Python hashes files differently on different days

Python/Flask/S3 absurd memory usage

Register allocation algorithm in JIT uses extra register

[SIGALRM timer hangs] (https://nativeguru.wordpress.com/2015/02/19/why-you-should-avoid-using-sigalrm-for-timer)

Signal fails only when returning from page fault handler

Signal handling on 286 emulator for Xenix

Slow logons

SSH sometimes fails from cron

Stack corruptionw when using select()

Switch crashes machine despite not being connected

Tmux/vim strangeness

uint32_t gets truncated to 16 bits

Windows crash dump of nonsensical instruction

Contributors

  • Alexandros Tasos
  • Arthur Axel fREW Schmidt
  • André Luiz
  • Bill Smith
  • bussiere
  • Dan Luu
  • Dan Nguyen
  • Dimitry (unknownexception)
  • Florent Viel
  • Howard Mao
  • Jani Patokallio
  • Jessie Weinstein
  • José Manuel Díez
  • Jouke Waleson
  • Jyrki Vesterinen
  • Kartik Agaram
  • Kevin D. Clark
  • Kirtan Gajjar
  • Louis Vela
  • Matteo Castellini
  • Mark Jason Dominus
  • michaelx386
  • Nan Xiao
  • Nathan Isom
  • Nikola Brežnjak
  • Rafik Draoui
  • Ramon Fried
  • Sean Cassidy
  • James Porter

TODOs:

Improve formatting.

Write up some interesting bugs: 40 hour hang, hardware bug caused by copying behavior of Intel engineering sample found on ebay, etc.