import random
def fuzzer(max_length: int = 100, char_start: int = 32, char_range: int = 32) -> str:
"""A string of up to `max_length` characters
in the range [`char_start`, `char_start` + `char_range`)"""
= random.randrange(0, max_length + 1)
string_length = ""
out for i in range(0, string_length):
+= chr(random.randrange(char_start, char_start + char_range))
out return out
Fuzzing: Breaking Things with Random Inputs
Summary
This post offers our insights about the chapter called “Fuzzing: Breaking Things with Random Inputs” from the The Fuzzing Book! This chapter teachers us about the use of “fuzzers” or programs that automatically create random sections of numbers, letters, and symbols to create a random test. One of the most basic examples could be realized by creating a “fuzz generator”, which the chapter explains with the following Python function called fuzzer
:
A “fuzzer” can quickly test a command by providing it with random inputs. Suppose you had a function for writing to a file, scribe(data: str) -> None
, as shown in the following code segment. How would you know that a random sequence of bytes can be written without causing the function to crash? You wouldn’t if you did not implement a test case for the function! We can easily test this using the fuzzer()
from the The Fuzzing Book:
import os
import tempfile
def scribe(data: str) -> None:
= "file.txt"
name = tempfile.mkdtemp()
tempdir = os.path.join(tempdir, name)
FILE with open(FILE, "w") as f:
f.write(data)
# clean up the mess!
os.remove(FILE)
os.removedirs(tempdir)
= fuzzer()
input_data scribe(input_data)
After running this through a few hundred iterations, we would begin to feel more comfortable interfacing our scribe()
function to a public API. But how do we run this multiple times in an idiomatic way? Rather than creating thousands of lines of boilerplate source code in our test suite, we may opt to implement what is known as a Runner
.
The “Fuzzing: Breaking Things with Random Inputs” chapter discusses the concept of a Runner
. As shown in the following source code segment, the Runner()
is the component responsible for executing the target application with the generated input. It captures the program’s behavior, logs crashes, and identifies potential vulnerabilities. To learn more about this concept, consider the following example of a runner class, ProgramRunner
, which inherits from the class Runner
.
import subprocess
from typing import Any, List, Tuple, Union
class Runner:
"""Base class for testing inputs."""
# Test outcomes
= "PASS"
PASS = "FAIL"
FAIL = "UNRESOLVED"
UNRESOLVED
def __init__(self) -> None:
"""Initialize"""
pass
def run(self, inp: str) -> Any:
"""Run the runner with the given input"""
return (inp, Runner.UNRESOLVED)
class ProgramRunner(Runner):
"""Test a program with inputs."""
def __init__(self, program: Union[str, List[str]]) -> None:
"""Initialize.
`program` is a program spec as passed to `subprocess.run()`"""
self.program = program
def run_process(self, inp: str = "") -> subprocess.CompletedProcess:
"""Run the program with `inp` as input.
Return result of `subprocess.run()`."""
return subprocess.run(self.program,
input=inp,
=subprocess.PIPE,
stdout=subprocess.PIPE,
stderr=True)
universal_newlines
def run(self, inp: str = "") -> Tuple[subprocess.CompletedProcess, str]:
"""Run the program with `inp` as input.
Return test outcome based on result of `subprocess.run()`."""
= self.run_process(inp)
result
if result.returncode == 0:
= self.PASS
outcome elif result.returncode < 0:
= self.FAIL
outcome else:
= self.UNRESOLVED
outcome return (result, outcome)
As detailed in the code, the Runner
class has three outcomes: PASS
, FAIL
, UNRESOLVED
, and it has a method called run()
, which, because there is nothing given to run, produces the UNRESOLVED
outcome. We inherit ProgramRunner
from Runner
, which is capable of testing any generic program because it invokes the python module subprocess
, which is capable of calling processes through the features provided by the operating system.
Our ProgramRunner
exports two very important methods: run_process()
and run()
. run_process()
is rather raw, and it is a wrapper to subprocess.run()
, which runs a program by name and provides it with an input. run()
, however, neatly interprets the returncode
property of the process invoked by run_process()
. As with all UNIX programs, a nonzero returncode
indicates error. Still, this is not enough to give us a framework for testing functional bits of code. We will need another class that intelligently uses this ProgramRunner
!
The “Fuzzing: Breaking Things with Random Inputs” chapter instructs us to create a class specifically for the purpose of fuzzing. We’ll create a base class Fuzzer
and override it with a specific implementation that creates random strings within a range of lengths, RandomFuzzer
.
class Fuzzer:
def __init__(self) -> None:
"""Constructor"""
pass
def fuzz(self) -> str:
"""Return fuzz input"""
return ""
def run(self, runner: Runner = Runner()) \
-> Tuple[subprocess.CompletedProcess, str]:
return runner.run(self.fuzz())
def runs(self, runner: Runner = Runner(), trials: int = 10) \
-> List[Tuple[subprocess.CompletedProcess, str]]:
"""Run `runner` with fuzz input, `trials` times"""
return [self.run(runner) for i in range(trials)]
class RandomFuzzer(Fuzzer):
"""Produce random inputs"""
def __init__(
self,
int = 10,
min_length: int = 100,
max_length: int = 32,
char_start: int = 32
char_range: -> None:
) """Produce strings of `min_length` to `max_length` characters
in the interval [`char_start`, `char_start` + `char_range`)"""
self.min_length = min_length
self.max_length = max_length
self.char_start = char_start
self.char_range = char_range
def fuzz(self) -> str:
= random.randrange(self.min_length, self.max_length + 1)
string_length = ""
out for i in range(0, string_length):
+= chr(random.randrange(self.char_start, self.char_start + self.char_range))
out return out
Now we can use RandomFuzzer
to kick off a fuzzing process to test inputs of type str
between the two specific lengths! As shown in The Fuzzing Book, we can test the program cat
, which will print out its stdin
, illustrated by the following code segment and its output:
# initialize `cat` program as `ProgramRunner` with `stdin` = "cat"
= ProgramRunner(program="cat")
cat
# create `RandomFuzzer` class capable of random inputs
= RandomFuzzer(min_length=20, max_length=21)
random_fuzzer
# finally, apply the `RandomFuzzer` to the `cat` `ProgramRunner` to fuzz it once:
print("Single Run:\n")
random_fuzzer.run(cat)
# or we can fuzz it for any number of runs!
print("Multiple Runs:\n")
30) random_fuzzer.runs(cat,
Single Run:
Multiple Runs:
[(CompletedProcess(args='cat', returncode=0, stdout='6#5+;$5 .84,*3 >:86+!', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout="#5-<0:?-796$?*4 7&.')", stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout=';#\'0!99"; $%9"5=1-.2', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='4984>9,)53.)$0.**584', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='##=,61.":3;,? 282%%#', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='7";.*&9, 0 )./64":10', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='844="5!>6%7:-#89 ?(+5', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout=' )(20 :%&\'(&286 "$=1', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='*+?:! 0&?#+>"??7;+0"', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout="%+.<--72&)%22&?0!('10", stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout=".#!('1%55#6)0>4.,*$2", stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='(17=5>8-4*&61?;(4?</', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='*,!38\'#>"#/?1)14%>*"4', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='!33=3-56/473;44$$9%$&', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='7#,8*5/1( 6(;=?*/0 ./', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout="$(0&>.!?!5'39???<>1;-", stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout="7 *+ 6!;'/4-6#-'8/*>'", stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='):4292-#2##999(- 4;9', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='\'01&;1-(270,0?<."!;.)', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout="'9!!9(!6950($';*30-19", stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout="*;+/(/3'37<=5*=)'((7", stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='5.4!?6$-?;9,8035,"4$', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='1.32-258: 2("3>#((4#', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='=5)7$- "&9,> ?:=+4-=:', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout=">4,0!5')/+.>)<#-$?.+*", stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='6$/+3$);;*3-29.3<8,!/', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='27-2<)"$4*1;3!%<!<:9', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout=';=,%"&\'#&0.-=*.0<:=7&', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='8*>68.0+?!9* <<%(. 9', stderr=''),
'PASS'),
(CompletedProcess(args='cat', returncode=0, stdout='53=0:#;20+4$#0; /$(40', stderr=''),
'PASS')]
The output shows that cat
worked for each random input. Although this testing process is not comprehensive, it would do a good job at catching problems with inputs that the cat
function may not expected. Note that, even though this simple example restricts the input to random sequences of bytes, RandomFuzzer
could be extended for any kind of data structure and thus to any kind of program! This means that fuzzing is a general-purpose tool we can use for Chasten.
Reflection
This article shows how fuzzing provides an automated way to provide creative inputs to a program, given that we know the kind of data a program is expecting.
Our team resonated with the importance of this chapter, as we have had many unnecessary issues arise on our feature branches and are feeling the pains of not implementing fuzzing sooner. Our colleague Jason Gyamfi states it clearly:
This makes the chapter a must-read for those aiming to improve software strength and safety.
Our team is striving to do just that: “aiming to improve software strength and safety.” While starting the semester, some of us had not even heard of the term “fuzzing.” Now, we all are aware of the term, and some of us have even started implementing it into the test suite of Chasten, our tool for finding patterns in the AST of a python program.
We are all aware of the benefits of fuzzing and its keen ability to point out stress points in our code. That said, the task remains to achieve complete familiarity with fuzzing strategies and to consistently implement them when creating new features. Many of our team members have emphasized the importance of doing this sooner than later, so as to not accrue technical debt. In our case, we use Hypothesis, which offers many powerful strategies for fuzzing.
Action Items
In the “Fuzzing: Breaking Things with Random Inputs” chapter of The Fuzzing Book we develop a deeper understanding about the use of fuzzing and the importance of testing in software engineering. Fuzzing can be an effective way to find weak points in code. The implementation of fuzzing into the Chasten program could have many benefits. When developing Chasten we can check for bugs with in parts of the system that input source code or configure files. The use of fuzzing could also simulate a user’s behavior which would give the team an idea of bugs that a user may encounter. We have to apply what we have learned about fuzzing in these ways:
- Have urgency: When we see that a feature lacks proper test cases, rush to change that!
- Be responsible: When we write our own features, we need to make sure each change includes some sort of test to verify that feature works with fuzzed input.
- Be curious: Explore tooling that exists in order to know “every inch” of its feature set. This way, we can approach our testing the most effectively.