From 830479ece526a127822beba04e9b9edcc902a4b4 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?S=C3=B6ren=20Tempel?= Date: Mon, 6 Apr 2020 14:52:08 +0200 Subject: [PATCH] fuzzing: Add some documentation on writing fuzzing applications --- fuzzing/README.md | 51 +++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 51 insertions(+) diff --git a/fuzzing/README.md b/fuzzing/README.md index baad60f05b..48c22a7f89 100644 --- a/fuzzing/README.md +++ b/fuzzing/README.md @@ -9,6 +9,55 @@ The following additional dependencies are required: * [afl][afl homepage] * [libasan][sanitizers github] (optional but recommended) +## Writing a fuzzing application + +The fuzzing setup relies heavily on RIOT's [netapi][netapi doc] +interface, or more generally RIOT's message passing algorithm. RIOT +modules implementing network protocols usually start a single thread +which receives messages via message passing and interprets these as +protocol messages for further processing. This software architecture is +utilized by the fuzzing setup by sending random data to network modules +using message passing. + +For each network module which should be fuzzed a protocol specific +fuzzing application needs to be written. This application must +initialize the network module and make sure that the thread is started +and configured correctly. Afterwards, the application must read from +standard input where random data is provided by AFL which must be +converted to a `gnrc_pktsnip_t` by the application. The fuzzing module +provides a utility function (`fuzzing_read_packet`) for this purpose. + +Additionally, the fuzzing application is responsible for ensuring the +correct protocol specific encapsulation for the `gnrc_pktsnip_t` type. +For instance, the `gnrc_tcp` module expects each packet, received +via message passing, to be encapsulated in an IP datagram. This +encapsulation is created by the fuzzing application. + +Afterwards, the created `gnrc_pktsnip_t` is dispatched to the started +network module thread using `netapi`. As soon as the network module +finished processing the packet, i.e. frees it (or requests the next one +when using the `sock` API), the fuzzing application is terminated and +started again with new random input by AFL. Currently, only a single +packet can be processed per invocation. + +## Input Corpus + +AFL utilizes mutation-based input generation and as such requires an +initial input corpus. Refer to the [AFL manual][afl-fuzz approach] for +further information an AFL's input generation algorithm. The input +corpus should consist of valid protocol messages and should be as small +as possible. The input corpus for a given fuzzing application is located +in the `input/` subdirectory, each file in this directory represents a +single protocol message. Depending on the specific protocol, an input +corpus may be generated using a script or by extracting packets from a +sniffed sample communication (e.g. using `wireshark`). + +When creating a new fuzzing application, it is advisable to test that +packets of the input corpus are successfully processed. This can be done +by setting `ENABLE_DEBUG` for the tested module and running: + + make -C fuzzing/ term < fuzzing//input/ + ## Invocation Before fuzzing an application it needs to be compiled, to ease detection @@ -33,3 +82,5 @@ Parallel fuzzing is supported through `AFL_FLAGS`, e.g.: [sanitizers github]: https://github.com/google/sanitizers [afl homepage]: http://lcamtuf.coredump.cx/afl/ +[netapi doc]: https://riot-os.org/api/netapi_8h.html +[afl-fuzz approach]: https://github.com/google/AFL/blob/ca01f9a4c4ccb59d349c729ad3018e339f9aae0c/README.md#2-the-afl-fuzz-approach