Please use this identifier to cite or link to this item:
https://doi.org/10.1109/TSE.2019.2941681
Title: | Smart Greybox Fuzzing | Authors: | Pham, Van-Thuan Bohme, Marcel Santosa, Andrew E Caciulescu, Alexandru Razvan Roychoudhury, Abhik |
Keywords: | Science & Technology Technology Computer Science, Software Engineering Engineering, Electrical & Electronic Computer Science Engineering Fuzzing Computer bugs Libraries Tools Dictionaries Open area test sites Schedules Vulnerability detection smart fuzzing automated testing file format grammar input structure |
Issue Date: | 1-Sep-2021 | Publisher: | Institute of Electrical and Electronics Engineers | Citation: | Pham, Van-Thuan, Bohme, Marcel, Santosa, Andrew E, Caciulescu, Alexandru Razvan, Roychoudhury, Abhik (2021-09-01). Smart Greybox Fuzzing. IEEE Transactions on Software Engineering 47 (9) : 1980-1997. ScholarBank@NUS Repository. https://doi.org/10.1109/TSE.2019.2941681 | Abstract: | Coverage-based greybox fuzzing (CGF) is one of the most successful approaches for automated vulnerability detection. Given a seed file (as a sequence of bits), a CGF randomly flips, deletes or copies some bits to generate new files. CGF iteratively constructs (and fuzzes) a seed corpus by retaining those generated files which enhance coverage. However, random bitflips are unlikely to produce valid files (or valid chunks in files), for applications processing complex file formats. In this work, we introduce smart greybox fuzzing (SGF) which leverages a high-level structural representation of the seed file to generate new files. We define innovative mutation operators that work on the virtual file structure rather than on the bit level which allows SGF to explore completely new input domains while maintaining file validity. We introduce a novel validity-based power schedule that enables SGF to spend more time generating files that are more likely to pass the parsing stage of the program, which can expose vulnerabilities much deeper in the processing logic. Our evaluation demonstrates the effectiveness of SGF. On several libraries that parse complex chunk-based files, our tool AFLsmart achieves substantially more branch coverage (up to 87 percent improvement) and exposes more vulnerabilities than baseline AFL. Our tool AFLsmart discovered 42 zero-day vulnerabilities in widely-used, well-tested tools and libraries; 22 CVEs were assigned. | Source Title: | IEEE Transactions on Software Engineering | URI: | https://scholarbank.nus.edu.sg/handle/10635/244803 | ISSN: | 0098-5589 1939-3520 |
DOI: | 10.1109/TSE.2019.2941681 |
Appears in Collections: | Staff Publications Elements |
Show full item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
1811.09447v1.pdf | 706.2 kB | Adobe PDF | OPEN | Pre-print | View/Download |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.