sean barrett | Breaking Eggs And Making Omelettes

$ ./bb86 < ~/basic-block.asm Reading stdin Warning: unknown opcode 'bswap' in line 9 Memory locations: mem1 EQU dword+(ebp_0)+08 mem5 EQU dword+(mem4)+((mem3 >> 03)) mem4 EQU dword+(mem1)+10 mem3 EQU dword+(mem1)+04 mem2 EQU dword+(ebp_0)+0c Integer registers: eax = ((mem5 < < cl_0) >> cl_0) ebx = mem4 ecx = (00000020 - mem2) edx = (mem3 >> 03) esi = mem2 edi = mem1 Floating point stack: st(0) = fp3 st(1) = fp2 st(2) = fp1 st(3) = fp0 Memory locations: [dword+(mem1)+04] < = (mem3 + mem2)

Every now and then, someone comes along and writes a short novel of a comment on an older post. Such was the case when Sean Barrett used the occasion of my What RE Looks Like post to take three hours of his rather busy life and compose a “symbolic executor” — a basic block decompiler. It’s a valiant effort and I would like to try my hand at it, as with all RE tools. I am having trouble compiling the source he posted (I converted CR format, but I am still having trouble with missing symbols from his custom library-in-a-header-file). It works on Microsoft-compatible disassembly output but probably would not be too hard to adapt for ‘objdump -Mintel’ in the GNU toolchain.

Many people have gone down this basic block disassembly road. The details are hazy but I seem to recall that I have made the journey as well. It’s a good thing I keep this blog as a journal. I guess the reason I can’t remember what my experiment was called is because it was the “Unnamed RE Project”. It looks like all I accomplished there was straight ASM -> C translation without any effort at higher level language abstraction.

Anyway, I still maintain that figuring out the overriding purpose of these basic blocks is not the biggest challenge in traditional binary reverse engineering– indeed, I personally consider it the most interesting part. No, what I think is the toughest part is figuring out — or more likely guessing — what the sometimes hundreds of referenced variables are actually used for, and assigning them appropriate names. The biggest nightmare is when functions pass around multiple gigantic nested structures and actually use a bunch of variables within.

In other words, true understanding of the underlying algorithm is the goal. But, Sean, I still want to try your tool.

Breaking Eggs And Making Omelettes

Topics On Multimedia Technology and Reverse Engineering

Tag Archives: sean barrett

Barrett’s Basic Blocks Are Back

The Quest For Decompilation