Tonight’s ASM Strangeness | Breaking Eggs And Making Omelettes

Check out this sequence:

copy 32-bit register A to register B
shift B ~~left~~ right by 0x1F (thanks to Reimar for spotting the mistake)
subtract 1 from B
logically AND B against A

Eventually, it dawned on me that the sequence saturates an integer to a minimum value of 0, only without using any of the more traditional branching logic for such an operation.
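As a rough illustration, the four steps can be written out in C next to the conventional branching version. This is only a sketch: the function names are made up here, and unsigned arithmetic is used on the assumption that the shift in the original sequence is a logical one.

```c
#include <stdint.h>

/* Conventional version: clamp at zero with a branch. */
static int32_t clamp_branch(int32_t a)
{
    return a < 0 ? 0 : a;
}

/* Branchless version, mirroring the four steps listed above.
 * Unsigned types make the right shift a logical shift. */
static int32_t clamp_branchless(int32_t a)
{
    uint32_t ua = (uint32_t)a;  /* register A */
    uint32_t b = ua;            /* copy A to B */
    b >>= 0x1F;                 /* B is now the sign bit of A: 0 or 1 */
    b -= 1;                     /* 0 becomes 0xFFFFFFFF, 1 becomes 0 */
    b &= ua;                    /* A survives only if it was non-negative */
    return (int32_t)b;          /* always in the range 0..INT32_MAX */
}
```

For any 32-bit input the two functions should agree; for example, both return 0 for -15 and 15 for 15.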

Speed demons, these programmers. They used some neat tricks. It looks like gcc didn’t, though. Either that or I don’t understand the deeper meaning of the instruction “lea esi, [esi+0]”. It strikes me as a NOP. But that’s not quite as bad as some code observed in another gcc-compiled module recently that saw fit to execute “mov eax, eax” after a function call before moving eax (the function’s return value) to its final destination.

I’m not complaining since, when the time comes (hopefully soon) to reverse engineer that module, the naive compilation will make the task more straightforward.

11 thoughts on “Tonight’s ASM Strangeness”

Kostya August 6, 2007 at 11:47 pm

Well, NOP is only one byte, and sometimes the compiler inserts other meaningless instructions (like lea eax, [eax+0], xchg eax, eax or mov eax, eax) to fill the space and keep loops aligned. Also consider total execution time per byte (e.g. one two-byte instruction versus two one-byte NOPs).
It’s parallelizing that gives me a headache sometimes (i.e. when one calculation is interleaved with another to be more parallelizable and to hide the execution latency).
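To make the padding argument concrete, here is a small C table of a few do-nothing instructions with their 32-bit x86 encodings, plus a helper that computes how many filler bytes an alignment boundary requires. The struct, table, and function names are hypothetical, and which filler a given gcc version actually picks is not something this sketch claims to know.

```c
#include <stddef.h>

/* One padding candidate: mnemonic, machine code, encoded length. */
struct pad_insn {
    const char *mnemonic;
    unsigned char bytes[6];
    size_t len;
};

/* 32-bit x86 encodings; xchg eax, eax shares the 0x90 byte with nop. */
static const struct pad_insn pad_table[] = {
    { "nop",              { 0x90 },                               1 },
    { "mov eax, eax",     { 0x89, 0xC0 },                         2 },
    { "lea esi, [esi+0]", { 0x8D, 0x76, 0x00 },                   3 }, /* 8-bit displacement */
    { "lea esi, [esi+0]", { 0x8D, 0xB6, 0x00, 0x00, 0x00, 0x00 }, 6 }, /* 32-bit displacement */
};

/* Bytes needed to advance addr to the next multiple of align
 * (align must be a power of two). */
static size_t pad_needed(size_t addr, size_t align)
{
    return (align - (addr & (align - 1))) & (align - 1);
}
```

With a loop that would otherwise start at 0x1D and a 16-byte alignment target, pad_needed(0x1D, 16) gives 3, which a single three-byte lea can cover instead of three separate NOPs.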

bubu August 6, 2007 at 11:52 pm

You can find very similar tricks in these pages:
http://graphics.stanford.edu/~seander/bithacks.html
http://www.inwap.com/pdp10/hbaker/hakmem/hakmem.html

Reimar August 7, 2007 at 1:44 am

Hm… Did you mean “shift B _right_ by 0x1F”, because otherwise it makes no sense to me…

Multimedia Mike Post author August 7, 2007 at 6:22 am

Oops, yes I did mean shift right… I’ll fix.

Anonymous August 7, 2007 at 4:31 pm

Why copy, shift, subtract, and and? Why not just xor A, A or something similar?

Where is the traditional branching in setting a register to 0?

Multimedia Mike Post author August 7, 2007 at 6:16 pm

@Anonymous: The traditional branching logic is:

if (register < 0)
    register = 0

The less orthodox sequence works because if the top/sign bit is set, it will decrement to 0 when shifted to the LSB and the AND operation will clear the entire register. Hence:

if (sign bit is set)
    clear register

Whereas, if the sign bit is clear to begin with, shifting to the LSB will result in a 0 register, and decrementing will flip the register to all ones. ANDing will result in the original value.
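The two cases can be traced with a tiny helper that returns only the mask produced by the shift and decrement. The name keep_mask is invented for this sketch, and a logical shift on a 32-bit value is assumed.

```c
#include <stdint.h>

/* Mask left in B after "shift right by 0x1F" and "subtract 1":
 * all ones when the sign bit of a is clear, zero when it is set. */
static uint32_t keep_mask(uint32_t a)
{
    return (a >> 31) - 1u;
}

/* Worked traces:
 *   a = 0xFFFFFFF1 (-15): a >> 31 = 1, 1 - 1 = 0x00000000, mask & a = 0
 *   a = 0x0000000F ( 15): a >> 31 = 0, 0 - 1 = 0xFFFFFFFF, mask & a = 0xF
 */
```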

mark August 8, 2007 at 8:38 pm

The book at the fxt site also has some really interesting stuff on the topic of optimization.
http://www.jjj.de/fxt/#fxtbook

mark August 8, 2007 at 9:25 pm

In case you are interested, I tested this code with a C implementation.

#include <assert.h>

int break_eggs(const int input)
{
    int branch = input;
    unsigned int branchless = (unsigned int)input;

    /* branch with signed int */
    if ( branch < 0 )
        branch = 0;

    /* branchless with unsigned int, so the shift is logical */
    branchless >>= 0x1f;
    branchless -= 1;
    branchless &= (unsigned int)input;
    assert( (unsigned int)branch == branchless );
    return branchless;
}

int main(void) {
    assert (break_eggs(-15) == 0);
    assert (break_eggs(15) == 15);
    return 0;
}

mark August 8, 2007 at 9:27 pm

The code was mangled by the html :(


Anonymous August 12, 2007 at 3:39 am

Am I missing something ?
negative :
b = b>>31 = 0xFFFFFFFF = -1
b = b-1 = 0xFFFFFFFE = -2 ????

positive case :
b = b>>31 = 0
b = b-1 = -1 = 0xFFFFFFFF OK

I think that this could work better :
b>>=31
b= ~b
b&=a

Multimedia Mike Post author August 12, 2007 at 8:05 am

@Anonymous: This assumes a logical shift vs. an arithmetic shift. Thus, zeros are shifted in from the left, even for a negative number.
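The difference between the two shifts, and why both sequences end up clamping, can be sketched in C. Right-shifting a negative signed value is implementation-defined in C, so the arithmetic shift is emulated here with unsigned arithmetic; all four function names are hypothetical.

```c
#include <stdint.h>

/* Logical shift: zeros enter from the left, so the result is 0 or 1. */
static uint32_t shr_logical(uint32_t a)
{
    return a >> 31;
}

/* Arithmetic shift, emulated portably: the sign bit is replicated,
 * so the result is 0 or 0xFFFFFFFF. */
static uint32_t sar_emulated(uint32_t a)
{
    return 0u - (a >> 31);
}

/* The post's sequence: logical shift, subtract 1, AND. */
static uint32_t clamp_via_shr(uint32_t a)
{
    return (shr_logical(a) - 1u) & a;
}

/* The commenter's alternative: arithmetic shift, NOT, AND. */
static uint32_t clamp_via_sar(uint32_t a)
{
    return ~sar_emulated(a) & a;
}
```

Both variants build the same all-ones or all-zeros mask; they differ only in whether the decrement or the NOT produces it.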
