Cheat Engine

KryziK · Posted: Sat Sep 12, 2015 9:06 pm Post subject: Assembler Help

Hey all, I'm working on writing a small assembler for a subset of the Intel instructions, and I had a few questions:

1. In the CE source, in assemblerunit.pas, I noticed the following line:

Dark Byte · Posted: Sun Sep 13, 2015 2:28 am Post subject:

1 the 0x66 prefix switches between 16 and 32 bit operand size (0x67 does the same but for addressing, which CE does not support)

2 the preferred instruction is at the top, but if the instruction can't be encoded it'll go for an alternate one (e.g. when you wish to encode a value that doesn't fit in 1 signed byte, -128 to 127, it can't use the 2 byte jmp)

3 a lot of those entries are duplicates with a minor difference (e.g. it shows a different entry for 32 and 64 bit mode. CE just deals with that afterwards by setting a REX prefix)
besides that, some newer instructions, like the VEX-encoded ones, aren't handled yet
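
To make points 1 and 3 concrete, here is a small Python sketch of how the 0x66 and REX.W prefixes change operand size without changing the opcode itself. It is only an illustration, not CE's code: the function names are made up for this example, and the 16/32 bit case assumes 32-bit (or 64-bit) mode, where 32 bit is the default operand size.

```python
import struct

OPERAND_SIZE_PREFIX = 0x66  # toggles the operand size between 16 and 32 bit
REX_W = 0x48                # REX prefix with W=1: 64-bit operand size (64-bit mode only)

def mov_reg_imm(reg_index: int, value: int, bits: int) -> bytes:
    """Encode MOV r16/r32, imm (opcode B8+r). Hypothetical helper for illustration."""
    opcode = bytes([0xB8 + reg_index])
    if bits == 32:
        return opcode + struct.pack("<I", value)
    if bits == 16:
        # Same opcode, but prefixed with 0x66 and followed by a 16-bit immediate.
        return bytes([OPERAND_SIZE_PREFIX]) + opcode + struct.pack("<H", value)
    raise ValueError("bits must be 16 or 32")

def mov_reg_reg(dst: int, src: int, bits: int) -> bytes:
    """Encode MOV r/m, r (opcode 89 /r) between registers 0-7. Hypothetical helper."""
    modrm = 0xC0 | (src << 3) | dst  # mod=11 (register direct), reg=src, rm=dst
    body = bytes([0x89, modrm])
    if bits == 32:
        return body
    if bits == 64:
        # One table entry is enough: the 64-bit form is the 32-bit form plus REX.W.
        return bytes([REX_W]) + body
    raise ValueError("bits must be 32 or 64")

print(mov_reg_imm(0, 0x12345678, 32).hex(" "))  # mov eax, 0x12345678
print(mov_reg_imm(0, 0x1234, 16).hex(" "))      # mov ax, 0x1234
print(mov_reg_reg(0, 3, 32).hex(" "))           # mov eax, ebx
print(mov_reg_reg(0, 3, 64).hex(" "))           # mov rax, rbx
```

Register indices follow the usual numbering (0 = eax/rax, 3 = ebx/rbx). Registers r8-r15 would need the extra REX.R/REX.B bits, which this sketch leaves out.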

KryziK · Posted: Sun Sep 13, 2015 11:38 am Post subject:

2. So, for my second question, I see that CE finds the range of instructions in the array opcodes with the matching mnemonic, but what I really wanted to know was how CE determines whether or not the entry in opcodes is the right one. So, it starts at the first one, which is preferred, how does it determine whether the operands match what the user has typed? My guess was in the original post: It checks the size of the immediate that the user has typed, and then just makes sure that it fits into the size specified in opcodes.

Hopefully that was clear enough to understand. I'm just trying to figure out how CE "weeds out" the opcodes that don't match what the user typed. It's hard to read through so many if statements and such.

3. I see, in the Intel manual, some instructions that are 64 bit, such as:

Dark Byte · Posted: Sun Sep 13, 2015 1:12 pm Post subject:

2 yes, you're right
it checks the size of the immediate and then goes through the list
first the 2 byte jmp will get checked if it matches, and then the 5 byte one

if the immediate doesn't fit in 1 signed byte (-128 to 127) the 2 byte jmp fails, so it goes for the 5 byte jmp

the assembler array contains the parameters they expect

3 take that example you posted there. CE has no real use for that one (with or without REX prefix) as changing the code segment is generally a bad idea. (sure, you could do tricks like executing 64 bit code in a 32 bit process, but come on... for ce?)

as for jmp rel16, for some reason it zeroes the upper 16 bits of EIP, so for all purposes it's useless

the jmp r/m# instructions are just 1 instruction. it may be shown 3 times, but they are just the same instruction. It depends on the cpu state how it gets handled (e.g. in 64 bit execution mode modrm can be encoded as rip relative)
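
The "try the preferred entry first, fall back to the next one" selection from point 2 could look roughly like the Python sketch below. This is an assumption about the general shape, not CE's actual code: the `JMP_ENTRIES` table and the function names are hypothetical, and only the two relative jmp forms are listed.

```python
import struct

# Candidate encodings for JMP, preferred (shortest) first:
# (description, opcode byte, immediate size in bytes)
JMP_ENTRIES = [
    ("jmp rel8", 0xEB, 1),   # 2 byte jmp
    ("jmp rel32", 0xE9, 4),  # 5 byte jmp
]

def fits_signed(value: int, size: int) -> bool:
    """True if value fits in a signed integer of `size` bytes."""
    bits = size * 8
    return -(1 << (bits - 1)) <= value < (1 << (bits - 1))

def assemble_jmp(origin: int, target: int) -> bytes:
    """Return the first entry whose immediate can hold the displacement."""
    for _name, opcode, imm_size in JMP_ENTRIES:
        length = 1 + imm_size
        # The displacement is relative to the end of the instruction,
        # so it depends on which encoding is being tried.
        rel = target - (origin + length)
        if fits_signed(rel, imm_size):
            fmt = "<b" if imm_size == 1 else "<i"
            return bytes([opcode]) + struct.pack(fmt, rel)
    raise ValueError("target out of range for every jmp entry")

print(assemble_jmp(0x1000, 0x1010).hex(" "))  # short form is enough
print(assemble_jmp(0x1000, 0x2000).hex(" "))  # falls back to the 5 byte form
```

Because the list is ordered by preference, the assembler never has to "choose" explicitly: the first entry that can hold the operand wins.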

KryziK · Posted: Sun Sep 13, 2015 5:59 pm Post subject:

Thanks for your replies! I'll take what you have said into account when coding and reading the Intel manual.

Unfortunately, with classes and a job, who knows if I'll have enough time to work on this for very long. But, for now, I'll try!

Thanks again, DB. You're the best. <3

KryziK · Posted: Wed Sep 23, 2015 10:51 pm Post subject:

DB,

Could you explain modR/M and how the /digit opcode works?

For example, the PUSH instruction (on 4-271 of Intel Manual):

Dark Byte · Posted: Thu Sep 24, 2015 4:00 am Post subject:

for instructions that don't need 2 parameters, the reg field of the modr/m byte can be part of the instruction

in case of push that's 110, so the real instruction is 11111111 followed by 110 in the reg field of the modr/m byte
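
As a rough illustration of the reg field carrying the /digit, this Python sketch builds FF /digit with an absolute 32-bit address as the operand. The helper names are made up for the example, and it assumes 32-bit mode, where mod=00 with rm=101 means "disp32 only" (in 64-bit mode the same bits mean RIP-relative).

```python
import struct

def modrm(mod: int, reg: int, rm: int) -> int:
    """Pack the three ModR/M fields: mod (bits 7-6), reg (bits 5-3), rm (bits 2-0)."""
    return (mod << 6) | (reg << 3) | rm

def encode_ff_abs32(digit: int, address: int) -> bytes:
    """Encode FF /digit [address]. Hypothetical helper; 32-bit mode assumed."""
    # mod=00 and rm=101 select a bare 32-bit displacement; the reg field
    # is not a register here, it holds the /digit opcode extension.
    return bytes([0xFF, modrm(0b00, digit, 0b101)]) + struct.pack("<I", address)

print(encode_ff_abs32(6, 0x1234).hex(" "))  # push dword ptr [00001234]
print(encode_ff_abs32(0, 0x1234).hex(" "))  # inc dword ptr [00001234]
```

The only difference between the two outputs is the three reg bits in the second byte.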

modR/M is too complex to explain here, but the intel guide with all the instructions has a chapter on how the modr/m and sib bytes are built (and the offsets, and several other special cases)

as for the r,w stuff, no idea, i never looked at those

KryziK · Posted: Thu Sep 24, 2015 12:14 pm Post subject:

Could you explain that first part again? What do you mean the reg field can be part of the instruction? Why is push "11111111 110"? That comes out to "FF 06", which is an inc instruction according to CE. I'm sure that I'm misunderstanding something here.

Also, could you answer my last question about /digit vs /r (eo_reg vs eo_reg0/1/2/3/4/5/6/7)? I can tell that this difference is important in understanding what you just explained, because:

Dark Byte · Posted: Sat Sep 26, 2015 3:11 am Post subject:

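bits 3, 4 and 5 of the modr/m byte make up the reg field, so if those are 110 (6) it'd be a push. in FF 06 the modr/m byte is 00 000 110, so the reg field is 000, which is inc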

well, i guess an alternate way of looking at it is to assume FF is a single instruction (e.g. WRT) where the register specifies what kind of operation is done
e.g.:
WRT [1234], EAX = INC [1234]
WRT [1234], ESI = PUSH [1234]

eo_reg6 means that bits 3-5 should be made 6
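
Going the other way, splitting a ModR/M byte into its fields shows why FF 06 disassembles as inc and not push. A minimal Python sketch, with hypothetical function names; the /digit table follows the FF opcode group in the Intel manual:

```python
# Operations selected by the reg field (bits 3-5) when the opcode byte is FF.
FF_GROUP = {
    0: "inc",
    1: "dec",
    2: "call",
    3: "call far",
    4: "jmp",
    5: "jmp far",
    6: "push",
}

def split_modrm(byte: int) -> tuple[int, int, int]:
    """Return (mod, reg, rm) from a ModR/M byte."""
    return (byte >> 6) & 0b11, (byte >> 3) & 0b111, byte & 0b111

def describe_ff(modrm_byte: int) -> tuple[str, int, int]:
    """Return (operation, mod, rm) for the byte following an FF opcode."""
    mod, reg, rm = split_modrm(modrm_byte)
    return FF_GROUP.get(reg, "(undefined)"), mod, rm

print(describe_ff(0x06))  # reg=000 -> inc, rm=110 (esi)
print(describe_ff(0x36))  # reg=110 -> push, rm=110 (esi)
```

In 0x06 the 110 sits in the rm field (bits 0-2), so it selects the operand [esi] rather than the operation. For a push of the same operand, the 110 has to go in bits 3-5 as well, giving FF 36.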

KryziK · Posted: Wed Oct 21, 2015 9:57 pm Post subject:

Dark Byte,

Thanks for the reply. I have been working on my code ever since your reply. I found this resource which ended up being very helpful:

http://www.c-jump.com/CIS77/CPU/x86/index.html

It provides an easy to understand look at the MOD R/M and SIB bytes, with examples and everything. I'll add the link to the OP, too.

I'm getting closer to finishing! Thanks again.