Cheat Engine

h3x1c

I've tried Googling a bit for this, but I can't seem to come across a "plain English" explanation of this. Put simply, how does a program know how to treat any given memory address? I understand that at compile time, the program is compiled with size and format in mind for everything, but where does it store such information so that at runtime, it knows when a particular address is a 4-byte int vs. 4-byte long?

Or better yet, how does CE know what format a memory address is when it scans?

I may well be thinking about this too deeply, but I'm just not understanding how a program (whether the program itself or a program like CE that can analyze said program) "knows" how to treat each of its memory addresses! Embarassed

Dark Byte · Posted: Mon Jul 25, 2016 11:44 am Post subject:

CE doesn't know what format an address is. It relies on the user to tell it instead. (or if you use all, it just tries every possible combination)

if you're talking about dissect data, then it's either based on guessing (address alignment and if the value is a human readable value or not) or if there is debugging information available (.net/mono, .pdb) then it can get the info from there

STN · Posted: Mon Jul 25, 2016 11:45 am Post subject:

When you program, you can define which data type you want to use for your variable. Such as int, short int, long int, unsigned int, char etc. depending on the language. It is a feature of strongly typed languages but if you have been using very high level languages/managed languages then i can understand your confusion.

CE just guesses, of course CE doesn't know what is the proper data type.

In memory all data is same, a string is no different than an int unless you treat it as such. A string is a collection of chars(one byte), a 4 bytes int is a collection of 1 bytes. This will make it clear for you, open CE mem viewer and in hex viewer, change display type from byte hex to any of the different data types, you can see all of them are basically just bytes. That's how they are stored in memory

h3x1c · Posted: Mon Jul 25, 2016 11:49 am Post subject:

STN · Posted: Mon Jul 25, 2016 11:51 am Post subject:

h3x1c · Posted: Mon Jul 25, 2016 11:55 am Post subject:

STN · Posted: Mon Jul 25, 2016 12:28 pm Post subject:

ParkourPenguin · Posted: Mon Jul 25, 2016 12:34 pm Post subject:

You're thinking about this from a high-level perspective far too much. Value types are very useful for sanity checks when developing a program, but when you get down to it, a value type is really just an abstraction over bytes in memory. In other words, every value type is stored in memory as bytes. You're free to interpret those bytes any way you want, be it 4-byte, float, string, or something you make up (i.e. custom value types). There is absolutely nothing you can do to conclusively distinguish an address's value type just from looking at its value.

You can make an educated guess of an address's value type by looking at how the program accesses that address (e.g. fld dword ptr [eax] probably means [eax] is a float), but you still won't know for certain. When you look at the core aspects of reverse engineering, the only thing that's important is what the program does with a value. In order to quickly determine this, most people will make the assumption that a program will only treat a single value as a single type, which doesn't always have to be true. Take this C code for example:

h3x1c · Posted: Mon Jul 25, 2016 2:54 pm Post subject:

Thanks DB, STN, and Parkour! This is crystal clear for me now.

The convolution in my head stems from a weird amalgamation of things I've been studying at the same time lately from low-level and high-level (C#, specifically)--the error being what you led with in your reply, Parkour.

Thanks again for your detailed replies, everyone!!! Very Happy

mgr.inz.Player