This is an old revision of this page, as edited by Doktorspin (talk | contribs) at 23:22, 1 February 2010 (Undid revision 341367214 by SarekOfVulcan Disagreement is not a sufficient response. Please supply rationale rather than unjustified removal). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
Revision as of 23:22, 1 February 2010 by Doktorspin (talk | contribs) (Undid revision 341367214 by SarekOfVulcan Disagreement is not a sufficient response. Please supply rationale rather than unjustified removal)(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)It is proposed that this article be deleted because of the following concern:
If you can address this concern by improving, copyediting, sourcing, renaming, or merging the page, please edit this page and do so. You may remove this message if you improve the article or otherwise object to deletion for any reason. Although not required, you are encouraged to explain why you object to the deletion, either in your edit summary or on the talk page. If this template is removed, do not replace it. This message has remained in place for seven days, so the article may be deleted without further notice. Find sources: "Microsoft Macro Assembler" – news · newspapers · books · scholar · JSTORPRODExpired+%5B%5BWP%3APROD%7CPROD%5D%5D%2C+concern+was%3A+Software+product+hardly+appears+to+be+notable.+Main+sources+are+from+the+primary+producer%27s+material+and+are+therefore+not+reliable+third+party+sources.+Several+links+don%27t+function.Expired ], concern was: Software product hardly appears to be notable. Main sources are from the primary producer's material and are therefore not reliable third party sources. Several links don't function. Nominator: Please consider notifying the author/project: {{subst:proposed deletion notify|Microsoft Macro Assembler|concern=Software product hardly appears to be notable. Main sources are from the primary producer's material and are therefore not reliable third party sources. Several links don't function.}} ~~~~ Timestamp: 20100201230157 23:01, 1 February 2010 (UTC) Administrators: delete |
This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these messages)
No issues specified. Please specify issues, or remove this template. (Learn how and when to remove this message) |
Developer(s) | Microsoft |
---|---|
Stable release | 9.0 / May 1, 2008 (2008-05-01) |
Operating system | Microsoft Windows and MS-DOS |
Type | Assembler |
License | Microsoft EULA |
Website | www |
The Microsoft Macro Assembler is an x86 assembler for MS-DOS and Microsoft Windows. While the name MASM has earlier usage it is commonly understood in more recent years to refer to the Microsoft assembler. It supports a wide variety of macro facilities and structured programming idioms, including high-level functions for looping and procedures. Later versions added the capability of producing programs for Windows. MASM is one of the few Microsoft development tools that target 16-bit, 32-bit and is supplied as a 64 bit version ML64.EXE for 64-bit platforms. Versions 5.0 and earlier were MS-DOS applications. Versions 5.1 and 6.0 were available as both MS-DOS and OS/2 applications. Versions 6.12 to 6.14 were implemented as patches for version 6.11 which converted them from 16 bit MZ executables to 32 bit PE executable files. All later versions have been 32 bit PE executable files built as Win32 console mode applications.
History
The Microsoft Assembler, commonly known as MASM has been in production since 1981 and is upgraded by Microsoft to keep abreast with operating system needs and processor developments. Versions 6.1 and 6.11 converted MASM from a 16 bit MZ/NE dual-mode executable for DOS and OS/2 to a proper 32 bit Portable Executable that ran natively on the 32 bit Windows platforms and included Phar Lap's TNT DOS extender so that it could run in MS-DOS.
The last freestanding commercial version of the Mirosoft assembler was version 6.11d released in the middle 1990s. With the release of 32 bit versions of Windows with both the OEM win95 and WinNT version 4.0, Microsoft developed ML.EXE mainly for internal use as an operating system vendor and it was mainly available only through MSDN subscription but Microsoft developed patches for the last commercial version of ML.EXE With the release of the 6.14 patch, ML.EXE became a very reliable tool that supported Intel opcodes up to the early SSE instruction set.
In mid-2000 Microsoft re-integrated ML.EXE back into their VC98 commercial software development package with the processor pack as the downloadable file VCPP5.EXE which was licenced so that licenced end users of VC98 could redistribute the processor pack to other licenced end users of VC98 and all versions of Microsoft Visual C and Visual Studios have contained ML.EXE as a component since that time. The ML.EXE version supplied in the VCPP5 pack was ML.EXE version 6.15 which added support for the SSE2 Intel instruction set. Successive versions of ML.EXE have been developed on a needs basis to include later Intel mnemonics. Later in Visual C++ 2005, a 64-bit version of MASM appeared.
Although MASM is no longer a freestanding commercial product, it has since 2000 been a component of the Microsoft commercial development environment Visual Studio but Microsoft have also made it availabe in many different packages for device development and more recently in the free downloadable versions of Visual Studio.
Version 7.0 was included with Visual C++ .NET 2002. Version 7.1 was included with Visual C++ .NET 2003. Version 8.0 was included with Visual C++ 2005 which also includes a version that can assemble x64 code. Version 9.0 is included with Visual C++ 2008. Some of the newer versions of MASM are also included in various Microsoft SDKs and DDKs.
Usage
The Microsoft assembler has been the main vehicle for preserving the earlier Intel assembler notation and it can still be written as a fully specified language, a format that many dis-assemblers produce. The most common notation of this type are the data size specifiers,
BYTE PTR The data size specifier for the target being 8 bit. WORD PTR The data size specifier for the target being 16 bit. DWORD PTR The data size specifier for the target being 32 bit.
Addressing Notation
ML.EXE maintains the historical distinction between transient stack addressing and fixed data addressing by using the notation OFFSET to denote data in either the initialised or uninitialised data sections. Transient stack addressing is handled by a number of methods. With a procedure that uses a stack frame, named LOCAL variables are used for readability purposes and where the address of the variable is required it can be accessed eith by the LEA mnemonic or in an INVOKE call by the ADDR operator. The LOCAL variables are stack addresses when used within a procedure. Procedures written without a stack frame are generally written purely in mnemonics using direct based argument addressing. To maintain compatibility with the historical MASM method of pseudo high level notation, there is a notation to turn off the stack frame generation for the procedure on a needs basis.
Square Brackets
ML.EXE does not require the addition of square brackets around a named variable but will tolerate such notation deviation by ignoring the notation. This notation difference has at time been a source of confusion for programmers familiar with other assemblers that use the square brackets to denote an address.
mov eax, local_var ; Standard ML.EXE notation mov eax, ; Square brackets are ignored by ML.EXE in this context.
There is no Intel mnemonic that will produce the extra level of indirection implied by placing un-necessary square brackets around a named variable and the practice leads to confusion of programmers who are experienced in using other x86 assemblers that use the square bracket notation differently. ML.EXE uses square brackets around direct mnemonic code to perform the dereferencing operation as in the following example.
lea eax, variable_name ; load the address of a stack variable into the EAX register mov eax, ; dereference the CONTENT of the variable and copy it into the EAX register.
ML.EXE will allow the following.
mov eax, mov eax,
In this context the second pair of square brackets perform the ADDITION function within the complex addressing notation
Notation Abbreviation
Over a long period a form of shorthand notation developed as the parsers in early versions improved and generally if the assembler can determine the size of the data the data size specifier is not necessary although it still can be used. This shorthand has confused some users who have used other assemblers which are not by default data size specified tools. It can lead to problems when the user is not familiar with the default data size specifiers while using the shorthand notation.
movzx eax, ; This fails as the assembler cannot determine the size of data to be zero extended into the EAX register.
movzx eax, BYTE PTR ; this is the correct notation if the programmer wishes to zero extend a BYTE into a 32 bit register.
Limited Type Checking
From at least version 6.0, ML.EXE has supported a pseudo high level notation for creating procedures that perform argument size and count checking. It is part of a system using the PROC ENDP PROTO and INVOKE operators. The PROTO operator is used to define a function prototype that has a matching PROC that is terminated with the ENDP operator. The prototyped procedure can then be called with the INVOKE operator which is protected by the limited size and argument count checking. There is additional notation at a more advanced level for turning off the automatically generated stack frame for the procedure where stack overhead in the procedure call may have an effect with very small procedures. ML.EXE is also capable of being written completely free of the pseudo high level notation using only bare Intel mnemonics.
Using an example prototype from the 32 bit Windows API function set,
SendMessage PROTO STDCALL :DWORD,:DWORD,:DWORD,:DWORD SendMessage equ <SendMessageA>
The code to call this function using the INVOKE notation is as follows.
invoke SendMessage,hWin,WM_COMMAND,wParam,lParam
Which is translated exactly to,
push lParam push wParam push WM_COMMAND push hWin call SendMessage
The advantage of the INVOKE method is that it tests the size of the data types and the argument count and generates an assembly time error if the arguments do not match the prototype.
Note that ML64.EXE does not currently support the INVOKE notation and may not in the future. Based on Microsoft's history of updating earlier versions of ML.EXE on a needs basis for their own internal usage, this feature set may not be developed unless they have a need to add it for their own usage.
Calling Conventions
ML.EXE supports a number of different calling conventions on both the 16 bit real mode DOS operating system, the 16 bit Windows versions and the later 32 bit versions. ML.EXE supports the C, SYSCALL, STDCALL, BASIC, FORTRAN and PASCAL calling conventions.
Pseudo High Level Emulation
ML.EXE provides a notation to emulate a variety of high level control and loop structures.
It supports the .IF block structure,
.if - .elseif - .else - .endif
It also supports the .WHILE loop structure,
.while eax > 0 sub eax, 1 .endw
And the .REPEAT loop structure.
.repeat sub eax, 1 .until eax < 1
The high level emulation also supports C runtime comparison operators that work according to the same rules as Intel mnemonic comparisons. For the .IF block notation the distinction between SIGNED and UNSIGNED data is handles with a minor data type notation variation where the storage size DWORD which is by default UNSIGNED can also be specified as SDWORD for SIGNED comparison. This data type distinction is only appropriate for the pseudo high level notation as it is unused at the mnemonic level of code where the distinction is determined by the range of conditional evaluation techniques available in the Intel mnemonics.
The combined pseudo high level emulation allows MASM to more easily interface with the later current operating systems that use a C style application programming interface. Generally the pseudo high level interface is used for non-speed critical code where clarity and readability are the most important factors, speed critical code is usually written in directly in mnemonics.
Note that ML64.EXE does not support all of the earlier pseudo high level notation and may not in the future. Based on Microsoft's history of updating earlier versions of ML.EXE on a needs basis for their own internal usage, this feature set may not be developed unless they have a need to add it for their own usage.
Pre-Processor
The Microsoft assembler has a very powerful pre-processor that has considerably more functionality than modern C compilers which is consistent with its designation as a macro assembler and it has been designed from the introduction of ML.EXE version 6.0 with C style pseudo high level functionality for programmers who prefer to use this style of notation for code that is not speed critical. On the down side the pre-processor is an old design that is known to be quirky in its operation and reasonably difficult to use without a lot of experience when writing macros that are of a more complex nature.
At their simplest macros written for the ML.EXE pre-processor are useful for automating many different simple tasks.
; ---------------------------- ; memory to memory assignment ; ---------------------------- m2m MACRO M1, M2 push M2 pop M1 ENDM
; -------------------------------------------------- ; memory to memory assignment using the EAX register ; -------------------------------------------------- mrm MACRO m1, m2 mov eax, m2 mov m1, eax ENDM
Using the EXITM <return item> notation a macro can return a value or register in a way that can used similar to a high level function call. Using a very simple example,
addregs32 MACRO reg1, reg2 add reg1, reg2 EXITM <reg1> ENDM
In the .CODE section.
mov ecx, 16 mov edx, 8 mov eax, addregs32(ecx, edx)
Which disassembles exactly to the following mnemonics.
0040102B B910000000 mov ecx,10h 00401030 BA08000000 mov edx,8 00401035 03CA add ecx,edx 00401037 8BC1 mov eax,ecx
At a slightly more complex level the pre-processor can be used to emulate higher level languages which allows non-critical code to be simplified for higher programming throughput.
fn MessageBox,0,str$(eax),"Title",MB_OK
In this working example fn is a macro that encapsulates the INVOKE notation and adds functionality so that quoted text can be inserted directly into the API function call in much the same way as a high level language. The str$() macro is an emulation of traditional basic for converting numeric data as either a memory operand or a register into string data for display.
Object Module Compatibility
The 32 bit versions of ML.EXE introduced with the patches for the last commercial version onwards produce object modules in both the older OMF format and the Microsoft version of the Portable Executable specification COFF format. The object module format is compatible with modern Microsoft C compilers and object modules produced by either ML.EXE or CL.EXE can be routinely intermixed and linked into applications written either with ML.EXE or CL.EXE.
Support
IDE Support
- Microsoft Visual Studio
- RadAsm
- WinAsm Studio
- EasyCode
Debuggers
- OllyDbg
Disassemblers
- IDAPro the Interactive Disassembler
SDK
- MASM32 SDK
There are many projects that support MASM, including IDEs (such as WinAsm Studio and RadASM), debuggers (such as OllyDbg), and disassemblers (including IDAPro, the Interactive Disassembler). The MASM32 project has developed a library, sample code repository, and documentation for MASM users. It is licenced as freeware. Support can also be found at several forums. Also, along the lines of products such as Visual Basic, or Visual C++, EasyCode is an IDE for the MASM language that provides "Visual" capabilities. Microsoft continues to develop MASM on an as-needed basis.
MASM Compatible Assemblers
For assembler programmers who are unable to use the Microsoft assembler for licencing reasons there are two directly MASM compatible assemblers that can build almost all MASM code apart from the more complex macros developed under MASM.
- Pelle's Macro Assembler which is a component of the Pelles C development environment.
- JWASM Macro Assembler licenced under the Sybase Open Watcom EULA.
See also
Footnotes
- Unisys OS 1100 Meta-Assembler
- Randall Hyde. "Which Assembler is the Best?". Retrieved 2008-06-27.
- Version 6.00 has the following copyright string: "(C) Microsoft Corp 1981-1991."
- ALANG.HLP standard installation Masm 6.11d
- http://www.intel.com/products/processor/manuals/
- http://msdn.microsoft.com/en-us/library/default.aspx
- MASM32 Main macro file MACROS.ASM
- http://download.microsoft.com/download/e/b/a/eba1050f-a31d-436b-9281-92cdfeae4b45/pecoff.doc
- http://www.microsoft.com/whdc/system/platform/firmware/PECOFF.mspx
- http://www.oby.ro/rad_asm/
- http://www.winasm.net/
- http://www.easycode.cat/English/index.htm
- http://www.ollydbg.de/
- http://www.masm32.com
- "RadASM". Retrieved 2008-06-27.
- "OllyDbg". 2008-05-24. Retrieved 2008-06-27.
- hutch. "hutch's home page". Retrieved 2008-06-27.
- Hutchesson, Steve (2004). "MASM32 Licence". Retrieved 2009-12-11.
- "The Masm Forum". Retrieved 2008-06-27.
- "Easy Code". Retrieved 2008-06-27.
- "What is MASM". Retrieved 2008-06-27.
External links
x86 assembly topics | |
---|---|
Topics | |
Assemblers |
|
Programming issues |