Bootstrapping An MSDOS Assembler With Batch Files

December 10, 2018

You have a clean MSDOS system, and you need to write some software for it. What do you do? You could use debug, of course. But debug has no labels, so while it will turn mnemonics into machine code, you still have to work out the addresses on your own. That wasn’t good enough for [mniip], who created an assembler using mostly batch files. There are a few .COM files, and it looks as if you use debug to create those the first time; there is also source for them that you can assemble with the assembler itself on subsequent builds.
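To make the address problem concrete, here is a minimal Python sketch (not part of [mniip]’s project) of the bookkeeping debug leaves to you. It assumes a .COM program loaded at offset 0x100 and uses the usual 8086 encoded lengths for four instructions of a tiny hello-world program; the names `listing` and `data_address` are made up for illustration.

```python
ORG = 0x100  # DOS loads a .COM program at offset 0x100

# (instruction, encoded length in bytes) for a tiny hello-world program
listing = [
    ("mov ah,9", 2),
    ("mov dx,<address of text>", 3),
    ("int 21", 2),
    ("ret", 1),
]


def data_address(listing, org=ORG):
    """Address of the first byte after the listed instructions."""
    return org + sum(size for _, size in listing)


text = b"Hello world!$"
start = data_address(listing)   # where the text will sit
end = start + len(text)         # first free address after the text

# Address to load into DX, and the file length to save, both in hex
print(hex(start), hex(end - ORG))
```

Every time an instruction is added or removed, these sums change and every address that depends on them has to be patched by hand. Labels exist to automate exactly that.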

Why? We aren’t entirely sure. But it is definitely a hack. The technique sort of reminded us of our own universal cross assembler — sort of.

There are a few things that make this work. First, there are not many 8086 instructions to worry about. Second, you have to use a special format, essentially prefixing each mnemonic with CALL. This keeps the assembler from having to parse mnemonics: each line actually calls a batch file with the name of the instruction. For example:

CALL PUSH CS
CALL POP DS
CALL MOV DX WORD %String%

CALL LABEL String
REM H e l l o , w
CALL DB 72 101 108 108 111 44 32 119

That code snippet shows another nuance. You have to CALL LABEL to introduce a label. To use the label in an instruction, you have to surround it with percent signs.
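As a rough illustration of the idea, and not a description of how the batch files are actually implemented, the Python sketch below treats each mnemonic as a callable, much as each one is a separate batch file in the assembler. It assumes a two-pass scheme for labels (the article does not say how forward references are resolved), a .COM origin of 0x100, and it assembles only the lines shown above. All names here (`Asm`, `ref`, `assemble`, and so on) are hypothetical.

```python
ORG = 0x100  # assumed load offset of a .COM program


class Asm:
    """Collects machine code; one method per mnemonic (hypothetical names)."""

    def __init__(self, labels=None):
        self.code = bytearray()
        self.labels = dict(labels or {})

    def label(self, name):
        # Like CALL LABEL name: remember the current address
        self.labels[name] = ORG + len(self.code)

    def ref(self, name):
        # Plays the role of %name%; unknown labels read as 0 on the first pass
        return self.labels.get(name, 0)

    def push_cs(self):
        self.code.append(0x0E)  # PUSH CS

    def pop_ds(self):
        self.code.append(0x1F)  # POP DS

    def mov_dx_word(self, value):
        # MOV DX, imm16: opcode 0xBA followed by a little-endian word
        self.code += bytes([0xBA]) + value.to_bytes(2, "little")

    def db(self, *values):
        self.code += bytes(values)  # raw data bytes


def program(a):
    a.push_cs()
    a.pop_ds()
    a.mov_dx_word(a.ref("String"))
    a.label("String")
    a.db(72, 101, 108, 108, 111, 44, 32, 119)


def assemble(prog):
    first = Asm()                # pass 1: only the label addresses matter
    prog(first)
    second = Asm(first.labels)   # pass 2: labels are known, emit final bytes
    prog(second)
    return bytes(second.code), second.labels


code, labels = assemble(program)
print(code.hex(" "), hex(labels["String"]))
```

Because the label is used before it is defined, the first pass emits a placeholder word and only records where `String` ends up; the second pass fills in the real address.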

Of course, as a practical matter, you could use gcc to build a proper assembler. But where’s the sport in that?

26 thoughts on “Bootstrapping An MSDOS Assembler With Batch Files”

steven says:

December 10, 2018 at 7:17 pm

I’ve read that this was possible, but couldn’t ever find an implementation anywhere.

1. M says:
  
  December 11, 2018 at 5:35 am
  
  There’s no link to the actual hack. Only to a previous HaD article and one on GCC.
  
  1. Al Williams says:
    
    December 11, 2018 at 7:05 am
    
    Not sure why that happened. Fixed now, although [rnjacobs] put the link in the comments (thanks).
    
jacques1956 says:

December 10, 2018 at 7:19 pm

I’d rather use debug.

rnjacobs says:

December 10, 2018 at 7:24 pm

Seems to be missing the link to the github repository in the writeup? Seems to be https://github.com/mniip/BOOTSTRA

Jeremy S Cook says:

December 10, 2018 at 7:57 pm

Not a hack! Woop, woop, woop! /jk

Martian Tech says:

December 10, 2018 at 8:07 pm

It’s pretty easy to write an assembler in PERL, which runs on just about every platform there is.

Examples can be found here for both microcode and assembly language: https://hackaday.io/project/27392-stupid-computer

1. Jac Goudsmit says:
  
  December 10, 2018 at 11:29 pm
  
  Of course a clean MS-DOS install doesn’t have Perl.
  
  GWBASIC/BASICA then?
  
  1. jacques1956 says:
    
    December 11, 2018 at 4:59 am
    
    Right! BASIC definitely would have been a better choice.
    
2. M says:
  
  December 11, 2018 at 6:01 am
  
  microcode?
  
  1. Martian Tech says:
    
    December 11, 2018 at 8:50 am
    
    Microcode for the STUPID computer. The examples are not for the X86 architecture. Sorry if that was confusing.
    
Alan Hightower says:

December 10, 2018 at 8:08 pm

It’s a clever hack, though not a very useful one. He has 143 batch files totaling 2403 lines at 580 KB! For reference MASM 6.11 minimal command line run-time including linker and librarian is 700 KB. Sure one *could* use edlin or copy con to enter all 2403 lines by hand rather than copying over MASM on a floppy.. but .. ? And presumably an earlier version of MASM or another assembler would be smaller. And most of the batch files are not simple. It would be as tedious to enter the printed hex dump of a working assembler in debug as to hand enter the batch files (for a true boot-strap experience).

1. Alan Hightower says:
  
  December 10, 2018 at 8:17 pm
  
  Nevermind. Batch files are 49KB. But a upx’d version of MASM.EXE 3.01 is only 40KB (+13K for LIB.EXE and +23K for LINK.EXE). I’d rather type in the hex dump.
  
RÖB says:

December 10, 2018 at 8:09 pm

Such luxury to have DOS to bootstrap an assembler.

In the days long before the internet there was no such thing as code distribution or sharing except for some pages in a magazine that were usually written in hex.

If you wanted an assembler, then you had to write one in hex. That being such a daunting task led people to tackle a disassembler first, as it was much easier to write: the input is far better defined, you don’t have to store a lot of variables, and it’s simply a single-pass process.

By the time you have finished writing a disassembler, you know how to convert every mnemonic to hex from memory, or by thinking the process through as the op-code decoder does, so there is no longer any real need to write an assembler.

So every time someone asks you if you have finished the “assembler” that you once said you would write, you just answer “not yet” while you are programming directly in hex, without telling them you no longer intend to.

1. heyhono says:
  
  December 11, 2018 at 1:28 am
  
  +1
  
Cbob says:

December 10, 2018 at 8:26 pm

Somehow, I really don’t miss typing in code from PC magazine, Byte or Popular Electronics.

1. playaspec says:
  
  December 11, 2018 at 11:51 pm
  
  You’re missing the point. The *ONLY* way you could get the functionality written about was to enter it into the computer yourself. It was a GREAT way to get young minds into programming.
  
  Now you just download it and use it, but never learn anything from it.
  
1972ish says:

December 11, 2018 at 2:30 am

But then MSDOS is not for REAL Computers, is it?

1. Miroslav says:
  
  December 11, 2018 at 6:29 am
  
  Sure is. Complete control system from a PC tossed in the junk, courtesy of LPT, DOS, and QBasic.
  
  And timing is predictable. OS doesn’t get in your way. Real time system.
  
  1. Rocket Ray says:
    
    December 11, 2018 at 11:15 am
    
    Loved that about DOS.
    
    I once wrote a realtime fairly high speed datalogger in assembly using debug. The little .com streamed the contents from the LPT buffer to a file on the HDD along with a time stamp in CSV. Found I could double the write speed if I removed the loop to close the file. The way to stop datalogging was to power down.
    
    Of course, the system then complained of a corrupted file system on reboot. A very simple program was then written and called on boot to properly close the file. Worked pretty well on curbside (dumpster grade) 286 machines back in the day.
    
    1. Miroslav says:
      
      December 12, 2018 at 6:34 am
      
      Great job! A few years ago I made a crude oscilloscope for the LPT. It had a 500 000 samples/second acquisition speed. But I only got there after I started using a very tight GOTO loop :) in compiled QBasic, instead of LOOP/FOR/WHILE structured cr.p. Some other optimizations were needed as well. But the thing flew.
      
svofski says:

December 11, 2018 at 4:05 am

I’ve read it back and forth a few times but couldn’t find a worked example. Come on, a hello.com file that prints “hello, world”?

1. TheDarkTiger says:
  
  December 11, 2018 at 10:24 am
  
  There is, deep in the git :
  https://github.com/mniip/BOOTSTRA/blob/master/BATAS/README.MD
  At the very end of the file (commit n° 7815b77, which is the last one as of this writing).
  
  1. svofski says:
    
    December 11, 2018 at 3:16 pm
    
    Thanks! They seem to have fixed the article too.
    
2. RÖB says:
  
  December 11, 2018 at 11:59 am
  
  c:\> debug
  -a 100
  1373:0100 mov ah,9
  1373:0102 mov dx,108
  1373:0105 int 21
  1373:0107 ret
  1373:0108 db "Hello world!$"
  1373:0115
  -n c:\hi.com
  -r bx
  BX 0000
  :0
  -r cx
  CX 0000
  :15
  -w
  Writing 00015 bytes
  -q
  
  c:\> c:\hi.com
  Hello world!
  
  1. svofski says:
    
    December 11, 2018 at 3:18 pm
    
    I never doubted your debug.com skills, RÖB ;) But the article is about using command files. They have fixed the link now.
    
