The MASM Forum

General => The Workshop => Topic started by: NoCforMe on September 22, 2022, 09:44:44 AM

Title: Another parser
Post by: NoCforMe on September 22, 2022, 09:44:44 AM
Well, folks, further coding on my editor brought yet another opportunity to use my FSA tokenizing/parsing scheme. This time it was to interpret a user-input line defining a print page header or footer, and it worked like a charm.

In adding a page setup option, I decided to go with Notepad's header/footer definition scheme, which is good enough for government work, as they say. Their syntax allows plain text which gets printed as entered, plus these special tags:


TagMeaning
&lAlign text to left
&cAlign text to center
&rAlign text to right
&fInsert name of file being printed
&dInsert date
&tInsert time
&pInsert page #

Took me just a few hours, from the time I sat down with paper and pencil to when I did a successful proof-of-concept run. (It would have taken even less time if I hadn't made some stupid mistakes caused by copypasta.)

The first diagram below shows the overall scheme: I decided that since I have text that aligns left, center and right, the easiest way to handle this would be to separate the text into 3 separate buffers. Which turned out to be super-simple. When it comes time to actually print the header or footer, I'll first print the left buffer with left alignment, the center buffer with center alignment, the right buffer with right alignment.

The second diagram shows the tokenizer used to accomplish this. Look how (relatively) simple it is. Again, it consists of a data table that drives code using several (8 in this case) stubs, the longest of which is less than 20 statements long; most are much shorter.

I'd like to use this as a challenge to anyone out there who's interested in taking it up: Try to do this same job using whatever method you like. Usually this is going to consist of a whole bunch of IF ... ELSEIF ... statements, and probably setting and checking a whole bunch of flags. I'd like to demonstrate how superior this method is in several ways. Try to write this same code and see how it turns out. (I'll post the code for my method in a follow-up to this.) And be sure to tell us how long it took you (including time sitting scratching your head!).

Here are the details:
For the encoding byte, I use 0FEh to indicate a page #, for example (I picked some high #s with no corresponding ASCII values). When the printing code scans the buffer (any of the 3), it'll substitute the text for the page # there. This isn't part of the parsing job. All you need to do is stick some byte in there.

So see if you're up to this challenge. May the best code win!