Last week I bought an external floppy drive. On one floppy disk I found some old mainframe programs. I wrote most of them as a student with PL/1.
This was at the time when memory was a rare resource. I adjusted the following program from there for PowerBASIC 3.5. It's a plain DOS program
and in my opinion it belongs in this subforum. If Hutch decides otherwise, I'm okay with that, too.
The application calculates 250 decimal digits of the Euler's number e = 2.718281828... For this purpose it uses an array with 52 elements (208 Byte)
of the data type REAL4. The BASIC program provides the framework. There the variables as well as the array are created and the result is printed.
But the real dirty work is done by the procedures in assembly language.
All floating point operations are realized with the old FPU. The only prerequisite is the 386/387 tandem. That is why the program will run properly under
different configurations. I have tested it under: plain FreeDOS, FreeDOS with VirtualBox and VMware Player, DOS emulation of OS/2 Warp 4 and DOSBox.
The program should also run under the DOS emulation of Windows 95, if someone still has that installed. The output of the program looks like this:
2 71828 18284 59045 23536 2874 71352 66249 77572 47093 69995 95749 66967 62772 40766 30353 54759 45713 82178 52516
64274 27466 39193 20030 59921 81741 35966 29043 57290 3342 95260 59563 7381 32328 62794 34907 63233 82988 7531 95251
1901 15738 34187 93070 21540 89149 93488 41675 9244 76146 6681
Please, press any key to end the application ...
The background of the algorithm has to do with the Taylor series and the rest element. With this method, D. Gillies and D. Wheeler calculated 100 000
decimal digits of e in 1964. In 1971, J. Dutka used it to calculate 1 000 000 digits of the square root of 2. The method can, of course, be accelerated.
However, I deliberately didn't use SSE instructions, since that doesn't work on older processors.
A feedback and test reports from other members would be nice. Many thanks in advance.
Hi Gunther,
Well, it runs okay here. But leading zeroes are suppressed in
your output.
Regards,
Steve N.
Hi Gunther!
FOR i = 0 TO 50
LEN1 = 6 - LEN(STR$(X(i)))
IF LEN1 > 0 THEN
IF i = 0 THEN
STDOUT STRING$(LEN1," ");
ELSE
STDOUT STRING$(LEN1,"0");
END IF
STDOUT MID$(STR$(X(i),5),2,6);
ELSE
STDOUT MID$(STR$(X(i),5),2,6);
END IF
NEXT i
27182818284590452353602874713526624977572470936999595749669676277240766303535475945713821785251664274274663919320030599218174135966290435729003342952605956307381323286279434907632338298807531952510190115738341879307021540891499348841675092447614606681
Please, press any key to end the application ...
Nice to play a little :thumbsup:
Regards, HSE
Steve,
Quote from: FORTRANS on May 02, 2022, 09:47:00 PM
Well, it runs okay here. But leading zeroes are suppressed in
your output.
Yes, I see. But HSE (http://masm32.com/board/index.php?topic=10034.msg109779#msg109779) did fix it.
By the way, what's your DOS configuration?
HSE,
Quote from: HSE on May 02, 2022, 10:49:18 PM
Nice to play a little :thumbsup:
Thank you. Good catch! :thumbsup: As soon as I've time, I'll post the update.
What's your DOS configuration?
Gunther,
Quote from: Gunther on May 03, 2022, 03:03:39 AM
What's your DOS configuration?
That was in DosBox 0.74-3. I have
machine=svga_s3, but I think everything is default. PB was 3.50 and Assembler used was JWAsmR (I builded from v2.15).
Perhaps look better with point:
FOR i = 0 TO 50
LEN1 = 6 - LEN(STR$(X(i)))
IF LEN1 > 0 THEN
IF i = 0 THEN
STDOUT STRING$(LEN1-1," ");
ELSE
STDOUT STRING$(LEN1,"0");
END IF
STDOUT MID$(STR$(X(i),5),2,6);
IF i = 0 THEN STDOUT ".";
ELSE
STDOUT MID$(STR$(X(i),5),2,6);
END IF
NEXT i
Hi,
Quote from: Gunther on May 03, 2022, 03:00:27 AM
By the way, what's your DOS configuration?
On this system, Windows 2000 and OS/2 Warp 4.
Regards,
Steve
HSE,
Quote from: HSE on May 03, 2022, 03:42:40 AM
That was in DosBox 0.74-3. I have machine=svga_s3, but I think everything is default. PB was 3.50 and Assembler used was JWAsmR (I builded from v2.15).
Good to know. DOSBox is quite solid and well suited for testing purposes.
Quote from: HSE on May 03, 2022, 03:42:40 AM
Perhaps look better with point:
Probably. I'll check that.
Steve,
Quote from: FORTRANS on May 03, 2022, 05:38:21 AM
On this system, Windows 2000 and OS/2 Warp 4.
thank you for the information. :thumbsup:
Would it make sense to use SIMD instructions?
The update is in the archive e250r1.zip under the first post in this thread. Now also leading zeros are printed correctly. Also, a dot is printed after the 2.
The output should look like that:
2.7182818284590452353602874713526624977572470936999595749669676277240766303535475945713821785251664274274663919320030599218174135966290435729003342952605956307381323286279434907632338298807531952510190115738341879307021540891499348841675092447614606681
Please, press any key to end the application ...
The improved output loop in lines 124 - 142 in the source code of E250.BAS is a suggestion from HSE. Many thanks to him. :thumbsup: I've also noted this in the source code.
Quote from: Gunther on May 03, 2022, 05:44:15 AM
Steve,
Quote from: FORTRANS on May 03, 2022, 05:38:21 AM
On this system, Windows 2000 and OS/2 Warp 4.
thank you for the information. :thumbsup:
Would it make sense to use SIMD instructions?
Hi,
Well, not on this computer, they are not supported
(P-III MMX). And I assume you mean at least some
version of SSE. And JJ's MASMBASIC requires SSE2
and does not run here.
I suspect, that since you are asking the question,
that it would be of interest to you. But then a DOS
program is probably not the way to go. I have a bit of
difficulty in using DOS timing programs on this system.
And Windows and OS/2 react differently in that regard.
And if you write a program using SIMD code, you
should probably test it against your original program
to see what differences are notable.
Have fun? Keep us posted on any neat programming.
Cheers,
Steve N.
Thank you Steve.
Quote from: FORTRANS on May 03, 2022, 06:43:52 AM
Have fun? Keep us posted on any neat programming.
I'll do my best. :thup:
Quote from: Gunther on May 03, 2022, 05:44:15 AM
Steve,
Quote from: FORTRANS on May 03, 2022, 05:38:21 AM
On this system, Windows 2000 and OS/2 Warp 4.
thank you for the information. :thumbsup:
Would it make sense to use SIMD instructions?
If you move to 32bit SSE is well suited for packed fp instructions together with Taylor series perform 4 parts of calculations SIMD
( I don't know what each x^y/constant part in Taylor series is called in english)
That's quite nice :thumbsup: Just wonder why use DOS instead Windows
Daydreamer,
Quote from: daydreamer on May 03, 2022, 07:50:07 PM
If you move to 32bit SSE is well suited for packed fp instructions together with Taylor series perform 4 parts of calculations SIMD
( I don't know what each x^y/constant part in Taylor series is called in english)
I'm aware of that. The point is that with this algorithm, on each pass of the loop, both the element and its successor and predecessor are iteratively computed.
That makes things difficult. But I'll think about this.
My idea was to use SIMD instrutions to make floating point calculations more convenient. With this algorithm, REAL4 values are already sufficient for the calculation.
High accuracy is not required. Therefore, SIMD arithmetic should be good enough.
Caballero,
Quote from: caballero on May 04, 2022, 03:08:46 AM
That's quite nice :thumbsup: Just wonder why use DOS instead Windows
thank you for the flowers.
I wanted to show that even sophisticated algorithms can be realized with DOS. PowerBASIC is well suited for this purpose. I also didn't feel like implementing
two nested loops in assembly language.
But it's already right: The program can be written under Windows with appropriate modifications. Maybe I'll do that when I've the time and mood.
Quote from: Gunther on May 04, 2022, 04:43:48 AM
I wanted to show that even sophisticated algorithms can be realized with DOS
I agree
I already done the pixelshader way of make circle per pixel check inside or outside radius with sqrt(x*x+y*y),Rossler attractor
But it also more challenging optimize speed when not spoiled by 4.5ghz but the slower emulating frequency
Daydreamer,
Quote from: daydreamer on May 04, 2022, 04:24:09 PM
But it also more challenging optimize speed when not spoiled by 4.5ghz but the slower emulating frequency
that's a good argument.
But you also have to understand me. Our subforum for PowerBASIC is not used very often. The lion's share there is from Hutch. From me is only a very small part.
I am therefore endeavoring to increase this proportion somewhat. I learned about Hutch decades ago through PowerBASIC. There are good reminiscences that I
associate with this compiler. Maybe I'm too nostalgic, who knows?
does PowerBasic for DOS not have enough functionality so that you must use assembler?
or is the use of assembler to improve performance?
if the latter, then how much improvement was achieved by using assembler?
Jack,
Quote from: jack on May 04, 2022, 06:20:27 PM
does PowerBasic for DOS not have enough functionality so that you must use assembler?
not easy to answer. Many things are a matter of taste. It may be enough for a programmer, but not for others. In addition, however, there are hard facts.
The DOS compiler PB 3.5 is from 1997. The new registers and instructions were not known at that time. There is an inline assembler, but it only allows 8086
instructions. The generated code of the compiler is also far from optimal. Even with the metastatement $CPU 80386 the compiler does not use the 32 bit
registers. These points have all been improved by Bob Zale in the Windows compilers. But this is not true for the DOS compiler.
That's why I try to improve certain things with assembly language. Otherwise, the modern processors (Skylake, Buldozzer etc.) only run as a fast clocked 8086.
In my understanding, that's not the point.
Quote from: jack on May 04, 2022, 06:20:27 PM
if the latter, then how much improvement was achieved by using assembler?
Hard to answer. I didn't do a test, but only adjusted the mainframe program accordingly. Even there I had written the inner loops in mainframe assembly language
(System/370). I was still a student at that time. I don't remember if that was required or not. It's just been too long.
The actual archive is e250r2.zip. It contains the same files but with very significant changes. This mainly involves the BASIC source.
I wrote the inner calculation loops and some more parts with the inline assembler. As a result, the program now has almost the same
structure as the main frame source. This simplifies a later port to a pure Windows application with the MASM SDK. Furthermore, this
achieves an increase in speed; a whole series of function calls and thus the associated overhead is unnecessary.
But there's a price to pay. The inline assembler of PowerBASIC for DOS supports only the 8086 instructions. Therefore, special measures
were necessary, e.g. to realize the floating point instructions. This has made the source code a bit longer, but it's well commented.
Moreover, the code is pretty self-explanatory.
A major challenge still remains. The loop to output Euler's number is a suggestion from HSE. This still has to be implemented in assembly
language. Then the whole algorithm can be realized without problems under Win64 and Win32.
Tests and improvement ideas are welcome at any time. Thank you for this in advance.