x87 functions #1161

Torinde · 2024-03-26T15:49:57Z

x87 (instructions list: current, obsolete Intel/IIT/Cyrix, obsolete NEC: part1, part2)

- code for most of those is available in SoftFloat (Bochs, 86Box, QEMU, Berkeley)
- useful even on x86, because x87 may be removed from a future x86 CPU (already deprecated in Windows, unsupported in MSVC)
- POWER9, z, RISC-V Q/L support 128-bit precision, which may speed up emulating the 80-bit x87
- relevant for DOS/Win9x/WinXP emulators (for games and other software)
  - some of which are still used even for civil engineering calculations
  - especially when run on non-x86 platforms, where precision is sometimes reduced to 64-bit
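
To make the precision point concrete: the x87 double-extended format carries a 64-bit significand, while binary64 carries 53 bits. The following is a minimal sketch using exact rational arithmetic. `round_to_significand` is a hypothetical helper written for this illustration (not part of SoftFloat or any emulator), and it deliberately ignores exponent range, subnormals, and non-positive inputs.

```python
from fractions import Fraction

def round_to_significand(value: Fraction, bits: int) -> Fraction:
    """Round a positive rational to `bits` significant bits, ties to even.

    Hypothetical helper for illustration: exponent limits and subnormals
    are not modeled.
    """
    if value <= 0:
        raise ValueError("this sketch handles positive values only")
    # Find e such that 2**e <= value < 2**(e + 1).
    e = value.numerator.bit_length() - value.denominator.bit_length()
    if Fraction(2) ** e > value:
        e -= 1
    # Scale so the significand becomes an integer of `bits` bits, then round.
    unit = Fraction(2) ** (e - bits + 1)
    return round(value / unit) * unit  # round() on a Fraction is half-to-even

x = 1 + Fraction(1, 2**60)
print(round_to_significand(x, 64) == x)  # 64-bit significand keeps the low bit
print(round_to_significand(x, 53) == 1)  # 53-bit significand rounds it away
```

A value such as 1 + 2^-60 survives in the 80-bit format but collapses to 1 when an emulator falls back to 64-bit doubles.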

CPU flag in Linux is fpu.

x87 Non-Waiting FPU Control Instructions

x87 Floating-point Load/Store/Move Instructions

x87 Integer Load/Store Instructions

x87 Basic Arithmetic Instructions

x87 Basic Arithmetic Instructions with Stack Pop

x87 Basic Arithmetic Instructions with Integer Source Argument

x87 Additional Arithmetic Instructions

FCHS D9 E0 Floating-point change sign
FABS D9 E1 Floating-point absolute value
FTST D9 E4 Floating-point compare top-of-stack value to 0
FXAM D9 E5 Classify top-of-stack st(0) register value.
FXTRACT D9 F4 Split the st(0) value into two values E and M representing the exponent and mantissa of st(0).
FPREM D9 F8 Floating-point partial remainder (not IEEE 754 compliant)
FSQRT D9 FA Floating-point square root
FRNDINT D9 FC Floating-point round to integer
FSCALE D9 FD Floating-point power-of-2 scaling. Rounds the value of st(1) to integer with round-to-zero, then uses it as a scale factor for st(0): st(0) ← st(0) × 2^trunc(st(1))
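
As an illustration of two of the rows above, here is a rough sketch of the FXTRACT and FSCALE semantics. It assumes finite, nonzero inputs and uses Python's binary64 `float`, so it shows the behaviour but not 80-bit precision. The function names are made up for this example.

```python
import math

def fxtract(st0: float):
    """Split st0 into (exponent, significand) with 1 <= |significand| < 2.

    Sketch for finite, nonzero inputs only; zero, infinity and NaN are
    special cases that are not modeled here.
    """
    m, e = math.frexp(st0)        # st0 == m * 2**e with 0.5 <= |m| < 1
    return float(e - 1), m * 2.0  # renormalize to the [1, 2) convention

def fscale(st0: float, st1: float) -> float:
    """Scale st0 by 2 raised to st1 truncated toward zero."""
    return math.ldexp(st0, math.trunc(st1))

exponent, significand = fxtract(10.0)
print(exponent, significand)      # 10.0 == 1.25 * 2**3
print(fscale(significand, exponent))
```

Under these assumptions the two operations are inverses: scaling the extracted significand by the extracted exponent gives back the original value.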

x87 Transcendental Instructions

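F2XM1 D9 F0 Base-2 exponential minus 1, with extra precision for st(0) close to 0: st(0) ← 2^st(0) − 1
FYL2X D9 F1 Base-2 logarithm: st(1) ← st(1) × log2(st(0)), followed by a stack pop
FPTAN D9 F2 Partial tangent: computes from st(0) a pair of values X and Y such that Y/X = tan(st(0)); Y replaces st(0) and X is then pushed
FPATAN D9 F3 Two-argument arctangent with quadrant adjustment: st(1) ← arctan(st(1)/st(0)), followed by a stack pop
FYL2XP1 D9 F9 Base-2 logarithm plus 1, with extra precision for st(0) close to 0: st(1) ← st(1) × log2(st(0) + 1), followed by a stack pop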
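
The "extra precision close to 0" wording can be seen with a small sketch. It assumes binary64 arithmetic (Python `float`), not the 80-bit format, and the function names are hypothetical stand-ins for the instructions; operand range checks and stack effects are not modeled.

```python
import math

LN2 = math.log(2.0)

def f2xm1(st0: float) -> float:
    """2**st0 - 1 without cancellation for small st0."""
    return math.expm1(st0 * LN2)

def fyl2x(st0: float, st1: float) -> float:
    """st1 * log2(st0)."""
    return st1 * math.log2(st0)

def fyl2xp1(st0: float, st1: float) -> float:
    """st1 * log2(st0 + 1) without losing small st0 in the addition."""
    return st1 * math.log1p(st0) / LN2

def fpatan(st0: float, st1: float) -> float:
    """arctan(st1 / st0) with quadrant adjustment."""
    return math.atan2(st1, st0)

tiny = 1e-20
print(2.0 ** tiny - 1.0)  # naive form: the result is lost to rounding
print(f2xm1(tiny))        # dedicated form keeps a nonzero result
```

This is why the instruction set has separate "minus 1" and "plus 1" forms: computing 2^x and then subtracting 1 (or adding 1 and then taking the logarithm) discards the small operand before the interesting part of the computation happens.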

Other x87 Instructions

FNOP D9 D0 No operation
FDECSTP D9 F6 Decrement x87 FPU Register Stack Pointer
FINCSTP D9 F7 Increment x87 FPU Register Stack Pointer
FFREE st(i) DD C0+i Free x87 FPU Register
WAIT, FWAIT 9B Check and handle pending unmasked x87 FPU exceptions
FSTPNCE st(i) D9 D8+i Floating-point store and pop, without stack underflow exception
FFREEP st(i) DF C0+i Free x87 register, then stack pop

x87 Non-Waiting Control Instructions added in 80287

FNSETPM DB E4 FSETPM Notify FPU of entry into Protected Mode
FNSTSW AX DF E0 FSTSW AX Store x87 Status Word to AX

x87 Instructions added in 80387

FUCOM st(i) DD E0+i Floating-point unordered compare.
FUCOMP st(i) DD E8+i Floating-point unordered compare and pop
FUCOMPP DA E9 Floating-point unordered compare to st(1), then pop twice
FPREM1 D9 F5 IEEE 754 compliant floating-point partial remainder.
FSINCOS D9 FB Floating-point sine and cosine.
FSIN D9 FE Floating-point sine.
FCOS D9 FF Floating-point cosine.
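
The difference between FPREM and the IEEE 754 compliant FPREM1 comes down to how the quotient is rounded. A small sketch, assuming binary64 inputs and showing only the fully reduced result (the hardware instructions are "partial" and may need to be repeated for large exponent differences, which is not modeled here). The function names are hypothetical.

```python
import math

def fprem_final(st0: float, st1: float) -> float:
    """Remainder with the quotient truncated toward zero (FPREM style)."""
    return math.fmod(st0, st1)

def fprem1_final(st0: float, st1: float) -> float:
    """Remainder with the quotient rounded to nearest (IEEE 754 style)."""
    return math.remainder(st0, st1)

print(fprem_final(5.0, 3.0))   # quotient 1, so the remainder keeps st0's sign
print(fprem1_final(5.0, 3.0))  # quotient 2, so the remainder can be negative
```

An emulation layer has to keep the two apart, since code written for FPREM expects a result with the same sign as the dividend.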

x87 Instructions added in Pentium Pro

x87 Non-Waiting Instructions added in Pentium II, AMD K7 and SSE

FXSAVE m512byte NP 0F AE /0 FXSAVE64 m512byte Save x87, MMX and SSE state to 512-byte data structure _fxsave64
FXRSTOR m512byte NP 0F AE /1 FXRSTOR64 m512byte Restore x87, MMX and SSE state from 512-byte data structure _fxrstor64

x87 Instructions added as part of SSE3

FISTTP m16 DF /1 Floating-point store integer and pop, with round-to-zero
FISTTP m32 DB /1 Floating-point store integer and pop, with round-to-zero
FISTTP m64 DD /1 Floating-point store integer and pop, with round-to-zero
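
To show what "with round-to-zero" changes, here is a sketch comparing FISTTP with a store that follows the rounding-control field, assuming that field is at its default of round-to-nearest-even. Destination width and out-of-range handling are not modeled, and the function names are hypothetical.

```python
import math

def fisttp(st0: float) -> int:
    """Convert to integer by truncating toward zero, whatever the rounding mode."""
    return math.trunc(st0)

def fistp_default(st0: float) -> int:
    """Convert to integer with round-to-nearest-even (assumed default mode)."""
    return round(st0)

for value in (2.5, 2.7, -2.7, 3.5):
    print(value, fisttp(value), fistp_default(value))
```

Truncation matches what a C-style float-to-integer cast requires, which is presumably why a dedicated instruction is convenient: no change of the control word is needed around the store.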

x87 Instructions present in specific 80387 models

x87 Instructions present in NEC μPD72091

x87 Instructions present in NEC μPD72191/D9008D

FPOWER computes the power function x^y. This function is difficult to implement, both because of its complex definition and because sufficient accuracy is hard to achieve. The identity x^y = e^(y*ln(x)) does not give good accuracy on its own, because the error of the log function is amplified by the exponential function. The FPP solves this problem by providing a 74-bit data width for the mantissa data bus.
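
The amplification mentioned above can be estimated numerically. If the logarithm carries a relative error d, the result becomes e^(y*ln(x)*(1+d)), which is x^y times roughly (1 + y*ln(x)*d), so the relative error grows by a factor of about |y*ln(x)|. The sketch below checks this with Python's `decimal` module at an arbitrarily chosen 60 digits of working precision. The `amplification` helper is hypothetical and says nothing about how the NEC hardware is actually implemented.

```python
from decimal import Decimal, localcontext

def amplification(x, y, rel_err):
    """Ratio of the relative error in x**y to the relative error in ln(x).

    Hypothetical helper: perturbs ln(x) by `rel_err` and measures how far
    exp(y * ln(x)) moves, using 60-digit decimal arithmetic.
    """
    with localcontext() as ctx:
        ctx.prec = 60
        x, y, rel_err = Decimal(x), Decimal(y), Decimal(rel_err)
        exact = (y * x.ln()).exp()
        perturbed = (y * x.ln() * (1 + rel_err)).exp()
        return (perturbed / exact - 1) / rel_err

# For 10**300 the factor is close to 300 * ln(10), i.e. several hundred.
print(amplification(10, 300, "1e-19"))
```

An error of one unit in the last place of the logarithm therefore costs several low-order bits in the result for large exponents, which is consistent with the source's remark that extra mantissa width is needed for the intermediate value.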


mr-c · 2024-03-26T15:57:32Z

@Torinde Do you know of any header files for these functions?

Torinde · 2024-03-26T17:32:03Z

> Do you know of any header files for these functions?

No. @kklobe, do you know a header file for x87 functions?

kklobe · 2024-03-29T13:13:56Z

> > Do you know of any header files for these functions?
>
> No. @kklobe, do you know a header file for x87 functions?

I'm not aware of any. I think a header file for these functions would be a tall order, especially on non-x86 platforms, which would have to perform the 80-bit extended-precision calculations themselves.

Torinde · 2024-03-31T13:53:45Z

Isn't that taken care of by SoftFloat (and the projects using it - see links at the first bullet in OP)?

> The latest release of SoftFloat implements five floating-point formats: 16-bit half-precision, 32-bit single-precision, 64-bit double-precision, 80-bit double-extended-precision, and 128-bit quadruple-precision.
> All required rounding modes, exception flags, and special values are supported.
> Fused multiply-add is also implemented for all formats except 80-bit double-extended-precision.
> Target-specific code is provided for various Intel x86 and ARM processors.

kklobe · 2024-03-31T14:02:59Z

> Isn't that taken care of by SoftFloat (and the projects using it - see links at the first bullet in OP)?
>
> > The latest release of SoftFloat implements five floating-point formats: 16-bit half-precision, 32-bit single-precision, 64-bit double-precision, 80-bit double-extended-precision, and 128-bit quadruple-precision.
> > All required rounding modes, exception flags, and special values are supported.
> > Fused multiply-add is also implemented for all formats except 80-bit double-extended-precision.
> > Target-specific code is provided for various Intel x86 and ARM processors.

That strikes me as quite outside the scope of this project. The x87 instructions aren't really SIMD, and supporting them would require adding something like SoftFloat as a dependency, so you would no longer have a header-only solution for translating from one SIMD instruction set to another.

If I'm misunderstanding your suggestion, let me know.

Torinde · 2024-03-31T16:21:12Z

I thought parts of SoftFloat can be useful for the creation of a header file.

mr-c added the instruction-set-support Implementing new SIMD ISA extensions portably label Mar 26, 2024
