arm/nwfpe/netwinder-fpe.rst

*e790a4ceSJonathan Corbet=============
*e790a4ceSJonathan CorbetCurrent State
*e790a4ceSJonathan Corbet=============
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThe following describes the current state of the NetWinder's floating point
*e790a4ceSJonathan Corbetemulator.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetIn the following nomenclature is used to describe the floating point
*e790a4ceSJonathan Corbetinstructions.  It follows the conventions in the ARM manual.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan Corbet::
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan Corbet  <S|D|E> = <single|double|extended>, no default
*e790a4ceSJonathan Corbet  {P|M|Z} = {round to +infinity,round to -infinity,round to zero},
*e790a4ceSJonathan Corbet            default = round to nearest
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetNote: items enclosed in {} are optional.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetFloating Point Coprocessor Data Transfer Instructions (CPDT)
*e790a4ceSJonathan Corbet------------------------------------------------------------
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetLDF/STF - load and store floating
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan Corbet<LDF|STF>{cond}<S|D|E> Fd, Rn
*e790a4ceSJonathan Corbet<LDF|STF>{cond}<S|D|E> Fd, [Rn, #<expression>]{!}
*e790a4ceSJonathan Corbet<LDF|STF>{cond}<S|D|E> Fd, [Rn], #<expression>
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThese instructions are fully implemented.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetLFM/SFM - load and store multiple floating
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetForm 1 syntax:
*e790a4ceSJonathan Corbet<LFM|SFM>{cond}<S|D|E> Fd, <count>, [Rn]
*e790a4ceSJonathan Corbet<LFM|SFM>{cond}<S|D|E> Fd, <count>, [Rn, #<expression>]{!}
*e790a4ceSJonathan Corbet<LFM|SFM>{cond}<S|D|E> Fd, <count>, [Rn], #<expression>
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetForm 2 syntax:
*e790a4ceSJonathan Corbet<LFM|SFM>{cond}<FD,EA> Fd, <count>, [Rn]{!}
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThese instructions are fully implemented.  They store/load three words
*e790a4ceSJonathan Corbetfor each floating point register into the memory location given in the
*e790a4ceSJonathan Corbetinstruction.  The format in memory is unlikely to be compatible with
*e790a4ceSJonathan Corbetother implementations, in particular the actual hardware.  Specific
*e790a4ceSJonathan Corbetmention of this is made in the ARM manuals.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetFloating Point Coprocessor Register Transfer Instructions (CPRT)
*e790a4ceSJonathan Corbet----------------------------------------------------------------
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetConversions, read/write status/control register instructions
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetFLT{cond}<S,D,E>{P,M,Z} Fn, Rd          Convert integer to floating point
*e790a4ceSJonathan CorbetFIX{cond}{P,M,Z} Rd, Fn                 Convert floating point to integer
*e790a4ceSJonathan CorbetWFS{cond} Rd                            Write floating point status register
*e790a4ceSJonathan CorbetRFS{cond} Rd                            Read floating point status register
*e790a4ceSJonathan CorbetWFC{cond} Rd                            Write floating point control register
*e790a4ceSJonathan CorbetRFC{cond} Rd                            Read floating point control register
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetFLT/FIX are fully implemented.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetRFS/WFS are fully implemented.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetRFC/WFC are fully implemented.  RFC/WFC are supervisor only instructions, and
*e790a4ceSJonathan Corbetpresently check the CPU mode, and do an invalid instruction trap if not called
*e790a4ceSJonathan Corbetfrom supervisor mode.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetCompare instructions
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetCMF{cond} Fn, Fm        Compare floating
*e790a4ceSJonathan CorbetCMFE{cond} Fn, Fm       Compare floating with exception
*e790a4ceSJonathan CorbetCNF{cond} Fn, Fm        Compare negated floating
*e790a4ceSJonathan CorbetCNFE{cond} Fn, Fm       Compare negated floating with exception
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThese are fully implemented.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetFloating Point Coprocessor Data Instructions (CPDT)
*e790a4ceSJonathan Corbet---------------------------------------------------
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetDyadic operations:
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetADF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - add
*e790a4ceSJonathan CorbetSUF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - subtract
*e790a4ceSJonathan CorbetRSF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse subtract
*e790a4ceSJonathan CorbetMUF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - multiply
*e790a4ceSJonathan CorbetDVF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - divide
*e790a4ceSJonathan CorbetRDV{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse divide
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThese are fully implemented.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetFML{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - fast multiply
*e790a4ceSJonathan CorbetFDV{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - fast divide
*e790a4ceSJonathan CorbetFRD{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - fast reverse divide
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThese are fully implemented as well.  They use the same algorithm as the
*e790a4ceSJonathan Corbetnon-fast versions.  Hence, in this implementation their performance is
*e790a4ceSJonathan Corbetequivalent to the MUF/DVF/RDV instructions.  This is acceptable according
*e790a4ceSJonathan Corbetto the ARM manual.  The manual notes these are defined only for single
*e790a4ceSJonathan Corbetoperands, on the actual FPA11 hardware they do not work for double or
*e790a4ceSJonathan Corbetextended precision operands.  The emulator currently does not check
*e790a4ceSJonathan Corbetthe requested permissions conditions, and performs the requested operation.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetRMF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - IEEE remainder
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThis is fully implemented.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetMonadic operations:
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetMVF{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - move
*e790a4ceSJonathan CorbetMNF{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - move negated
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThese are fully implemented.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetABS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - absolute value
*e790a4ceSJonathan CorbetSQT{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - square root
*e790a4ceSJonathan CorbetRND{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - round
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThese are fully implemented.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetURD{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - unnormalized round
*e790a4ceSJonathan CorbetNRM{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - normalize
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThese are implemented.  URD is implemented using the same code as the RND
*e790a4ceSJonathan Corbetinstruction.  Since URD cannot return a unnormalized number, NRM becomes
*e790a4ceSJonathan Corbeta NOP.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetLibrary calls:
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetPOW{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - power
*e790a4ceSJonathan CorbetRPW{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse power
*e790a4ceSJonathan CorbetPOL{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - polar angle (arctan2)
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetLOG{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - logarithm to base 10
*e790a4ceSJonathan CorbetLGN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - logarithm to base e
*e790a4ceSJonathan CorbetEXP{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - exponent
*e790a4ceSJonathan CorbetSIN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - sine
*e790a4ceSJonathan CorbetCOS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - cosine
*e790a4ceSJonathan CorbetTAN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - tangent
*e790a4ceSJonathan CorbetASN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arcsine
*e790a4ceSJonathan CorbetACS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arccosine
*e790a4ceSJonathan CorbetATN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arctangent
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThese are not implemented.  They are not currently issued by the compiler,
*e790a4ceSJonathan Corbetand are handled by routines in libc.  These are not implemented by the FPA11
*e790a4ceSJonathan Corbethardware, but are handled by the floating point support code.  They should
*e790a4ceSJonathan Corbetbe implemented in future versions.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetSignalling:
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetSignals are implemented.  However current ELF kernels produced by Rebel.com
*e790a4ceSJonathan Corbethave a bug in them that prevents the module from generating a SIGFPE.  This
*e790a4ceSJonathan Corbetis caused by a failure to alias fp_current to the kernel variable
*e790a4ceSJonathan Corbetcurrent_set[0] correctly.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetThe kernel provided with this distribution (vmlinux-nwfpe-0.93) contains
*e790a4ceSJonathan Corbeta fix for this problem and also incorporates the current version of the
*e790a4ceSJonathan Corbetemulator directly.  It is possible to run with no floating point module
*e790a4ceSJonathan Corbetloaded with this kernel.  It is provided as a demonstration of the
*e790a4ceSJonathan Corbettechnology and for those who want to do floating point work that depends
*e790a4ceSJonathan Corbeton signals.  It is not strictly necessary to use the module.
*e790a4ceSJonathan Corbet
*e790a4ceSJonathan CorbetA module (either the one provided by Russell King, or the one in this
*e790a4ceSJonathan Corbetdistribution) can be loaded to replace the functionality of the emulator
*e790a4ceSJonathan Corbetbuilt into the kernel.