m68k/fpsp/srem_mod.sa

1.3Scgd*	$NetBSD: srem_mod.sa,v 1.3 1994/10/26 07:49:58 cgd Exp $
1.3Scgd
1.1Smycroft*	MOTOROLA MICROPROCESSOR & MEMORY TECHNOLOGY GROUP
1.1Smycroft*	M68000 Hi-Performance Microprocessor Division
1.1Smycroft*	M68040 Software Package
1.1Smycroft*
1.1Smycroft*	M68040 Software Package Copyright (c) 1993, 1994 Motorola Inc.
1.1Smycroft*	All rights reserved.
1.1Smycroft*
1.1Smycroft*	THE SOFTWARE is provided on an "AS IS" basis and without warranty.
1.1Smycroft*	To the maximum extent permitted by applicable law,
1.1Smycroft*	MOTOROLA DISCLAIMS ALL WARRANTIES WHETHER EXPRESS OR IMPLIED,
1.1Smycroft*	INCLUDING IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A
1.1Smycroft*	PARTICULAR PURPOSE and any warranty against infringement with
1.1Smycroft*	regard to the SOFTWARE (INCLUDING ANY MODIFIED VERSIONS THEREOF)
1.1Smycroft*	and any accompanying written materials.
1.1Smycroft*
1.1Smycroft*	To the maximum extent permitted by applicable law,
1.1Smycroft*	IN NO EVENT SHALL MOTOROLA BE LIABLE FOR ANY DAMAGES WHATSOEVER
1.1Smycroft*	(INCLUDING WITHOUT LIMITATION, DAMAGES FOR LOSS OF BUSINESS
1.1Smycroft*	PROFITS, BUSINESS INTERRUPTION, LOSS OF BUSINESS INFORMATION, OR
1.1Smycroft*	OTHER PECUNIARY LOSS) ARISING OF THE USE OR INABILITY TO USE THE
1.1Smycroft*	SOFTWARE.  Motorola assumes no responsibility for the maintenance
1.1Smycroft*	and support of the SOFTWARE.
1.1Smycroft*
1.1Smycroft*	You are hereby granted a copyright license to use, modify, and
1.1Smycroft*	distribute the SOFTWARE so long as this entire notice is retained
1.1Smycroft*	without alteration in any modified and/or redistributed versions,
1.1Smycroft*	and that such modified versions are clearly identified as such.
1.1Smycroft*	No licenses are granted by implication, estoppel or otherwise
1.1Smycroft*	under any patents or trademarks of Motorola, Inc.
1.1Smycroft
1.1Smycroft*
1.1Smycroft*	srem_mod.sa 3.1 12/10/90
1.1Smycroft*
1.1Smycroft*      The entry point sMOD computes the floating point MOD of the
1.1Smycroft*      input values X and Y. The entry point sREM computes the floating
1.1Smycroft*      point (IEEE) REM of the input values X and Y.
1.1Smycroft*
1.1Smycroft*      INPUT
1.1Smycroft*      -----
1.1Smycroft*      Double-extended value Y is pointed to by address in register
1.1Smycroft*      A0. Double-extended value X is located in -12(A0). The values
1.1Smycroft*      of X and Y are both nonzero and finite; although either or both
1.1Smycroft*      of them can be denormalized. The special cases of zeros, NaNs,
1.1Smycroft*      and infinities are handled elsewhere.
1.1Smycroft*
1.1Smycroft*      OUTPUT
1.1Smycroft*      ------
1.1Smycroft*      FREM(X,Y) or FMOD(X,Y), depending on entry point.
1.1Smycroft*
1.1Smycroft*       ALGORITHM
1.1Smycroft*       ---------
1.1Smycroft*
1.1Smycroft*       Step 1.  Save and strip signs of X and Y: signX := sign(X),
1.1Smycroft*                signY := sign(Y), X := |X|, Y := |Y|,
1.1Smycroft*                signQ := signX EOR signY. Record whether MOD or REM
1.1Smycroft*                is requested.
1.1Smycroft*
1.1Smycroft*       Step 2.  Set L := expo(X)-expo(Y), k := 0, Q := 0.
1.1Smycroft*                If (L < 0) then
1.1Smycroft*                   R := X, go to Step 4.
1.1Smycroft*                else
1.1Smycroft*                   R := 2^(-L)X, j := L.
1.1Smycroft*                endif
1.1Smycroft*
1.1Smycroft*       Step 3.  Perform MOD(X,Y)
1.1Smycroft*            3.1 If R = Y, go to Step 9.
1.1Smycroft*            3.2 If R > Y, then { R := R - Y, Q := Q + 1}
1.1Smycroft*            3.3 If j = 0, go to Step 4.
1.1Smycroft*            3.4 k := k + 1, j := j - 1, Q := 2Q, R := 2R. Go to
1.1Smycroft*                Step 3.1.
1.1Smycroft*
1.1Smycroft*       Step 4.  At this point, R = X - QY = MOD(X,Y). Set
1.1Smycroft*                Last_Subtract := false (used in Step 7 below). If
1.1Smycroft*                MOD is requested, go to Step 6.
1.1Smycroft*
1.1Smycroft*       Step 5.  R = MOD(X,Y), but REM(X,Y) is requested.
1.1Smycroft*            5.1 If R < Y/2, then R = MOD(X,Y) = REM(X,Y). Go to
1.1Smycroft*                Step 6.
1.1Smycroft*            5.2 If R > Y/2, then { set Last_Subtract := true,
1.1Smycroft*                Q := Q + 1, Y := signY*Y }. Go to Step 6.
1.1Smycroft*            5.3 This is the tricky case of R = Y/2. If Q is odd,
1.1Smycroft*                then { Q := Q + 1, signX := -signX }.
1.1Smycroft*
1.1Smycroft*       Step 6.  R := signX*R.
1.1Smycroft*
1.1Smycroft*       Step 7.  If Last_Subtract = true, R := R - Y.
1.1Smycroft*
1.1Smycroft*       Step 8.  Return signQ, last 7 bits of Q, and R as required.
1.1Smycroft*
1.1Smycroft*       Step 9.  At this point, R = 2^(-j)*X - Q Y = Y. Thus,
1.1Smycroft*                X = 2^(j)*(Q+1)Y. set Q := 2^(j)*(Q+1),
1.1Smycroft*                R := 0. Return signQ, last 7 bits of Q, and R.
1.1Smycroft*
1.1Smycroft
1.1SmycroftSREM_MOD    IDNT    2,1 Motorola 040 Floating Point Software Package
1.1Smycroft
1.1Smycroft	section    8
1.1Smycroft
1.1Smycroft	include	fpsp.h
1.1Smycroft
1.1SmycroftMod_Flag  equ	L_SCR3
1.1SmycroftSignY     equ	FP_SCR3+4
1.1SmycroftSignX     equ	FP_SCR3+8
1.1SmycroftSignQ     equ	FP_SCR3+12
1.1SmycroftSc_Flag   equ	FP_SCR4
1.1Smycroft
1.1SmycroftY         equ	FP_SCR1
1.1SmycroftY_Hi      equ	Y+4
1.1SmycroftY_Lo      equ	Y+8
1.1Smycroft
1.1SmycroftR         equ	FP_SCR2
1.1SmycroftR_Hi      equ	R+4
1.1SmycroftR_Lo      equ	R+8
1.1Smycroft
1.1Smycroft
1.1SmycroftScale     DC.L	$00010000,$80000000,$00000000,$00000000
1.1Smycroft
1.1Smycroft	xref	t_avoid_unsupp
1.1Smycroft
1.1Smycroft        xdef        smod
1.1Smycroftsmod:
1.1Smycroft
1.2Smycroft   Clr.L                Mod_Flag(a6)
1.1Smycroft   BRA.B                Mod_Rem
1.1Smycroft
1.1Smycroft        xdef        srem
1.1Smycroftsrem:
1.1Smycroft
1.1Smycroft   Move.L               #1,Mod_Flag(a6)
1.1Smycroft
1.1SmycroftMod_Rem:
1.1Smycroft*..Save sign of X and Y
1.1Smycroft   MoveM.L              D2-D7,-(A7)     ...save data registers
1.1Smycroft   Move.W               (A0),D3
1.1Smycroft   Move.W               D3,SignY(a6)
1.1Smycroft   AndI.L               #$00007FFF,D3   ...Y := |Y|
1.1Smycroft
1.1Smycroft*
1.1Smycroft   Move.L               4(A0),D4
1.1Smycroft   Move.L               8(A0),D5        ...(D3,D4,D5) is |Y|
1.1Smycroft
1.1Smycroft   Tst.L                D3
1.1Smycroft   BNE.B                Y_Normal
1.1Smycroft
1.1Smycroft   Move.L               #$00003FFE,D3	...$3FFD + 1
1.1Smycroft   Tst.L                D4
1.1Smycroft   BNE.B                HiY_not0
1.1Smycroft
1.1SmycroftHiY_0:
1.1Smycroft   Move.L               D5,D4
1.1Smycroft   CLR.L                D5
1.1Smycroft   SubI.L               #32,D3
1.1Smycroft   CLR.L                D6
1.1Smycroft   BFFFO                D4{0:32},D6
1.1Smycroft   LSL.L                D6,D4
1.1Smycroft   Sub.L                D6,D3           ...(D3,D4,D5) is normalized
1.1Smycroft*                                       ...with bias $7FFD
1.1Smycroft   BRA.B                Chk_X
1.1Smycroft
1.1SmycroftHiY_not0:
1.1Smycroft   CLR.L                D6
1.1Smycroft   BFFFO                D4{0:32},D6
1.1Smycroft   Sub.L                D6,D3
1.1Smycroft   LSL.L                D6,D4
1.1Smycroft   Move.L               D5,D7           ...a copy of D5
1.1Smycroft   LSL.L                D6,D5
1.1Smycroft   Neg.L                D6
1.1Smycroft   AddI.L               #32,D6
1.1Smycroft   LSR.L                D6,D7
1.1Smycroft   Or.L                 D7,D4           ...(D3,D4,D5) normalized
1.1Smycroft*                                       ...with bias $7FFD
1.1Smycroft   BRA.B                Chk_X
1.1Smycroft
1.1SmycroftY_Normal:
1.1Smycroft   AddI.L               #$00003FFE,D3   ...(D3,D4,D5) normalized
1.1Smycroft*                                       ...with bias $7FFD
1.1Smycroft
1.1SmycroftChk_X:
1.1Smycroft   Move.W               -12(A0),D0
1.1Smycroft   Move.W               D0,SignX(a6)
1.1Smycroft   Move.W               SignY(a6),D1
1.1Smycroft   EOr.L                D0,D1
1.1Smycroft   AndI.L               #$00008000,D1
1.1Smycroft   Move.W               D1,SignQ(a6)	...sign(Q) obtained
1.1Smycroft   AndI.L               #$00007FFF,D0
1.1Smycroft   Move.L               -8(A0),D1
1.1Smycroft   Move.L               -4(A0),D2       ...(D0,D1,D2) is |X|
1.1Smycroft   Tst.L                D0
1.1Smycroft   BNE.B                X_Normal
1.1Smycroft   Move.L               #$00003FFE,D0
1.1Smycroft   Tst.L                D1
1.1Smycroft   BNE.B                HiX_not0
1.1Smycroft
1.1SmycroftHiX_0:
1.1Smycroft   Move.L               D2,D1
1.1Smycroft   CLR.L                D2
1.1Smycroft   SubI.L               #32,D0
1.1Smycroft   CLR.L                D6
1.1Smycroft   BFFFO                D1{0:32},D6
1.1Smycroft   LSL.L                D6,D1
1.1Smycroft   Sub.L                D6,D0           ...(D0,D1,D2) is normalized
1.1Smycroft*                                       ...with bias $7FFD
1.1Smycroft   BRA.B                Init
1.1Smycroft
1.1SmycroftHiX_not0:
1.1Smycroft   CLR.L                D6
1.1Smycroft   BFFFO                D1{0:32},D6
1.1Smycroft   Sub.L                D6,D0
1.1Smycroft   LSL.L                D6,D1
1.1Smycroft   Move.L               D2,D7           ...a copy of D2
1.1Smycroft   LSL.L                D6,D2
1.1Smycroft   Neg.L                D6
1.1Smycroft   AddI.L               #32,D6
1.1Smycroft   LSR.L                D6,D7
1.1Smycroft   Or.L                 D7,D1           ...(D0,D1,D2) normalized
1.1Smycroft*                                       ...with bias $7FFD
1.1Smycroft   BRA.B                Init
1.1Smycroft
1.1SmycroftX_Normal:
1.1Smycroft   AddI.L               #$00003FFE,D0   ...(D0,D1,D2) normalized
1.1Smycroft*                                       ...with bias $7FFD
1.1Smycroft
1.1SmycroftInit:
1.1Smycroft*
1.1Smycroft   Move.L               D3,L_SCR1(a6)   ...save biased expo(Y)
1.1Smycroft   move.l		d0,L_SCR2(a6)	;save d0
1.1Smycroft   Sub.L                D3,D0           ...L := expo(X)-expo(Y)
1.1Smycroft*   Move.L               D0,L            ...D0 is j
1.1Smycroft   CLR.L                D6              ...D6 := carry <- 0
1.1Smycroft   CLR.L                D3              ...D3 is Q
1.1Smycroft   MoveA.L              #0,A1           ...A1 is k; j+k=L, Q=0
1.1Smycroft
1.1Smycroft*..(Carry,D1,D2) is R
1.1Smycroft   Tst.L                D0
1.1Smycroft   BGE.B                Mod_Loop
1.1Smycroft
1.1Smycroft*..expo(X) < expo(Y). Thus X = mod(X,Y)
1.1Smycroft*
1.1Smycroft   move.l		L_SCR2(a6),d0	;restore d0
1.1Smycroft   BRA.W                Get_Mod
1.1Smycroft
1.1Smycroft*..At this point  R = 2^(-L)X; Q = 0; k = 0; and  k+j = L
1.1Smycroft
1.1Smycroft
1.1SmycroftMod_Loop:
1.1Smycroft   Tst.L                D6              ...test carry bit
1.1Smycroft   BGT.B                R_GT_Y
1.1Smycroft
1.1Smycroft*..At this point carry = 0, R = (D1,D2), Y = (D4,D5)
1.1Smycroft   Cmp.L                D4,D1           ...compare hi(R) and hi(Y)
1.1Smycroft   BNE.B                R_NE_Y
1.1Smycroft   Cmp.L                D5,D2           ...compare lo(R) and lo(Y)
1.1Smycroft   BNE.B                R_NE_Y
1.1Smycroft
1.1Smycroft*..At this point, R = Y
1.1Smycroft   BRA.W                Rem_is_0
1.1Smycroft
1.1SmycroftR_NE_Y:
1.1Smycroft*..use the borrow of the previous compare
1.1Smycroft   BCS.B                R_LT_Y          ...borrow is set iff R < Y
1.1Smycroft
1.1SmycroftR_GT_Y:
1.1Smycroft*..If Carry is set, then Y < (Carry,D1,D2) < 2Y. Otherwise, Carry = 0
1.1Smycroft*..and Y < (D1,D2) < 2Y. Either way, perform R - Y
1.1Smycroft   Sub.L                D5,D2           ...lo(R) - lo(Y)
1.1Smycroft   SubX.L               D4,D1           ...hi(R) - hi(Y)
1.1Smycroft   CLR.L                D6              ...clear carry
1.1Smycroft   AddQ.L               #1,D3           ...Q := Q + 1
1.1Smycroft
1.1SmycroftR_LT_Y:
1.1Smycroft*..At this point, Carry=0, R < Y. R = 2^(k-L)X - QY; k+j = L; j >= 0.
1.1Smycroft   Tst.L                D0              ...see if j = 0.
1.1Smycroft   BEQ.B                PostLoop
1.1Smycroft
1.1Smycroft   Add.L                D3,D3           ...Q := 2Q
1.1Smycroft   Add.L                D2,D2           ...lo(R) = 2lo(R)
1.2Smycroft   AddX.L               D1,D1           ...hi(R) = 2hi(R) + carry
1.1Smycroft   SCS                  D6              ...set Carry if 2(R) overflows
1.1Smycroft   AddQ.L               #1,A1           ...k := k+1
1.1Smycroft   SubQ.L               #1,D0           ...j := j - 1
1.1Smycroft*..At this point, R=(Carry,D1,D2) = 2^(k-L)X - QY, j+k=L, j >= 0, R < 2Y.
1.1Smycroft
1.1Smycroft   BRA.B                Mod_Loop
1.1Smycroft
1.1SmycroftPostLoop:
1.1Smycroft*..k = L, j = 0, Carry = 0, R = (D1,D2) = X - QY, R < Y.
1.1Smycroft
1.1Smycroft*..normalize R.
1.1Smycroft   Move.L               L_SCR1(a6),D0           ...new biased expo of R
1.1Smycroft   Tst.L                D1
1.1Smycroft   BNE.B                HiR_not0
1.1Smycroft
1.1SmycroftHiR_0:
1.1Smycroft   Move.L               D2,D1
1.1Smycroft   CLR.L                D2
1.1Smycroft   SubI.L               #32,D0
1.1Smycroft   CLR.L                D6
1.1Smycroft   BFFFO                D1{0:32},D6
1.1Smycroft   LSL.L                D6,D1
1.1Smycroft   Sub.L                D6,D0           ...(D0,D1,D2) is normalized
1.1Smycroft*                                       ...with bias $7FFD
1.1Smycroft   BRA.B                Get_Mod
1.1Smycroft
1.1SmycroftHiR_not0:
1.1Smycroft   CLR.L                D6
1.1Smycroft   BFFFO                D1{0:32},D6
1.1Smycroft   BMI.B                Get_Mod         ...already normalized
1.1Smycroft   Sub.L                D6,D0
1.1Smycroft   LSL.L                D6,D1
1.1Smycroft   Move.L               D2,D7           ...a copy of D2
1.1Smycroft   LSL.L                D6,D2
1.1Smycroft   Neg.L                D6
1.1Smycroft   AddI.L               #32,D6
1.1Smycroft   LSR.L                D6,D7
1.1Smycroft   Or.L                 D7,D1           ...(D0,D1,D2) normalized
1.1Smycroft
1.1Smycroft*
1.1SmycroftGet_Mod:
1.1Smycroft   CmpI.L		#$000041FE,D0
1.1Smycroft   BGE.B		No_Scale
1.1SmycroftDo_Scale:
1.1Smycroft   Move.W		D0,R(a6)
1.1Smycroft   clr.w		R+2(a6)
1.1Smycroft   Move.L		D1,R_Hi(a6)
1.1Smycroft   Move.L		D2,R_Lo(a6)
1.1Smycroft   Move.L		L_SCR1(a6),D6
1.1Smycroft   Move.W		D6,Y(a6)
1.1Smycroft   clr.w		Y+2(a6)
1.1Smycroft   Move.L		D4,Y_Hi(a6)
1.1Smycroft   Move.L		D5,Y_Lo(a6)
1.1Smycroft   FMove.X		R(a6),fp0		...no exception
1.1Smycroft   Move.L		#1,Sc_Flag(a6)
1.1Smycroft   BRA.B		ModOrRem
1.1SmycroftNo_Scale:
1.1Smycroft   Move.L		D1,R_Hi(a6)
1.1Smycroft   Move.L		D2,R_Lo(a6)
1.1Smycroft   SubI.L		#$3FFE,D0
1.1Smycroft   Move.W		D0,R(a6)
1.1Smycroft   clr.w		R+2(a6)
1.1Smycroft   Move.L		L_SCR1(a6),D6
1.1Smycroft   SubI.L		#$3FFE,D6
1.1Smycroft   Move.L		D6,L_SCR1(a6)
1.1Smycroft   FMove.X		R(a6),fp0
1.1Smycroft   Move.W		D6,Y(a6)
1.1Smycroft   Move.L		D4,Y_Hi(a6)
1.1Smycroft   Move.L		D5,Y_Lo(a6)
1.2Smycroft   Clr.L		Sc_Flag(a6)
1.1Smycroft
1.1Smycroft*
1.1Smycroft
1.1Smycroft
1.1SmycroftModOrRem:
1.1Smycroft   Move.L               Mod_Flag(a6),D6
1.1Smycroft   BEQ.B                Fix_Sign
1.1Smycroft
1.1Smycroft   Move.L               L_SCR1(a6),D6           ...new biased expo(Y)
1.1Smycroft   SubQ.L               #1,D6           ...biased expo(Y/2)
1.1Smycroft   Cmp.L                D6,D0
1.1Smycroft   BLT.B                Fix_Sign
1.1Smycroft   BGT.B                Last_Sub
1.1Smycroft
1.1Smycroft   Cmp.L                D4,D1
1.1Smycroft   BNE.B                Not_EQ
1.1Smycroft   Cmp.L                D5,D2
1.1Smycroft   BNE.B                Not_EQ
1.1Smycroft   BRA.W                Tie_Case
1.1Smycroft
1.1SmycroftNot_EQ:
1.1Smycroft   BCS.B                Fix_Sign
1.1Smycroft
1.1SmycroftLast_Sub:
1.1Smycroft*
1.1Smycroft   FSub.X		Y(a6),fp0		...no exceptions
1.1Smycroft   AddQ.L               #1,D3           ...Q := Q + 1
1.1Smycroft
1.1Smycroft*
1.1Smycroft
1.1SmycroftFix_Sign:
1.1Smycroft*..Get sign of X
1.1Smycroft   Move.W               SignX(a6),D6
1.1Smycroft   BGE.B		Get_Q
1.1Smycroft   FNeg.X		fp0
1.1Smycroft
1.1Smycroft*..Get Q
1.1Smycroft*
1.1SmycroftGet_Q:
1.1Smycroft   clr.l		d6
1.1Smycroft   Move.W               SignQ(a6),D6        ...D6 is sign(Q)
1.1Smycroft   Move.L               #8,D7
1.1Smycroft   LSR.L                D7,D6
1.1Smycroft   AndI.L               #$0000007F,D3   ...7 bits of Q
1.1Smycroft   Or.L                 D6,D3           ...sign and bits of Q
1.1Smycroft   Swap                 D3
1.1Smycroft   FMove.L              fpsr,D6
1.1Smycroft   AndI.L               #$FF00FFFF,D6
1.1Smycroft   Or.L                 D3,D6
1.1Smycroft   FMove.L              D6,fpsr         ...put Q in fpsr
1.1Smycroft
1.1Smycroft*
1.1SmycroftRestore:
1.1Smycroft   MoveM.L              (A7)+,D2-D7
1.1Smycroft   FMove.L              USER_FPCR(a6),fpcr
1.1Smycroft   Move.L               Sc_Flag(a6),D0
1.1Smycroft   BEQ.B                Finish
1.1Smycroft   FMul.X		Scale(pc),fp0	...may cause underflow
1.1Smycroft   bra			t_avoid_unsupp	;check for denorm as a
1.1Smycroft*					;result of the scaling
1.1Smycroft
1.1SmycroftFinish:
1.1Smycroft	fmove.x		fp0,fp0		;capture exceptions & round
1.1Smycroft	rts
1.1Smycroft
1.1SmycroftRem_is_0:
1.1Smycroft*..R = 2^(-j)X - Q Y = Y, thus R = 0 and quotient = 2^j (Q+1)
1.1Smycroft   AddQ.L               #1,D3
1.1Smycroft   CmpI.L               #8,D0           ...D0 is j
1.1Smycroft   BGE.B                Q_Big
1.1Smycroft
1.1Smycroft   LSL.L                D0,D3
1.1Smycroft   BRA.B                Set_R_0
1.1Smycroft
1.1SmycroftQ_Big:
1.1Smycroft   CLR.L                D3
1.1Smycroft
1.1SmycroftSet_R_0:
1.1Smycroft   FMove.S		#:00000000,fp0
1.2Smycroft   Clr.L		Sc_Flag(a6)
1.1Smycroft   BRA.W                Fix_Sign
1.1Smycroft
1.1SmycroftTie_Case:
1.1Smycroft*..Check parity of Q
1.1Smycroft   Move.L               D3,D6
1.1Smycroft   AndI.L               #$00000001,D6
1.1Smycroft   Tst.L                D6
1.1Smycroft   BEq.W                Fix_Sign	...Q is even
1.1Smycroft
1.1Smycroft*..Q is odd, Q := Q + 1, signX := -signX
1.1Smycroft   AddQ.L               #1,D3
1.1Smycroft   Move.W               SignX(a6),D6
1.1Smycroft   EOrI.L               #$00008000,D6
1.1Smycroft   Move.W               D6,SignX(a6)
1.1Smycroft   BRA.W                Fix_Sign
1.1Smycroft
1.1Smycroft   End