SlideShare a Scribd company logo
Natarajan Meghanathan et al. (Eds) : NeCoM, CSITEC - 2016
pp. 19โ€“30, 2016. ยฉ CS & IT-CSCP 2016 DOI : 10.5121/csit.2016.61203
A FLOATING POINT DIVISION UNIT
BASED ON TAYLOR-SERIES EXPANSION
ALGORITHM AND ITERATIVE
LOGARITHMIC MULTIPLIER
Riyansh K. Karani, Akash K. Rana, Dhruv H. Reshamwala and
Kishore Saldanha
Electronics Department,
Dwarkadas J. Sanghvi College of Engineering, Mumbai
riyansh.karani.011235@gmail.com
akash9182akash@gmail.com
dhr.reshamwala@gmail.com
kishoresaldanha@gmail.com
ABSTRACT
Floating point division, even though being an infrequent operation in the traditional sense, is
indis-pensable when it comes to a range of non-traditional applications such as K-Means
Clustering and QR Decomposition just to name a few. In such applications, hardware support
for floating point division would boost the performance of the entire system. In this paper, we
present a novel architecture for a floating point division unit based on the Taylor-series
expansion algorithm. We show that the Iterative Logarithmic Multiplier is very well suited to be
used as a part of this architecture. We propose an implementation of the powering unit that can
calculate an odd power and an even power of a number simultaneously, meanwhile having little
hardware overhead when compared to the Iterative Logarithmic Multiplier.
KEYWORDS
Floating point division, Iterative Logarithmic Multiplier, Taylor-series
1. INTRODUCTION
Approximation methods such as Newton-Raphson and Taylor-series can be used to approximate
functions where direct computation of these functions is either computationally very expensive or
not possible. The Taylor-series expansion is an approximation method that generates a high order
polynomial approximation of a function at some value in its domain. The idea is simple; given
that a function and its first derivatives are continuous at some point in its domain, the function
can be approximated by a polynomial of degree at that point. The higher the order of this
polynomial, the better is the approximation. Following on this idea, the reciprocal of a number
can be approximated as a very simple Taylor-series polynomial, and thus the problem of division
of one number by another is essentially reduced to multiplication of one number and the Taylor-
20 Computer Science & Information Technology (CS & IT)
series polynomial of the other [6]. Calculating the terms of the Taylor-series polynomial that
approximates the reciprocal of a number involves calculating the powers of itself
where calculating each higher power generates a better approximation.
Naturally, there is a need of a multiplier unit to calculate these powers, and the performance of
the floating point division unit then depends almost solely on the performance of the multiplier
unit. There are many popular multiplier architectures in use today [3]. The Iterative Logarithmic
Multiplier is one such multiplier architecture [12]. What makes it an attractive choice is that it is
highly programmable. The accuracy of the product generated by this multiplier can be precisely
controlled, which comes in very handy when one does not need full precision multiplication, like
in the case of digital signal processing.
But perhaps what is even more important is that because of its inherent nature, its implementation
is very hardware efficient when it comes to computation of squares. This is because the
mathematical description of the Iterative Logarithmic Multiplier is quite simplified when
multiplying a number with itself, as when compared to multiplying two different numbers.
Because every even power of a number can be represented as a square of a smaller power
of the number every alternate power of is representable as a square of some other
number. Hence, the Iterative Logarithmic Multiplier is a very suitable candidate when choosing a
multiplier architecture for calculating the terms of the Taylor-series polynomial approximation of
a reciprocal.
In this paper, we present an architecture for a floating point division unit based on the Iterative
Logarithmic Multiplication algorithm, and the Taylor-series expansion algorithm. The proposed
architecture is designed to be hardware efficient, as is the requirement when designing
architectures for high speed computational units. We start by describing the Taylor-series
expansion algorithm in section II, and analyse the approximation errors generated when using this
approach. We then introduce the methodology for calculating the reciprocal of a number using
the Taylor-series approach, and derive a quantitative measure of the generated error. Since the
approach above requires an initial approximation of the reciprocal, in section III, we start by
describing and analysing linear approximation as a possible approach. We then build on this to
present the piecewise linear approximation based approach that we have employed in our
implementation of the floating point division unit. In section IV, we describe the Iterative
Logarithmic Multiplier as proposed by Babicอ , Avramovicอ and Bulicอ [12], followed by the
discussion and implementation of the proposed squaring unit in section V. Finally, we present the
architecture for the proposed powering unit in section VI, and discuss its implementation and
features.
2. TAYLOR-SERIES EXPANSION ALGORITHM
Computer Science & Information Technology (CS & IT) 21
22 Computer Science & Information Technology (CS & IT)
3. INITIAL APPROXIMATION
As stated in the previous section, we need an initial approximation of the inverse of a number, in
order to calculate a more precise approximation using the Taylor-series expansion algorithm, and
according to equation (7), the number of iterations required to obtain an approximation with a
desired precision depends on the initial approximation. So, it is very important to select an
appropriate method for finding an initial approximation. There are different kinds of methods [5]
based on linear approximation, direct lookup tables [7] [8] [11], table lookup followed by
multiplication [4] and polynomial approximations [9]. In our implementation, we choose a
different approach. We employ a piecewise linear approximation for generating the initial
approximation, and we show that one can obtain any desired amount of precision using this
method, without much increase in complexity.
Figure 1: Plot showing the actual and linear Figure 2: Plot showing the values of
approximation values of x-1
, m for x in the range [1,2]
for a = 1 and b = 2
Computer Science & Information Technology (CS & IT) 23
24 Computer Science & Information Technology (CS & IT)
Computer Science & Information Technology (CS & IT) 25
Figure 3: Piecewise linear approximation of x-1
for x in the range [1,2], derived for n = 5
Table I: Piecewise Linear approximation segments
4. ITERATIVE LOGARITHMIC MULTIPLIER
Logarithmic Number System (LNS) based multipliers are a good choice when there is a
possibility of trading accuracy for speed (such as in Digital Signal Processing). The main
advantage of LNS based multipliers is the substitution of multiplication with addition, which is a
much simpler operation in terms of complexity. LNS multipliers can be divided into two
categories [12], one based on methods that use lookup-tables, and the others based on Mitchellโ€™s
algorithm [10]. The major drawback with Mitchellโ€™s algorithm is the error in the product due to
the piecewise linear approximation of the logarithmic curve. The Iterative Logarithmic
Multiplier, as the name suggests, proposes an iterative solution to computer this error term, and
hence generate a better approximation to the product.
26 Computer Science & Information Technology (CS & IT)
The binary representation of a number can be written as
Figure 4: Block diagram of an Figure 5: Architecture of the
Iterative Logarithmic Multiplier proposed squaring unit
Computer Science & Information Technology (CS & IT) 27
5. SQUARING UNIT
28 Computer Science & Information Technology (CS & IT)
6. POWERING UNIT
Figure 6: Flow diagram indicating the operation Figure 7: System implementation
of the powering unit for calculating
up to 12 powers of x
Computer Science & Information Technology (CS & IT) 29
7. CONCLUSION
In this paper, we propose and investigate a new architecture for a floating point division unit. We
show that the Taylor-series expansion algorithm can be used to generate approximations for the
reciprocal of a number up to an arbitrary precision, and analyse the errors for the same. We
propose a new piecewise linear approximation based method to generate the first approximation
required by the Taylor-series expansion algorithm, and present an extensive analysis. We then
present the architecture for a squaring unit derived from the Iterative Logarithmic Multiplier, and
argue that it requires less than 50% hardware, as compared to the Iterative Logarithmic
Multiplier. Finally, we present a cumulative implementation of the powering unit, and discuss
some of the enhancements made in order to further boost its performance. The complete system is
illustrated in Figure 7.
The performance of the system can be improved by pipelining the architecture for the Iterative
Logarithmic Multiplier [12] and the squaring unit, but at the cost of increase in hardware
utilization.
REFERENCES
[1] Tom M. Apostol. Calculus Vol. 1. 2nd ed. 2013, pp. 272โ€“303.
[2] Birne Binegar. โ€œMath 4513: Numerical Analysisโ€. 1998.
URL:https://math.okstate.edu/people/binegar/4513-F98/4513-l01.pdf.
[3] Deepak Bordiya and Lalit Bandil. โ€œComparative Analysis Of Multipliers (serial and parallel with
radix based on booth algorithm)โ€. In: International Journal of Engineering Research and Technology
Vol.2 - Issue 9 (Sept. 2013).
[4] Zhan Guo, Dongdong Chen, Bintian Zhou and Peter Nilsson. โ€œDesign and Implementation of
Reciprocal Unitโ€. In: IEEE (Jan. 2007)
[5] Ankit Khandelwal and Gaurav Agarwal. A Newton Raphson Divider Based on Improved Reciprocal
Approximation Algorithm. Dec. 2006.
[6] Taek-Jun Kwon. โ€œFloating-Point unit design using Taylor-series Expansion algorithmsโ€. PhD thesis.
University of Southern California, 2009.
[7] Albert A. Liddicoat and Michael J. Flynn. โ€œHigh Performance Floating Point Divideโ€. In: Digital
Systems Design, 2001. Proceedings. Euromicro Symposium on. Sept. 2001.
[8] Naofumi Takagi, Masayuki Ito, and Shuzo Yajima. โ€œEfficient Initial Approximation and Fast
Converging Methods for Division and Square Rootโ€. In: ARITH โ€™95 Proceedings of the 12th
Symposium on Computer Arithmetic. 1995.
30 Computer Science & Information Technology (CS & IT)
[9] Jean-Michel Muller, Milos Ercegovac and Arnaud Tisserand. โ€œSimple Seed Architectures for
Reciprocal and Square Root Reciprocalโ€. In: INRIA RR-5720 (2005), p. 25.
[10] J.N. Mitchell. โ€œComputer multiplication and division using binary logarithmsโ€. In: IRE Transactions
on Electronic Computers EC-11 (Aug. 1962), pp. 512โ€“517.
[11] Oskar Mencer, Patrick Hung, Hossam Fahmy and Michael J. Flynn. โ€œFast Division algorithm with a
Small Lookup Tableโ€. In: Circuits, Systems and Computers, 1997. Conference Record. 1997 11th
Asilomar Conference on. 1997.
[12] Patricio Bulicอ, Zdenka Babicอ, Aleksej Avramovicอ.โ€œAn Iterative Logarithmic Multiplierโ€. In:
Elektrotehnisฬ† ki vestnik 77(1) (2010), pp. 25โ€“30.

More Related Content

What's hot (20)

PDF
APPROXIMATING NASH EQUILIBRIUM UNIQUENESS OF POWER CONTROL IN PRACTICAL WSNS
IJCNCJournal
ย 
PDF
Density Based Clustering Approach for Solving the Software Component Restruct...
IRJET Journal
ย 
PDF
Dynamic approach to k means clustering algorithm-2
IAEME Publication
ย 
PDF
Centrality Prediction in Mobile Social Networks
IJERA Editor
ย 
PDF
Datapath
Edward Verenich
ย 
PDF
Spatial correlation based clustering algorithm for random and uniform topolog...
eSAT Publishing House
ย 
PDF
STUDY OF TASK SCHEDULING STRATEGY BASED ON TRUSTWORTHINESS
ijdpsjournal
ย 
PDF
A reduced complexity and an efficient channel
eSAT Publishing House
ย 
DOCX
K means report
Gaurav Handa
ย 
PDF
International Journal of Engineering Research and Development
IJERD Editor
ย 
PPTX
Implement principal component analysis (PCA) in python from scratch
EshanAgarwal4
ย 
PDF
Macromodel of High Speed Interconnect using Vector Fitting Algorithm
ijsrd.com
ย 
PDF
One dimensional vector based pattern
ijcsit
ย 
DOCX
8.clustering algorithm.k means.em algorithm
Laura Petrosanu
ย 
DOC
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
ijscmc
ย 
PPTX
Quantum-Min-Cut/Max-Flow-Based Vertex Importance Ranking
Colleen Farrelly
ย 
PDF
Vol 16 No 2 - July-December 2016
ijcsbi
ย 
PDF
Clustering Algorithms for Data Stream
IRJET Journal
ย 
PPTX
Quantum persistent k cores for community detection
Colleen Farrelly
ย 
PDF
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
ย 
APPROXIMATING NASH EQUILIBRIUM UNIQUENESS OF POWER CONTROL IN PRACTICAL WSNS
IJCNCJournal
ย 
Density Based Clustering Approach for Solving the Software Component Restruct...
IRJET Journal
ย 
Dynamic approach to k means clustering algorithm-2
IAEME Publication
ย 
Centrality Prediction in Mobile Social Networks
IJERA Editor
ย 
Datapath
Edward Verenich
ย 
Spatial correlation based clustering algorithm for random and uniform topolog...
eSAT Publishing House
ย 
STUDY OF TASK SCHEDULING STRATEGY BASED ON TRUSTWORTHINESS
ijdpsjournal
ย 
A reduced complexity and an efficient channel
eSAT Publishing House
ย 
K means report
Gaurav Handa
ย 
International Journal of Engineering Research and Development
IJERD Editor
ย 
Implement principal component analysis (PCA) in python from scratch
EshanAgarwal4
ย 
Macromodel of High Speed Interconnect using Vector Fitting Algorithm
ijsrd.com
ย 
One dimensional vector based pattern
ijcsit
ย 
8.clustering algorithm.k means.em algorithm
Laura Petrosanu
ย 
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
ijscmc
ย 
Quantum-Min-Cut/Max-Flow-Based Vertex Importance Ranking
Colleen Farrelly
ย 
Vol 16 No 2 - July-December 2016
ijcsbi
ย 
Clustering Algorithms for Data Stream
IRJET Journal
ย 
Quantum persistent k cores for community detection
Colleen Farrelly
ย 
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
ย 

Viewers also liked (17)

PDF
Robust Visual Tracking Based on Sparse PCA-L1
csandit
ย 
PDF
A LITERATURE REVIEW ON SEMANTIC WEB โ€“ UNDERSTANDING THE PIONEERSโ€™ PERSPECTIVE
csandit
ย 
PDF
Exploring The Dynamic Integration of Heterogeneous Services
csandit
ย 
PDF
FILE SYNCHRONIZATION SYSTEMS SURVEY
csandit
ย 
PDF
Basic Evaluation of Antennas Used in Microwave Imaging for Breast Cancer Dete...
csandit
ย 
PDF
CREATING DATA OUTPUTS FROM MULTI AGENT TRAFFIC MICRO SIMULATION TO ASSIMILATI...
csandit
ย 
PDF
Key Management Scheme for Secure Group Communication in WSN with Multiple Gr...
csandit
ย 
PPTX
Understanding the Impact of the Social Media on Digital Marketing and E-comme...
Rohit Pawar
ย 
PDF
WIRELESS SENSORS INTEGRATION INTO INTERNET OF THINGS AND THE SECURITY PRIMITIVES
csandit
ย 
PDF
Google Apps
yeissonmena
ย 
PDF
EVALUATION AND STUDY OF SOFTWARE DEGRADATION IN THE EVOLUTION OF SIX VERSIONS...
csandit
ย 
PDF
Explore the Effects of Emoticons on Twitter Sentiment Analysis
csandit
ย 
PPTX
Starbucks Case Study
Rohit Pawar
ย 
PPTX
Facebook & Twitter Analytics
Rohit Pawar
ย 
PPTX
Economic study on TATA Steel 2015
Rohit Pawar
ย 
PPTX
What is Sleeping pralysis and how it effect us ?
Harshit Agarwal
ย 
PPTX
Market based instruments as a policy instrument for environmental problems
Glen Speering
ย 
Robust Visual Tracking Based on Sparse PCA-L1
csandit
ย 
A LITERATURE REVIEW ON SEMANTIC WEB โ€“ UNDERSTANDING THE PIONEERSโ€™ PERSPECTIVE
csandit
ย 
Exploring The Dynamic Integration of Heterogeneous Services
csandit
ย 
FILE SYNCHRONIZATION SYSTEMS SURVEY
csandit
ย 
Basic Evaluation of Antennas Used in Microwave Imaging for Breast Cancer Dete...
csandit
ย 
CREATING DATA OUTPUTS FROM MULTI AGENT TRAFFIC MICRO SIMULATION TO ASSIMILATI...
csandit
ย 
Key Management Scheme for Secure Group Communication in WSN with Multiple Gr...
csandit
ย 
Understanding the Impact of the Social Media on Digital Marketing and E-comme...
Rohit Pawar
ย 
WIRELESS SENSORS INTEGRATION INTO INTERNET OF THINGS AND THE SECURITY PRIMITIVES
csandit
ย 
Google Apps
yeissonmena
ย 
EVALUATION AND STUDY OF SOFTWARE DEGRADATION IN THE EVOLUTION OF SIX VERSIONS...
csandit
ย 
Explore the Effects of Emoticons on Twitter Sentiment Analysis
csandit
ย 
Starbucks Case Study
Rohit Pawar
ย 
Facebook & Twitter Analytics
Rohit Pawar
ย 
Economic study on TATA Steel 2015
Rohit Pawar
ย 
What is Sleeping pralysis and how it effect us ?
Harshit Agarwal
ย 
Market based instruments as a policy instrument for environmental problems
Glen Speering
ย 
Ad

Similar to A FLOATING POINT DIVISION UNIT BASED ON TAYLOR-SERIES EXPANSION ALGORITHM AND ITERATIVE LOGARITHMIC MULTIPLIER (20)

PDF
Fpga based efficient multiplier for image processing applications using recur...
VLSICS Design
ย 
PDF
IRJET-Design and Implementation of LNS based Approximate Multiplier using Mit...
IRJET Journal
ย 
PDF
Implementation and Simulation of Ieee 754 Single-Precision Floating Point Mul...
inventionjournals
ย 
PDF
Iy3116761679
IJERA Editor
ย 
PDF
Implementation of Low-Complexity Redundant Multiplier Architecture for Finite...
ijcisjournal
ย 
PDF
Implementation of an Effective Self-Timed Multiplier for Single Precision Flo...
IRJET Journal
ย 
PDF
SINGLE PRECISION FLOATING POINT MULTIPLIER USING SHIFT AND ADD ALGORITHM
AM Publications
ย 
PDF
Ap32283286
IJERA Editor
ย 
PDF
Design and Implementation of Single Precision Pipelined Floating Point Co-Pro...
Silicon Mentor
ย 
PDF
International Journal of Engineering Research and Development
IJERD Editor
ย 
PDF
A Fast Floating Point Double Precision Implementation on Fpga
IJERA Editor
ย 
PDF
IRJET- A Review on Single Precision Floating Point Arithmetic Unit of 32 Bit ...
IRJET Journal
ย 
PDF
Bs25412419
IJERA Editor
ย 
PDF
Lp2520162020
IJERA Editor
ย 
PDF
Lp2520162020
IJERA Editor
ย 
PDF
Bu33436438
IJERA Editor
ย 
PDF
Bu33436438
IJERA Editor
ย 
PDF
IRJET- Single Precision Floating Point Arithmetic using VHDL Coding
IRJET Journal
ย 
PDF
Comparative study of single precision floating point division using differen...
International Journal of Reconfigurable and Embedded Systems
ย 
PDF
IRJET - Design and Implementation of Double Precision FPU for Optimised Speed
IRJET Journal
ย 
Fpga based efficient multiplier for image processing applications using recur...
VLSICS Design
ย 
IRJET-Design and Implementation of LNS based Approximate Multiplier using Mit...
IRJET Journal
ย 
Implementation and Simulation of Ieee 754 Single-Precision Floating Point Mul...
inventionjournals
ย 
Iy3116761679
IJERA Editor
ย 
Implementation of Low-Complexity Redundant Multiplier Architecture for Finite...
ijcisjournal
ย 
Implementation of an Effective Self-Timed Multiplier for Single Precision Flo...
IRJET Journal
ย 
SINGLE PRECISION FLOATING POINT MULTIPLIER USING SHIFT AND ADD ALGORITHM
AM Publications
ย 
Ap32283286
IJERA Editor
ย 
Design and Implementation of Single Precision Pipelined Floating Point Co-Pro...
Silicon Mentor
ย 
International Journal of Engineering Research and Development
IJERD Editor
ย 
A Fast Floating Point Double Precision Implementation on Fpga
IJERA Editor
ย 
IRJET- A Review on Single Precision Floating Point Arithmetic Unit of 32 Bit ...
IRJET Journal
ย 
Bs25412419
IJERA Editor
ย 
Lp2520162020
IJERA Editor
ย 
Lp2520162020
IJERA Editor
ย 
Bu33436438
IJERA Editor
ย 
Bu33436438
IJERA Editor
ย 
IRJET- Single Precision Floating Point Arithmetic using VHDL Coding
IRJET Journal
ย 
Comparative study of single precision floating point division using differen...
International Journal of Reconfigurable and Embedded Systems
ย 
IRJET - Design and Implementation of Double Precision FPU for Optimised Speed
IRJET Journal
ย 
Ad

Recently uploaded (20)

PPTX
Introduction to web development | MERN Stack
JosephLiyon
ย 
PDF
TEASMA: A Practical Methodology for Test Adequacy Assessment of Deep Neural N...
Lionel Briand
ย 
PDF
IObit Uninstaller Pro 14.3.1.8 Crack for Windows Latest
utfefguu
ย 
PPTX
ERP Systems in the UAE: Driving Business Transformation with Smart Solutions
dheeodoo
ย 
PDF
What Is an Internal Quality Audit and Why It Matters for Your QMS
BizPortals365
ย 
PDF
Mastering VPC Architecture Build for Scale from Day 1.pdf
Devseccops.ai
ย 
PPTX
IObit Driver Booster Pro 12.4-12.5 license keys 2025-2026
chaudhryakashoo065
ย 
PDF
Best Software Development at Best Prices
softechies7
ย 
PDF
CodeCleaner: Mitigating Data Contamination for LLM Benchmarking
arabelatso
ย 
PPTX
IObit Uninstaller Pro 14.3.1.8 Crack Free Download 2025
sdfger qwerty
ย 
PDF
OpenChain Webinar - AboutCode - Practical Compliance in One Stack โ€“ Licensing...
Shane Coughlan
ย 
PPTX
IObit Driver Booster Pro 12 Crack Latest Version Download
pcprocore
ย 
PPTX
Iobit Driver Booster Pro 12 Crack Free Download
chaudhryakashoo065
ย 
PDF
capitulando la keynote de GrafanaCON 2025 - Madrid
Imma Valls Bernaus
ย 
PPTX
Foundations of Marketo Engage - Programs, Campaigns & Beyond - June 2025
BradBedford3
ย 
PDF
Alur Perkembangan Software dan Jaringan Komputer
ssuser754303
ย 
PPTX
IDM Crack with Internet Download Manager 6.42 [Latest 2025]
HyperPc soft
ย 
PDF
Humans vs AI Call Agents - Qcall.ai's Special Report
Udit Goenka
ย 
PPTX
IObit Driver Booster Pro Crack Download Latest Version
chaudhryakashoo065
ย 
DOCX
Zoho Creator Solution for EI by Elsner Technologies.docx
Elsner Technologies Pvt. Ltd.
ย 
Introduction to web development | MERN Stack
JosephLiyon
ย 
TEASMA: A Practical Methodology for Test Adequacy Assessment of Deep Neural N...
Lionel Briand
ย 
IObit Uninstaller Pro 14.3.1.8 Crack for Windows Latest
utfefguu
ย 
ERP Systems in the UAE: Driving Business Transformation with Smart Solutions
dheeodoo
ย 
What Is an Internal Quality Audit and Why It Matters for Your QMS
BizPortals365
ย 
Mastering VPC Architecture Build for Scale from Day 1.pdf
Devseccops.ai
ย 
IObit Driver Booster Pro 12.4-12.5 license keys 2025-2026
chaudhryakashoo065
ย 
Best Software Development at Best Prices
softechies7
ย 
CodeCleaner: Mitigating Data Contamination for LLM Benchmarking
arabelatso
ย 
IObit Uninstaller Pro 14.3.1.8 Crack Free Download 2025
sdfger qwerty
ย 
OpenChain Webinar - AboutCode - Practical Compliance in One Stack โ€“ Licensing...
Shane Coughlan
ย 
IObit Driver Booster Pro 12 Crack Latest Version Download
pcprocore
ย 
Iobit Driver Booster Pro 12 Crack Free Download
chaudhryakashoo065
ย 
capitulando la keynote de GrafanaCON 2025 - Madrid
Imma Valls Bernaus
ย 
Foundations of Marketo Engage - Programs, Campaigns & Beyond - June 2025
BradBedford3
ย 
Alur Perkembangan Software dan Jaringan Komputer
ssuser754303
ย 
IDM Crack with Internet Download Manager 6.42 [Latest 2025]
HyperPc soft
ย 
Humans vs AI Call Agents - Qcall.ai's Special Report
Udit Goenka
ย 
IObit Driver Booster Pro Crack Download Latest Version
chaudhryakashoo065
ย 
Zoho Creator Solution for EI by Elsner Technologies.docx
Elsner Technologies Pvt. Ltd.
ย 

A FLOATING POINT DIVISION UNIT BASED ON TAYLOR-SERIES EXPANSION ALGORITHM AND ITERATIVE LOGARITHMIC MULTIPLIER

  • 1. Natarajan Meghanathan et al. (Eds) : NeCoM, CSITEC - 2016 pp. 19โ€“30, 2016. ยฉ CS & IT-CSCP 2016 DOI : 10.5121/csit.2016.61203 A FLOATING POINT DIVISION UNIT BASED ON TAYLOR-SERIES EXPANSION ALGORITHM AND ITERATIVE LOGARITHMIC MULTIPLIER Riyansh K. Karani, Akash K. Rana, Dhruv H. Reshamwala and Kishore Saldanha Electronics Department, Dwarkadas J. Sanghvi College of Engineering, Mumbai [email protected] [email protected] [email protected] [email protected] ABSTRACT Floating point division, even though being an infrequent operation in the traditional sense, is indis-pensable when it comes to a range of non-traditional applications such as K-Means Clustering and QR Decomposition just to name a few. In such applications, hardware support for floating point division would boost the performance of the entire system. In this paper, we present a novel architecture for a floating point division unit based on the Taylor-series expansion algorithm. We show that the Iterative Logarithmic Multiplier is very well suited to be used as a part of this architecture. We propose an implementation of the powering unit that can calculate an odd power and an even power of a number simultaneously, meanwhile having little hardware overhead when compared to the Iterative Logarithmic Multiplier. KEYWORDS Floating point division, Iterative Logarithmic Multiplier, Taylor-series 1. INTRODUCTION Approximation methods such as Newton-Raphson and Taylor-series can be used to approximate functions where direct computation of these functions is either computationally very expensive or not possible. The Taylor-series expansion is an approximation method that generates a high order polynomial approximation of a function at some value in its domain. The idea is simple; given that a function and its first derivatives are continuous at some point in its domain, the function can be approximated by a polynomial of degree at that point. The higher the order of this polynomial, the better is the approximation. Following on this idea, the reciprocal of a number can be approximated as a very simple Taylor-series polynomial, and thus the problem of division of one number by another is essentially reduced to multiplication of one number and the Taylor-
  • 2. 20 Computer Science & Information Technology (CS & IT) series polynomial of the other [6]. Calculating the terms of the Taylor-series polynomial that approximates the reciprocal of a number involves calculating the powers of itself where calculating each higher power generates a better approximation. Naturally, there is a need of a multiplier unit to calculate these powers, and the performance of the floating point division unit then depends almost solely on the performance of the multiplier unit. There are many popular multiplier architectures in use today [3]. The Iterative Logarithmic Multiplier is one such multiplier architecture [12]. What makes it an attractive choice is that it is highly programmable. The accuracy of the product generated by this multiplier can be precisely controlled, which comes in very handy when one does not need full precision multiplication, like in the case of digital signal processing. But perhaps what is even more important is that because of its inherent nature, its implementation is very hardware efficient when it comes to computation of squares. This is because the mathematical description of the Iterative Logarithmic Multiplier is quite simplified when multiplying a number with itself, as when compared to multiplying two different numbers. Because every even power of a number can be represented as a square of a smaller power of the number every alternate power of is representable as a square of some other number. Hence, the Iterative Logarithmic Multiplier is a very suitable candidate when choosing a multiplier architecture for calculating the terms of the Taylor-series polynomial approximation of a reciprocal. In this paper, we present an architecture for a floating point division unit based on the Iterative Logarithmic Multiplication algorithm, and the Taylor-series expansion algorithm. The proposed architecture is designed to be hardware efficient, as is the requirement when designing architectures for high speed computational units. We start by describing the Taylor-series expansion algorithm in section II, and analyse the approximation errors generated when using this approach. We then introduce the methodology for calculating the reciprocal of a number using the Taylor-series approach, and derive a quantitative measure of the generated error. Since the approach above requires an initial approximation of the reciprocal, in section III, we start by describing and analysing linear approximation as a possible approach. We then build on this to present the piecewise linear approximation based approach that we have employed in our implementation of the floating point division unit. In section IV, we describe the Iterative Logarithmic Multiplier as proposed by Babicอ , Avramovicอ and Bulicอ [12], followed by the discussion and implementation of the proposed squaring unit in section V. Finally, we present the architecture for the proposed powering unit in section VI, and discuss its implementation and features. 2. TAYLOR-SERIES EXPANSION ALGORITHM
  • 3. Computer Science & Information Technology (CS & IT) 21
  • 4. 22 Computer Science & Information Technology (CS & IT) 3. INITIAL APPROXIMATION As stated in the previous section, we need an initial approximation of the inverse of a number, in order to calculate a more precise approximation using the Taylor-series expansion algorithm, and according to equation (7), the number of iterations required to obtain an approximation with a desired precision depends on the initial approximation. So, it is very important to select an appropriate method for finding an initial approximation. There are different kinds of methods [5] based on linear approximation, direct lookup tables [7] [8] [11], table lookup followed by multiplication [4] and polynomial approximations [9]. In our implementation, we choose a different approach. We employ a piecewise linear approximation for generating the initial approximation, and we show that one can obtain any desired amount of precision using this method, without much increase in complexity. Figure 1: Plot showing the actual and linear Figure 2: Plot showing the values of approximation values of x-1 , m for x in the range [1,2] for a = 1 and b = 2
  • 5. Computer Science & Information Technology (CS & IT) 23
  • 6. 24 Computer Science & Information Technology (CS & IT)
  • 7. Computer Science & Information Technology (CS & IT) 25 Figure 3: Piecewise linear approximation of x-1 for x in the range [1,2], derived for n = 5 Table I: Piecewise Linear approximation segments 4. ITERATIVE LOGARITHMIC MULTIPLIER Logarithmic Number System (LNS) based multipliers are a good choice when there is a possibility of trading accuracy for speed (such as in Digital Signal Processing). The main advantage of LNS based multipliers is the substitution of multiplication with addition, which is a much simpler operation in terms of complexity. LNS multipliers can be divided into two categories [12], one based on methods that use lookup-tables, and the others based on Mitchellโ€™s algorithm [10]. The major drawback with Mitchellโ€™s algorithm is the error in the product due to the piecewise linear approximation of the logarithmic curve. The Iterative Logarithmic Multiplier, as the name suggests, proposes an iterative solution to computer this error term, and hence generate a better approximation to the product.
  • 8. 26 Computer Science & Information Technology (CS & IT) The binary representation of a number can be written as Figure 4: Block diagram of an Figure 5: Architecture of the Iterative Logarithmic Multiplier proposed squaring unit
  • 9. Computer Science & Information Technology (CS & IT) 27 5. SQUARING UNIT
  • 10. 28 Computer Science & Information Technology (CS & IT) 6. POWERING UNIT Figure 6: Flow diagram indicating the operation Figure 7: System implementation of the powering unit for calculating up to 12 powers of x
  • 11. Computer Science & Information Technology (CS & IT) 29 7. CONCLUSION In this paper, we propose and investigate a new architecture for a floating point division unit. We show that the Taylor-series expansion algorithm can be used to generate approximations for the reciprocal of a number up to an arbitrary precision, and analyse the errors for the same. We propose a new piecewise linear approximation based method to generate the first approximation required by the Taylor-series expansion algorithm, and present an extensive analysis. We then present the architecture for a squaring unit derived from the Iterative Logarithmic Multiplier, and argue that it requires less than 50% hardware, as compared to the Iterative Logarithmic Multiplier. Finally, we present a cumulative implementation of the powering unit, and discuss some of the enhancements made in order to further boost its performance. The complete system is illustrated in Figure 7. The performance of the system can be improved by pipelining the architecture for the Iterative Logarithmic Multiplier [12] and the squaring unit, but at the cost of increase in hardware utilization. REFERENCES [1] Tom M. Apostol. Calculus Vol. 1. 2nd ed. 2013, pp. 272โ€“303. [2] Birne Binegar. โ€œMath 4513: Numerical Analysisโ€. 1998. URL:https://math.okstate.edu/people/binegar/4513-F98/4513-l01.pdf. [3] Deepak Bordiya and Lalit Bandil. โ€œComparative Analysis Of Multipliers (serial and parallel with radix based on booth algorithm)โ€. In: International Journal of Engineering Research and Technology Vol.2 - Issue 9 (Sept. 2013). [4] Zhan Guo, Dongdong Chen, Bintian Zhou and Peter Nilsson. โ€œDesign and Implementation of Reciprocal Unitโ€. In: IEEE (Jan. 2007) [5] Ankit Khandelwal and Gaurav Agarwal. A Newton Raphson Divider Based on Improved Reciprocal Approximation Algorithm. Dec. 2006. [6] Taek-Jun Kwon. โ€œFloating-Point unit design using Taylor-series Expansion algorithmsโ€. PhD thesis. University of Southern California, 2009. [7] Albert A. Liddicoat and Michael J. Flynn. โ€œHigh Performance Floating Point Divideโ€. In: Digital Systems Design, 2001. Proceedings. Euromicro Symposium on. Sept. 2001. [8] Naofumi Takagi, Masayuki Ito, and Shuzo Yajima. โ€œEfficient Initial Approximation and Fast Converging Methods for Division and Square Rootโ€. In: ARITH โ€™95 Proceedings of the 12th Symposium on Computer Arithmetic. 1995.
  • 12. 30 Computer Science & Information Technology (CS & IT) [9] Jean-Michel Muller, Milos Ercegovac and Arnaud Tisserand. โ€œSimple Seed Architectures for Reciprocal and Square Root Reciprocalโ€. In: INRIA RR-5720 (2005), p. 25. [10] J.N. Mitchell. โ€œComputer multiplication and division using binary logarithmsโ€. In: IRE Transactions on Electronic Computers EC-11 (Aug. 1962), pp. 512โ€“517. [11] Oskar Mencer, Patrick Hung, Hossam Fahmy and Michael J. Flynn. โ€œFast Division algorithm with a Small Lookup Tableโ€. In: Circuits, Systems and Computers, 1997. Conference Record. 1997 11th Asilomar Conference on. 1997. [12] Patricio Bulicอ, Zdenka Babicอ, Aleksej Avramovicอ.โ€œAn Iterative Logarithmic Multiplierโ€. In: Elektrotehnisฬ† ki vestnik 77(1) (2010), pp. 25โ€“30.