• Aucun résultat trouvé

Effective arithmetic in finite fields based on Chudnovsky's multiplication algorithm

N/A
N/A
Protected

Academic year: 2021

Partager "Effective arithmetic in finite fields based on Chudnovsky's multiplication algorithm"

Copied!
6
0
0

Texte intégral

(1)

HAL Id: hal-01260806

https://hal.archives-ouvertes.fr/hal-01260806

Submitted on 1 Feb 2016

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Effective arithmetic in finite fields based on Chudnovsky’s multiplication algorithm

Kévin Atighehchi, Stéphane Ballet, Alexis Bonnecaze, Robert Rolland

To cite this version:

Kévin Atighehchi, Stéphane Ballet, Alexis Bonnecaze, Robert Rolland. Effective arithmetic in finite fields based on Chudnovsky’s multiplication algorithm. Comptes rendus de l’Académie des sciences.

Série I, Mathématique, Elsevier, 2016, 354, pp.137-141. �10.1016/j.crma.2015.12.001�. �hal-01260806�

(2)

Contents lists available atScienceDirect

C. R. Acad. Sci. Paris, Ser. I

www.sciencedirect.com

Number theory/Computer science

Effective arithmetic in finite fields based on Chudnovsky’s multiplication algorithm

Arithmétique effective dans les corps finis basée sur l’algorithme de multiplication de Chudnovsky

Kévin Atighehchia,Stéphane Balletb,Alexis Bonnecazeb,Robert Rollandb

aAix-MarseilleUniversité,Laboratoired’informatiquefondamentaledeMarseille,case901,13288Marseillecedex9,France bAix-MarseilleUniversité,InstitutdemathématiquesdeMarseille,case930,13288Marseillecedex9,France

a r t i c l e i n f o a b s t r a c t

Articlehistory:

Received22September2015

Acceptedafterrevision1December2015 Availableonlinexxxx

PresentedbytheEditorialBoard

ThankstoanewconstructionoftheChudnovskyandChudnovskymultiplicationalgorithm, wedesignefficientalgorithmsforboththeexponentiationandthemultiplicationinfinite fields.Theyare tailoredtohardwareimplementationandtheyallowcomputationstobe parallelized,whilemaintainingalownumberofbilinearmultiplications.

©2015Académiedessciences.PublishedbyElsevierMassonSAS.All rights reserved.

r é s um é

À partir d’une nouvelle construction de l’algorithme de multiplication de Chudnovsky et Chudnovsky, nous concevons des algorithmes efficaces pour la multiplication et l’exponentiation dans lescorps finis. Ils sont adaptés àune implémentation matérielle etsontparallélisables,toutengardantunnombredemultiplicationsbilinéairestrèsbas.

©2015Académiedessciences.PublishedbyElsevierMassonSAS.All rights reserved.

1. Introduction

Multiplication infinitefieldsisa fundamentaloperation inarithmetic andfindingefficientmultiplication methodsre- mainsatopicalissue.Letq beaprimepower,Fqthefinitefieldwithq elementsandFqn thedegreenextensionofFq.If B= {e1,. . . ,en}isabasisofFqn overFqthenforx=n

i=1xieiand y=n

i=1yiei,wehavetheproduct z=xy=

n

h=1 zheh=

n

h=1

n i,j=1

ti jhxixj

eh, (1)

whereeiej=n

h=1ti jheh,ti jh∈Fq beingsomeconstants. Thecomplexity ofamultiplicationalgorithm inFqn dependson thenumberofmultiplications andadditionsinFq.Thereexisttwotypesofmultiplications inFq:thescalarmultiplication

E-mailaddresses:[email protected](K. Atighehchi),[email protected](S. Ballet),[email protected](A. Bonnecaze), [email protected](R. Rolland).

http://dx.doi.org/10.1016/j.crma.2015.12.001

1631-073X/©2015Académiedessciences.PublishedbyElsevierMassonSAS.All rights reserved.

(3)

JID:CRASS1 AID:5641 /FLA Doctopic: Number theory [m3G; v1.172; Prn:15/01/2016; 16:07] P.2 (1-5) 2 K. Atighehchi et al. / C. R. Acad. Sci. Paris, Ser. I•••(••••)••••••

andthebilinear multiplication.The scalarmultiplicationisthemultiplication bya constant(in Fq) thatdoesnot depend on the elements of Fqn that are multiplied. The bilinear multiplication is a multiplication of elements that depend on the elementsofFqn that aremultiplied. Thebilinearcomplexity isindependentofthechosen representationofthefinite field. Forexample,thedirectcalculationof z=(z1,. . . ,zn)using(1) requiresn2 non-scalarmultiplications xixj,n3 scalar multiplications,andn3nadditions.

Moreprecisely,themultiplicationoftwoelementsofFqn isanFq-bilinearapplicationfromFqn×Fqn ontoFqn.Then,it can be consideredasan Fq-linearapplicationfromthetensorproduct FqnFqFqn ontoFqn.Consequently,it canalsobe consideredasanelementT ofFqnFqFqnFqFqn,wheredenotesthedual.Set

T= r

i=1

xiyici, (2)

wherether elementsxi aswell astherelements yi areinthedualFqn ofFqn,whiletherelementsci areinFqn.The followingholdsforanyx,y∈Fqn:x·y=r

i=1xi(x)yi(y)ci.Thedecomposition(2)isnotunique.

Definition1.1.AbilinearmultiplicationalgorithmUisanexpression

x·y= r

i=1

xi(x)yi(y)ci.

ThenumberrofsummandsinthisexpressioniscalledthebilinearcomplexityofthealgorithmU andisdenotedby μ(U). Definition1.2.The minimal numberofsummands ina decompositionof thetensor T ofthe multiplicationis calledthe bilinearcomplexityofthemultiplicationandisdenotedby μq(n):

μq(n)=min

U μ(U),

whereU isrunningoverallbilinearmultiplicationalgorithmsinFqn overFq.

Thebilinearcomplexity ofthemultiplicationinFqn overFq hasbeenwidelystudied.Inparticular,itwasprovedin[2]

thatitisuniformlylinearwithrespecttothedegreenoftheextension.ThisfollowsfromtheChudnovskyandChudnovsky multiplication algorithm (CCMA). This clever construction was originally introduced in 1987 in [3] and is based on the interpolationonalgebraiccurves.

Thereisbenefithavingalowbilinearcomplexitywhenconsideringhardwareimplementationsmainlybecauseitreduces thenumberofgatesinthecircuit.Inthisnote,weconsiderthreemodels.

– Thenon-scalarmodel(denotedNS),inwhichonlythebilinearcomplexityistakenintoaccountanditisassumedthat all scalaroperationsare free.Indeed,thismodeldoesnot reflectthereality and, sincethe bilinearcomplexity isnot the whole complexity ofthe algorithm,the complexity ofthelinear partofthe algorithm shouldalso be takeninto account.

– ThemodelS1,whichtakesintoaccountthenumberofmultiplicationswithoutdistinguishingbetweenthebilinearones andthescalarones.

– ThemodelS2,whichtakesintoaccountalloperations(multiplicationsandadditions)inFq.

Notice that so far, practical implementations of multiplication algorithms over finite fieldshave failed to simultaneously optimizethenumberofscalarmultiplications,additions,andbilinearmultiplications.

Regarding exponentiationalgorithms, theuse ofa normalbasis isofinterest becausethe qth powerofan element is justa cyclicshiftofits coordinates.Aremaining question ishowtoimplementmultiplication efficientlyinordertohave simultaneouslyfastmultiplicationandfastexponentiation.In2000,Gao etal.[6]showedthatfastmultiplicationmethods canbeadaptedtonormalbasesconstructedwithGaussperiods.TheyshowthatifFqn isrepresentedbyanormalbasisover Fq generatedby aGauss periodof type (n,k),themultiplication inFqn canbe computedwithO

nklognklog lognk and theexponentiationwithO

n2klogklog lognk

operationsinFq (q beingsmall).Thisresultisvaluablewhenkisbounded.

However,inthegeneralcase,kisupper-boundedbyO

n3log2nq .

In2009, CouveignesandLercierconstructedin[5,Theorem4]twofamilies ofbasis (calledellipticandnormalelliptic) forfinitefieldextensions,fromwhichtheyobtainedamodeldefinedasfollows.Witheverycouple(q,n),theyassociated a model,(q,n),ofthedegree-nextension ofFq,such thatthefollowingholds: thereisapositive constant K suchthat thefollowingaretrue:

– elements in Fqn are represented by vectors for which the number of components in Fq is upper bounded by Kn(logn)2log(logn)2;

(4)

– thereexistsanalgorithmthatmultipliestwoelementsattheexpenseofKn(logn)4|log(logn)|3 multiplicationsinFq; – exponentiationbyqconsistsofacircularshiftofthecoordinates.

Therefore,foreachextensionoffinitefield,theyshowthatthereexistsamodelthatallowsbothfastmultiplicationand fastapplicationoftheFrobeniusautomorphism.Theirmodelhastheadvantageofexistingforallextensions.However,the bilinear complexity of their algorithm is not competitive comparedwith thebest known methods, aspointedout in [5, Section4.3.4].Indeed,itisclearthatsuchamodelrequiresatleastKn(logn)2(log(logn))2 bilinearmultiplications.

Notethathere,theefficiencyofthealgorithmsisdescribedintermsofparalleltime(depthofthecircuit,innumberof multiplications),numberofprocessors(width),andtotalnumberofmultiplications(size).

Thisarticledescribesthemaintheoretical resultsofamoredetailedforthcomingarticle,whereaneffectiveimplemen- tationforthecaseF1613 ispresented(forapreliminaryversion,see[1]).

2. Newresults

Weproposeanothermodelwiththefollowingcharacteristics:

– ourmodelisbasedonCCMA,thusthemultiplicationalgorithmhasabilinearcomplexityinO(n),whichisoptimal;

– ourmodel istailored toparallel computation. Hence, the computation time usedto perform amultiplication or any exponentiationcan easilybe reducedwithan adequatenumber ofprocessors.Since ourmethodhasa bilinear com- plexityofmultiplicationinO(n),itcanbeparallelizedtoobtainaconstanttimecomplexityusingO

n

processors.The previousaforementionedworks([6]and[5]) donotgive anyparallelalgorithm(suchanalgorithmismoredifficultto conceivethanaserialone);

– exponentiationby qisacircularshiftofthecoordinatesandcanbeconsideredfree.Thus,efficientparallelizationcan bedonewhendoingexponentiation;

– the scalar complexity of our exponentiation algorithm is reduced, compare to a basic exponentiation using CCMA, thankstoa suitablebasis representationoftheRiemann–RochspaceL(2D) inthesecondevaluationmap.Morepre- cisely,thenormalbasisrepresentationoftheresidueclassfieldiscarriedintheassociatedRiemann–RochspaceL(D), andtheexponentiationbyq consistsofacircularshiftofthenfirstcoordinatesofthevectorslying intheRiemann–

RochspaceL(2D);

– ourmodelusestheCoppersmith–Winograd[4]method(denotedCW)oranyvariantsthereoftoimprovematrixprod- uctsandtodiminishthenumberofscalaroperations.Thisimprovementisparticularlyefficientforexponentiation.

Theorem2.1.Inthenon-scalarmodelNS,thereexistmultiplicationandexponentiationalgorithmsinFqnsuchthat:

themultiplicationisdoneinparalleltimeinO 1

multiplicationsinFqwithO n

processors,foratotalinO n

multiplications;

exponentiationisdoneinparalleltimeinO logn

multiplicationsinFqwithO

n2/log2n

processors,foratotalinO

n2/logn multiplications.

When considering models S1 andS2,two cases can be distinguished forthe multiplication complexity.We might be interested either in thecomplexity ofone multiplication or inthe average (amortized) complexity of one multiplication whenmanymultiplicationsaredonesimultaneously.Regardingexponentiation,awiseuseofCWmethodallowscomplexity tobeimproved.

Theorem2.2.InthemodelS1,thereexistmultiplicationandexponentiationalgorithmsinFqnsuchthat:

multiplication:

a) onemultiplicationisdoneinparalleltimeinO 1

multiplicationsinFqwithO n2

processors,foratotalinO n2

multiplica- tions;

b) intheamortizedsense,theparalleltimeisinO 1

multiplicationsinFq withO n1+

processors,foratotalinO n1+ multiplicationswherethevalueof isapproximately0.38forthebestknownmatrixproductmethods;

exponentiation isdonein a parallel timeofO logn

multiplications in Fq with O

n2+/log2n

processors, fora totalin O

n2+log12n

multiplications.

Theorem2.3.InthemodelS2,thereexistmultiplicationandexponentiationalgorithmsinFqnsuchthat:

multiplication:

a) onemultiplicationisdoneinparalleltimeinO logn

operationsinFqwithO

n2/logn

processors,foratotalinO n2 operations;

b) intheamortizedsense,theparalleltimeisinO logn

operationsinFqwithO

n1+/logn

processors,foratotalinO n1+ operations;recallthatthevalueof isapproximately0.38forthebestmatrixproductmethods;

(5)

JID:CRASS1 AID:5641 /FLA Doctopic: Number theory [m3G; v1.172; Prn:15/01/2016; 16:07] P.4 (1-5) 4 K. Atighehchi et al. / C. R. Acad. Sci. Paris, Ser. I•••(••••)••••••

exponentiation is done in a parallel timeof O log2n

operations in Fq with O

n2+/log1+2n

processors, fora total in O

n2+log12n

operations.

2.1. Multiplicationandexponentiationalgorithms

Let F/Fq be an algebraic functionfield overthefinitefield Fq ofgenus g(F).Wedenote by N1(F/Fq) thenumberof places ofdegreeone ofF overFq.IfD isadivisor,L(D)denotes theRiemann–Rochspaceassociatedwith D.Wedenote by FQ theresidueclass fieldoftheplace Q whichisisomorphic toFqdeg(Q),wheredeg(Q) isthedegreeoftheplace Q. Thefollowingtheoremthatmakeseffectivetheoriginalalgorithmgroupssomeresultsof[2].

Theorem2.4.LetF/Fqbeanalgebraicfunctionfieldofgenusg(F)definedoverFqandn aninteger.Letussupposethatthereexists aplaceQ ofdegreen.

Then,ifN1(F/Fq)>2n+2g2thereisaneffectivedivisorD ofdegreen+g1suchthat:

(i) Q isnotinthesupportofD, (ii) theevaluationmapE definedby

E: L(D)FQ

ff(Q)

isanisomorphismofvectorspacesoverFq,

(iii) thereexist2n+g1placesofdegreeonePiwhicharenotinthesupportofD suchthatthemulti-evaluationmapT definedby T:L(2D)

Fq

2n+g1

f

f(P1) , . . . ,f

P2n+g1

isanisomorphism.

2.1.1. Strategyofimplementation

The constructionofthealgorithmisbasedon thechoiceoftheplace Q ofdegreen,theeffectivedivisorD ofdegree n+g1,thebasesofspacesL(D)andL(2D),andthebasisoftheresidueclassfield FQ oftheplace Q.TheplaceQ of degreenislyingaboveanormalprimitivepolynomialinFq[X],whichistotallydecomposedinthealgebraicfunctionfield F/Fq.

Astheresidueclassfield FQ oftheplaceQ isisomorphictothefinitefieldFqn,weidentifyFqn toFQ.Indeed,deg(D)= n+g1,dim(DQ)=0 yetL(DQ)=Ker(E).Inparticular,we chooseforbasisofL(D),thereciprocalimage BD of thebasisBQ =1,. . . ,φn)of FQ bytheevaluationmapE,namelyBD=(E11),. . . ,E1n)).

NotethatasthedivisorDisaneffectivedivisor,wehaveL(D)L(2D).LetP bethemapfromL(2D)toL(2D)defined in thefollowing way:if fL(2D) then f(Q) isin theresiduefield FQ ofthe place Q; define P(f)= JE1

f(Q) , where JistheinjectionmapfromL(D)intoL(2D).Then P isalinearmapfromL(2D)intoL(2D)whoseimageisL(D). Moreprecisely, P isaprojectionfromL(2D)ontoL(D).LetMbethekernelof P.ThenL(2D)=L(D)M.

2.1.2. ProductoftwoelementsinFqn

Let x=(x1,. . . ,xn) and y=(y1,. . . ,yn) be two elements of Fqn given by their components over Fq relative to the chosenbasisBQ.Accordingtothepreviousnotation,wecanconsiderthatxandy areidentifiedtothefollowingelements ofL(D):

fx= n

i=1

xifi and fy= n

i=1

yifi.

Wewillconsiderthatxandyarerespectivelytheelements fxand fyofL(2D)wherethen+g1 lastcomponentsare 0.

Nowitisclearthatknowingxor fx bytheircoordinatesisthesamething.

DenotetheHadamardproductin Fq

2n+g1 by:

(u1, . . . ,u2n+g1) (v1, . . . ,v2n+g1)=(u1v1, . . . ,u2n+g1v2n+g1).

Theorem2.5.Theproductofx byy issuchthat fxy=P

T1

T(fx) T(fy) .

Références

Documents relatifs

Unit´e de recherche INRIA Lorraine, Technopˆole de Nancy-Brabois, Campus scientifique, ` NANCY 615 rue du Jardin Botanique, BP 101, 54600 VILLERS LES Unit´e de recherche INRIA

Some of these bounds turn out to be asymptotically optimal when g → ∞ , meaning that they converge to the lower bound from the generalized Brauer–Siegel theorem for function

Faster polynomial multiplication over finite fields using cyclotomic coefficient rings..

It is also possible to give variants of the new multiplication algorithms in which Bluestein’s transform is replaced by a different method for converting DFTs to convolutions, such

Pollard did not analyze the asymptotic complexity of his method, but it can be shown that its recursive use for the arithmetic

In Section IV we focus on genus 3 curves, following the same steps as for genus 2, we provide a divisor splitting algorithm, necessary addition formulas on the Jacobian, complexity

So far, the strategy to obtain upper bounds for bilinear complexity of multiplication in F q n over F q , has always been to apply algorithms of type Chudnovsky on infinite

Therefore, we note that so far, practical implementations of multiplication algorithms of type Chudnovsky over finite fields have failed to simultaneously optimize the number of