© 2001 Kluwer Academic Publishers. Manufactured in The Ne(10)

Posted: 2021-06-07

Abstract. A data cube is a popular organization for summary data. A cube is simply a multidimensional structure that contains in each cell an aggregate value, i.e., the result of applying an aggregate function to an underlying relation. In practical situat

BARBARÁ AND WU

effects, all the possible $(k-1)$-factor effects, and so on up to the 1-factor effects and the mean ($\gamma$). For example, $\gamma^{A}_{i}$ is a one-factor effect, $\gamma^{AB}_{ij}$ is a two-factor effect which shows the dependency within the distributions of the associated attributes $A$, $B$, and $\gamma^{ABC}_{ijk}$ is a three-factor effect which shows the dependency within the distributions of the associated attributes $A$, $B$, $C$. Note that the size of possible high-factor effects can be very large (for $\gamma^{ABC}_{ijk}$, the size is $I \times J \times K$).

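\[
\begin{aligned}
\log y_{ijkl} ={}& \gamma + \gamma^{A}_{i} + \gamma^{B}_{j} + \gamma^{C}_{k} + \gamma^{D}_{l}
 + \gamma^{AB}_{ij} + \gamma^{AC}_{ik} + \gamma^{AD}_{il} + \gamma^{BC}_{jk} \\
 &+ \gamma^{BD}_{jl} + \gamma^{CD}_{kl} + \gamma^{ABC}_{ijk} + \gamma^{ABD}_{ijl}
 + \gamma^{ACD}_{ikl} + \gamma^{BCD}_{jkl} + \gamma^{ABCD}_{ijkl}
\end{aligned}
\tag{2}
\]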
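To make the structure of Eq. (2) concrete, the following minimal sketch enumerates every term of the saturated model for a set of dimensions, together with the number of values each effect holds before any constraints are applied. The function name `loglinear_terms` and the cardinalities used are hypothetical and chosen only for illustration.

```python
from itertools import combinations
from math import prod


def loglinear_terms(dim_sizes):
    """Return (effect_label, number_of_values) for each term of the saturated model.

    dim_sizes maps a dimension name to its cardinality, e.g. {"A": 2, "B": 3}.
    The empty combination of dimensions is the overall mean, labelled "mean".
    """
    names = list(dim_sizes)
    terms = []
    for order in range(len(names) + 1):          # 0-factor, 1-factor, ...
        for combo in combinations(names, order):
            label = "".join(combo) or "mean"
            size = prod(dim_sizes[d] for d in combo)  # empty product is 1
            terms.append((label, size))
    return terms


# Hypothetical cardinalities for the four dimensions A, B, C, D.
sizes = {"A": 2, "B": 3, "C": 4, "D": 5}
terms = loglinear_terms(sizes)
for label, size in terms:
    print(label, size)
```

For four dimensions this lists the 16 terms of Eq. (2), from the mean up to the single four-factor effect, whose size is the product of all four cardinalities.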

There are also some linear constraints to overcome the overparameterization, as shown in Eq. (3).

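\[
\begin{aligned}
\gamma^{A}_{.} = \gamma^{B}_{.} = \gamma^{C}_{.} = \gamma^{D}_{.} &= 0 \\
\gamma^{AB}_{i.} = \gamma^{AB}_{.j} = \gamma^{AC}_{i.} = \gamma^{AC}_{.k} = \cdots = \gamma^{CD}_{.l} &= 0 \\
&\;\;\vdots \\
\gamma^{ABCD}_{ijk.} = \gamma^{ABCD}_{ij.l} = \gamma^{ABCD}_{i.kl} = \gamma^{ABCD}_{.jkl} &= 0
\end{aligned}
\tag{3}
\]

where a dot “.” means that the parameter has been summed over the index (for example, $\gamma^{AB}_{i.} = \sum_{j=0}^{J-1} \gamma^{AB}_{ij}$). In short, the constraints specify that the loglinear parameters sum to 0 over all indices.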
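As a hedged aside, a short parameter count illustrates why sum-to-zero constraints of this kind remove the overparameterization. The count assumes dimension sizes $n_A = I$, $n_B = J$, $n_C = K$, $n_D = L$; the symbol $L$ for the cardinality of $D$ is an assumption introduced here. Without constraints the model has one value per index combination of every effect; with the constraints, an effect over a set $S$ of dimensions keeps only $\prod_{d \in S}(n_d - 1)$ free values.

```latex
\underbrace{\sum_{S \subseteq \{A,B,C,D\}} \prod_{d \in S} n_d
  = \prod_{d} (1 + n_d)}_{\text{unconstrained parameters}}
\qquad\text{versus}\qquad
\underbrace{\sum_{S \subseteq \{A,B,C,D\}} \prod_{d \in S} (n_d - 1)
  = \prod_{d} \bigl(1 + (n_d - 1)\bigr) = I\,J\,K\,L}_{\text{free parameters under the constraints}}
```

Under this count the constrained saturated model has exactly as many free parameters as the table has cells.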

There are two approaches to estimate the model coefficients. One is the iterative proportional fitting method, based on solving the corresponding likelihood equations. This method can always get the precise solutions, but it needs many iterative steps over the data.
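The idea of iterative proportional fitting can be sketched on the simplest possible case. The example below is an illustration under stated assumptions, not the procedure used in the paper: it fits only the two-way independence model (main effects of $A$ and $B$, no $AB$ interaction) by alternately rescaling rows and columns until the fitted margins match the observed ones. The function name `ipf_independence`, the iteration count, and the sample counts are hypothetical, and all cell values are assumed to be strictly positive.

```python
def ipf_independence(table, iterations=10):
    """Fit the two-way independence model by iterative proportional fitting.

    table is a list of rows with strictly positive values. The fitted table
    reproduces the observed row totals and column totals.
    """
    rows, cols = len(table), len(table[0])
    row_tot = [sum(r) for r in table]
    col_tot = [sum(table[i][j] for i in range(rows)) for j in range(cols)]

    fitted = [[1.0] * cols for _ in range(rows)]  # start from a flat table
    for _ in range(iterations):
        # Step 1: rescale every row so the fitted row totals match.
        for i in range(rows):
            s = sum(fitted[i])
            fitted[i] = [v * row_tot[i] / s for v in fitted[i]]
        # Step 2: rescale every column so the fitted column totals match.
        for j in range(cols):
            s = sum(fitted[i][j] for i in range(rows))
            for i in range(rows):
                fitted[i][j] *= col_tot[j] / s
    return fitted


# Hypothetical 2 x 2 table of counts.
counts = [[10.0, 20.0], [30.0, 40.0]]
fitted = ipf_independence(counts)
print(fitted)
```

In this two-way case the procedure settles after a single pass; models with higher-order terms cycle over more margins and generally need repeated passes over the data, which is the cost the text refers to.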

The other method is to compute the coefficients from the values directly. The coefficients corresponding to any group-by $G$ are obtained by subtracting from the average $l$ value at group-by $G$ all the coefficients from higher level group-bys. For instance, Eq. (4) shows the parameters in a 4-dimensional table.

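\[
\begin{aligned}
\gamma &= l_{....} \\
\gamma^{A}_{i} &= l_{i...} - \gamma \\
&\;\;\vdots \\
\gamma^{AB}_{ij} &= l_{ij..} - \gamma^{A}_{i} - \gamma^{B}_{j} - \gamma \\
\gamma^{ABC}_{ijk} &= l_{ijk.} - \gamma^{AB}_{ij} - \gamma^{AC}_{ik} - \gamma^{BC}_{jk} - \gamma^{A}_{i} - \gamma^{B}_{j} - \gamma^{C}_{k} - \gamma \\
&\;\;\vdots
\end{aligned}
\tag{4}
\]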

where $l_{....}$ is the grand mean or average (note that a “·” denotes an aggregation along that dimension) and $l_{i...}$ is the mean over all values along the $i$th member of dimension $A$. From $l_{....}$ and $l_{i...}$, $\gamma^{A}_{i}$ denotes how much the average of the values along the $i$th member of dimension $A$ differs from the overall average.
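The subtraction scheme of Eq. (4) can be sketched for a two-dimensional table, where it reduces to the grand mean, the row and column deviations, and one interaction term per cell. This is a minimal illustration: it assumes that $l$ is the natural logarithm of a strictly positive cell value, and the function name `loglinear_2d` and the sample table are hypothetical.

```python
from math import exp, log


def loglinear_2d(table):
    """Compute saturated loglinear coefficients of a two-way table from log means.

    Returns (g, gA, gB, gAB): the mean, the two one-factor effects and the
    two-factor effect, obtained by successive subtraction as in Eq. (4).
    """
    n_rows, n_cols = len(table), len(table[0])
    logs = [[log(v) for v in row] for row in table]

    grand = sum(sum(r) for r in logs) / (n_rows * n_cols)        # l_..
    row_mean = [sum(logs[i]) / n_cols for i in range(n_rows)]    # l_i.
    col_mean = [sum(logs[i][j] for i in range(n_rows)) / n_rows
                for j in range(n_cols)]                          # l_.j

    g = grand
    gA = [row_mean[i] - g for i in range(n_rows)]
    gB = [col_mean[j] - g for j in range(n_cols)]
    gAB = [[logs[i][j] - gA[i] - gB[j] - g for j in range(n_cols)]
           for i in range(n_rows)]
    return g, gA, gB, gAB


# Hypothetical table of strictly positive cell values.
table = [[1.0, 2.0, 4.0], [8.0, 16.0, 64.0]]
g, gA, gB, gAB = loglinear_2d(table)

# With every term retained, each cell is recovered from its coefficients.
rebuilt = [[exp(g + gA[i] + gB[j] + gAB[i][j]) for j in range(3)]
           for i in range(2)]
print(rebuilt)
```

Coefficients computed this way sum to zero over each index, and with all terms kept the reconstruction is exact by construction.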

Sarawagi et al. (1998) present fast computation techniques that make this approach feasible for large sets, and although this method does not give precise solutions, its faster
