Storage device performance prediction with CART models(11)
时间:2025-07-10
时间:2025-07-10
Storage device performance prediction is a key element of self-managed storage systems and application planning tasks, such as data assignment. This work explores the application of a machine learning tool, CART models, to storage device modeling. Our appr
Relative importanc
e
60%40%20%0%
TimeDiffTimeDiffTimeDiffTimeDiffTimeDiffTimeDiffTimeDiffTimeDiffTimeDiffTimeDiff1LBNLBNDiffLBNDiffLBNDiffLBNDiffLBNDiffSize(a)relativeimportancemeasuredonthe9GBAtlas10KBdiskusingcello99aand
cello99c
SeR(b)relativeimportancemeasuredontheRAID5diskarrayusingcello99aandcello99c
Figure4:Relativeimportanceoftherequestdescriptionfeatures.
5.1CalibratingRequest-LevelModels
Thissectiondescribeshowweselectparametervaluesforkandlfortherequest-leveldevicemodels.
Figure4showstherelativeimportanceoftherequestdescriptionfeaturesindeterminingper-requestresponsetimebysettingkto10andlto5.Thefeature’srelativeimportanceismeasuredbyitscontributioninerrorreduction.Thegraphsshowtheimportanceofrequestdescriptionfeaturesmeasuredonbothdevices,trainedontwotraces(cello99aandcello99c).Weuseonlythe rstdayofthetracesandreducethedatasetsizeby90%withuniformsampling.
First,weobservethattherelativeimportanceisworkloaddependent.Asweexpected,forbusytraf- csuchasthatwhichoccurredinthecello99atrace,thequeuingtimedominatestheresponsetime,andthereby,theTimeDifffeaturesaremoreimportant.Ontheotherhand,cello99chassmallresponsetimes,andfeaturesthatcharacterizethedatatransfertime,suchasSizeandRW,havegoodpredictivepowerinmodelingthesingledisk.
Second,weobservethatthemostimporantfeatureshiftsfromTimeDiff8toTimeDiff7wherecom-paringthesingledisktothediskarrayforcello99abecausethequeuingtimebecomeslesssigni cantforthediskarray.Thedistinctionbetweenthetwotraces,however,persists.
Wesetkto10forTimeDiffandlto3forLBNDiffinthesubsequentexperimentssothatwecanmodeldevicebehaviorunderbothtypesofworkloads.
Weshowthemodelaccuracyinpredictingper-requestresponsetimesinFigure5.ThemodelisbuiltfortheAtlas10Kdisk.Thetrainingtraceisthe rstdayofcello99a,andthetestingtraceistheseconddayofthesametrace.Figure5(a)isascatterplot,showingthepredictedresponsetimesagainsttheactualonesforthe rst5,000requests.Mostofthepointsstayclosetothediagonalline,suggestingaccuratepredictionoftherequest-leveldevicemodel.Figure5(b)furthercomparestheresponsetimedistributions.Thelongtailofthedistributioniswellcapturedbytherequest-levelmodel,indicatingthattherequestdescriptionis
…… 此处隐藏:411字,全部文档内容请下载后查看。喜欢就下载吧 ……下一篇:中国食物成分表(全)2010版