Research on Sampling Methods in Data Mining_Hu Wenyu (8)
Published: 2021-06-05
Journal of Computer Research and Development, 2011, 48(1)

References

[1] Vitter J S. Random sampling with a reservoir[J]. ACM Trans on Mathematical Software, 1985, 11(1): 37-57
[2] Cochran W G. Sampling Techniques[M]. 3rd ed. New York: John Wiley & Sons, 1977
[3] Levy P S, Lemeshow S. Sampling of Populations: Methods and Applications[M]. New York: John Wiley & Sons, 1991
[4] Lohr S L. Sampling: Design and Analysis[M]. Pacific Grove, CA: Duxbury Press, 1999
[5] Olken F, Rotem D. Random sampling from B+ trees[C]//Proc of the 15th Int Conf on VLDB. San Francisco: Morgan Kaufmann, 1989: 269-277
[6] Olken F, Rotem D. Sampling from spatial databases[J]. Statistics and Computing, 1995, 5(1): 43-57
[7] Gibbons P B, Matias Y. New sampling-based summary statistics for improving approximate query answers[C]//Proc of ACM SIGMOD 1998. New York: ACM, 1998: 331-342
[8] Acharya S, Gibbons P B, Poosala V. Congressional samples for approximate answering of group-by queries[C]//Proc of the ACM SIGMOD on Management of Data. New York: ACM, 2000: 487-498
[9] Chaudhuri S, Das G, Datar M, et al. Overcoming limitations of sampling for aggregation queries[C]//Proc of ICDE 2001. Los Alamitos, CA: IEEE Computer Society, 2001: 534-542
[10] Gibbons P B. Distinct sampling for highly-accurate answers to distinct values queries and event reports[C]//Proc of VLDB 2001. San Francisco: Morgan Kaufmann, 2001: 541-550
[11] Gibbons P B, Matias Y, Poosala V. Fast incremental maintenance of approximate histograms[J]. ACM Trans on Database Systems, 2002, 27(3): 261-298
[12] Babcock B, Datar M, Motwani R. Sampling from a moving window over streaming data[C]//Proc of the 13th Annual ACM-SIAM Symp on Discrete Algorithms. San Francisco, California: Society for Industrial and Applied Mathematics, 2002: 633-634
[13] Manku G S, Motwani R. Approximate frequency counts over streaming data[C]//Proc of the 28th VLDB Conf. Trondheim, Norway: VLDB Endowment, 2002: 346-357
[14] Cormode G, Muthukrishnan S, Rozenbaum I. Summarizing and mining inverse distributions on data streams via dynamic inverse sampling[C]//Proc of the 31st Int Conf on VLDB. Trondheim, Norway: VLDB Endowment, 2005: 25-36
[15] Provost F, Jensen D, Oates T. Efficient progressive sampling[C]//Proc of the 15th ACM SIGKDD. New York: ...
[16] Jin Cheqing, Qian Weining, Zhou Aoying. Analysis and management of streaming data: A survey[J]. Journal of Software, 2004, 15(8): 1172-1181 (in Chinese) (金澈清, 钱卫宁, 周傲英. 流数据分析与管理综述[J]. 软件学报, 2004, 15(8): 1172-1181)
[17] Haas P J, Swami A N. Sequential sampling procedures for query size estimation[C]//Proc of the ACM SIGMOD 1992. New York: ACM, 1992: 341-350
[18] Last M. Improving data mining utility with projective sampling[C]//Proc of the 15th ACM SIGKDD Int Conf on KDD. New York: ACM, 2009: 487-496
[19] Babcock B, Chaudhuri S, Das G. Dynamic sample selection for approximate query processing[C]//Proc of ACM SIGMOD 2003. New York: ACM, 2003: 539-550
[20] Brown P G, Haas P J. Techniques for warehousing of sample data[C]//Proc of the 22nd ICDE. Los Alamitos, CA: IEEE Computer Society, 2006: 6
[21] Chaudhuri S, Das G, Narasayya V, et al. Optimization-based approach for approximate answering of aggregate queries[C]//Proc of ACM SIGMOD 2001. New York: ACM, 2001: 295-306
[22] Thompson Steven K, Seber George A F. Adaptive Sampling[M]. New York: John Wiley & Sons, 1996
[23] Olken F. Random sampling from database[D]. Berkeley: University of California, 2005
[24] Palmer C, Faloutsos C. Density biased sampling: An improved method for data mining and clustering[C]//Proc of ACM SIGMOD 2000. New York: ACM, 2000: 82-92
[25] Kollios G, Gunopoulos D, Koudas N, et al. Efficient biased sampling for approximate clustering and outlier detection in large datasets[J]. IEEE Trans on Knowledge and Data Engineering, 2003, 15(5): 1170-1187
[26] Cormode G, Muthukrishnan S. Summarizing and mining skewed data streams[C]//Proc of the 5th SIAM Int Conf on Data Mining. Newport Beach, USA: Society for Industrial and Applied Mathematics, 2005: 12
[27] Zhou Shuigeng, Zhou Aoying, et al. A fast density-based clustering algorithm[J]. Journal of Computer Research and Development, 2000, 37(11): 1287-1292 (in Chinese) (周水庚, 周傲英, 等. 一种基于密度的快速聚类算法[J]. 计算机研究与发展, 2000, 37(11): 1287-1292)
[28] Toivonen H. Sampling large databases for association rules[C]//Proc of the 22nd VLDB. San Francisco: Morgan Kaufmann, 1996: 134-145
[29] Li W, Gao X, Zhu Y, et al. On the small sample performance of boosted classifiers[C]//Proc of the IEEE Computer Society Conf on CVPR. Los Alamitos, CA: IEEE ...