Estimating the quality of data in relational databases(4)

时间:2025-04-04

3SoundnessandCompletenessasMeasuresofData

Quality

Wede netwomeasuresofdataqualitythataregeneralenoughtoencompassmostexistingmeasuresandaspectsofdataquality[5,19].Thebasicideasunderlyingthesemeasureswere rststatedin[7].Inthatpapertheauthorsuggestedthatdeclarationsoftheportionsofthedatabasethatareknowntobeperfectmodelsoftherealworld(andtherebytheportionsthatarepossiblyimperfect)beincludedinthede nitionofeachdatabase.Withthisinformation,thedatabasesystemcanqualifytheaccuracyoftheanswersitissuesinresponsetoqueries:eachanswerisaccompaniedbystatementsthatde netheportionsoftheanswerthatareguaranteedtobeperfect.Thisapproachusesviewstospecifytheportionsofthedatabaseortheportionsofanswersthatareperfectmodelsoftherealworld.

Morespeci cally,thisapproachinterpretsinformationquality,whichittermsintegrity,asacombinationofsoundnessandcompleteness.Adatabaseviewissoundifitincludesonlyinformationthatoccursintherealworld;adatabaseviewiscompleteifitincludesalltheinformationthatoccursintherealworld.Hence,adatabaseviewhasintegrity,ifitincludesthewholetruth(completeness)andnothingbutthetruth(soundness).Aprototypedatabasesystemthatisbasedontheseideasisdescribedin[10].Theseideaswerefurtherdevelopedin[9]andaresummarizedbelow.

GivenadatabaseschemeD,weassumetheexistenceofahypotheticaldatabaseinstanced0thatcapturesperfectlythatportionoftherealworldthatismodeledbyD(theidealortruedatabase).Inaddition,weassumeoneormoreactualinstancesdi(i≥1).Theactualinstancesareconsideredapproximationsoftheidealinstanced0.

GivenaviewV,wedenotebyv0itsextensionintheidealdatabased0(theidealortrueextensiontoV)andwedenotebyviitsextensionintheactualdatabasedi.Again,theextensionsviareapproximationsoftheidealextensionv0.

ConsiderviewV,itsidealextensionv0,andanapproximationv.Ifv v0,thenvisacompleteextension.Ifv v0,thenvisasoundextension.Obviously,anextensionwhichissoundandcompleteistheidealextension.

Withthesede nitions,eachviewextensioniseithercompleteorincomplete,andeithersoundornonsound.Wenowre nethesede nitionsbyassigningeachextensionavaluethatdenoteshowwellitapproximatestheidealextension.Weshalltermthisvaluethegoodnessoftheextension.Werequirethatthegoodnessofeachextensionbeavaluebetween0and1,thatthegoodnessoftheidealextensionbe1,andthatthegoodnessofextensionsthatareentirelydisjointfromtheidealextensionbe0.Formally,agoodnessmeasureisafunctiongonthesetofallpossibleextensionsthatsatis es

v:g(v)∈[0,1]

v:v∩v0= = g(v)=0

g(v0)=1

…… 此处隐藏:413字,全部文档内容请下载后查看。喜欢就下载吧 ……
Estimating the quality of data in relational databases(4).doc 将本文的Word文档下载到电脑

精彩图片

热门精选

大家正在看

× 游客快捷下载通道(下载后可以自由复制和排版)

限时特价:7 元/份 原价:20元

支付方式:

开通VIP包月会员 特价:29元/月

注:下载文档有可能“只有目录或者内容不全”等情况,请下载之前注意辨别,如果您已付费且无法下载或内容有问题,请联系我们协助你处理。
微信:fanwen365 QQ:370150219