Abstract The MediaMill TRECVID 2005 Semantic Video Search En(15)

时间:2025-05-05

UvA-MediaMill team participated in four tasks. For the detection of camera work (runid: A CAM) we investigate the benefit of using a tessellation of detectors in combination with supervised learning over a standard approach using global image information.

MeanAverage Precision

System Runs

Figure13:OverviewofallsearchrunssubmittedtoTRECVID2005,erswhoexploitedtheproposedparadigmareindicatedwithspecialmarkers.

5.5.3InteractiveSearch

Wesubmittedfourrunsforinteractivesearch.Threeusersfocussedonusingonlyonebrowser.Thefourthusersmixedallbrowsers.ResultsinFig.12indicatethatformostsearchtopics,usersoftheproposedparadigmforinteractivemulti-mediaretrievalscoreaboveaverage.Furthermore,usersofourapproachobtainatop-3averageprecisionresultfor19outof24topics.Bestperformanceisobtainedfor7topics.BestresultsareobtainedwiththeCrossBrowser.

Dependingonthesearchtopic,theproposedGalaxy-Browseraidsusersinsearchingfortherelevantsubsetofthecollection.Asthefeaturesusedarevisualbased,thesystemworkswellincaserelevantimagesofacertaintopicsharevisualsimilarity,e.g.queriesrelatedtotennisorcar.However,whentopicshavelargevarietyinvisualsettings,forinstancepersonxtopics,visualfeatureshardlyyieldad-ditionalinformationtoaidtheuserintheinteractivesearchprocess.Toourknowledge,noexistingfeaturesworkwellinthesecases.

Twosearchstrategieswerediscoveredduringtheinter-activeretrievaltaskusingtheSphereBrowser.Thereweretopicsforwhichmultipleclusterthreadsyieldedgoodresultsforthattopic,suchasTennis(156),Peoplewithbannersorsigns(161),Meeting(163)andTallbuilding(170).Forthesetopicsonlytherelevantpartsofthethreadsneededtobese-lected.AnotherselectionmethodwasfoundinqueriessuchasAirplanetakeo (167)andO cesetting(172).Heretherewereonlyalimitednumberofconsecutivevalidshotsvisibleineachthread,butbecauseofthecombinationofbothtimeandclusterthreadstherewasalwaysanothervalidbutnotyetselectedshotvisible.Forthesequeries,selectionwasdonebyhoppingfromonevalidresulttoanother.AlsoanumberoftopicswerenotanswerablebytheSphereBrowserbecauseoflackofnearbyshots.Theseincludepersonxtop-ics149,151,and153.

Togaininsightintheoverallqualityofourlexicon-drivenretrievalparadigm.Wecomparetheresultsofouruserswithallotherusersthatparticipatedintheretrievaltasksofthe2005TRECVIDbenchmark.WevisualizedtheresultsforallsubmittedsearchrunsinFig.13.Theresultsarestate-of-the-art.

6ExplorationofBBCRushes

TheBBCRushesconsistofrawmaterialusedtoproduceavideo.Sincethereislittletonospeech,thismaterialisverysuitableforvisual-onlyindexing.We rstsegmentedthevideo’susingourshotsegmentationalgorithm[2].Thenweappliedourbestperformingcameramotiondetector(seeSection4)ontheBBCrushesusingthemodelstrainedforthenewsdata.Tofurtherinvestigatetherobustnessofourvisualfeatures,weperformedvisual-onlyconceptde-tectionontheBBCrushesdata,withoutre-trainingthevisualmodels.Thevisualmodelsarethesameasusedinthevisualonlyfeaturetask(Section3.4)andinthemanualsearchtask(Section5.2).ThedetectorsthuslearnedonnewsdataaresubsequentlyevaluatedontheBBCrushesvideos.Obviously,notall101conceptsareuseful,sincetheyaretrainedonbroadcastnews.However,25conceptstranscendthenewsdomainandsomeperformsurprisinglywellontheBBCrushes:aircraft,bird,boat,building,car,charts,cloud,crowd,face,female,food,governmentbuild-ing,grass,meeting,mountain,outdoor,overlayedtext,sky,smoke,tower,tree,urban,vegetation,vehicle,waterbody.WedevelopedaversionoftheMediaMillsemanticvideosearchenginetailoredtotheBBCrushersbasedonthecom-putedindexes.Whilestillprimitiveintermsofutility,thesearchengineallowsuserstoexplorethecollectioninasur-prisingmanner.Theresultsagaincon rmtheimportanceofrobustvisualfeatures.Hence,forthistaskmuchisto

…… 此处隐藏:1632字,全部文档内容请下载后查看。喜欢就下载吧 ……
Abstract The MediaMill TRECVID 2005 Semantic Video Search En(15).doc 将本文的Word文档下载到电脑

精彩图片

热门精选

大家正在看

× 游客快捷下载通道(下载后可以自由复制和排版)

限时特价:7 元/份 原价:20元

支付方式:

开通VIP包月会员 特价:29元/月

注:下载文档有可能“只有目录或者内容不全”等情况,请下载之前注意辨别,如果您已付费且无法下载或内容有问题,请联系我们协助你处理。
微信:fanwen365 QQ:370150219