Providing direction using images and speech recognizer

时间:2025-04-06

A Smart Direction Guide that Provides Users with Additional Destinationdependent Information

Angela Xian December 12, 2002

Objective

Based on the direction stored on mobile devices, predict users’ actions according to the destination (and purpose of the trip, if available), and thus provide users with additional meaningful information The prediction uses common sense reasoning, where the common sense knowledge may be inputted by users

User

Major ComponentsDirection Selectordirection

destination record input

Direction file storage

Destination Extractordestination

Learning Engine

Reasoning EngineDestination beans

Output

Destination Record Storage

A Sample DirectionThe file direction0.txt contains the following text:From home to Galleria Start from 70 Pacific St going North Turn right at Sidney St Turn left at Mass Ave Get to Central Square T station Take Red line Inbound to Park St Change to Green line Take Green line to Museum of Science Turn right at Cambridge St Go Straight for two blocks Galleria is on your left

Destination Bean Object1) PURPOSE - the purpose of the trip, such as shopping, party at friend’s house, etc. 2) DESTINATION NAME – the name of the destination, such as “Cambridgeside Galleria” or “Logan” 3) DESTINATION TYPE – what kind of place the user is going to, e.g. shopping mall or airport 4) RELATED INFORMATION – the additional information related to the destination and the purpose of the trip that may be useful to the user, such as web URLs of the businesses that the user is going to 5) LIKELIHOOD – the ratio (in the scale of 1 to 10) that indicates how likely the related information is relevant if only the destination name but not the purpose of the trip is available.

Areas of Improvements

Use a real database to store Destination Bean objects instead of using a sequential-access file (due to the unavailability of database software) Add intelligence to the reasoning engine, such as updating the likelihood ratio Add a learning engine that learns user-specific common sense from user’s input and behavior Add more functionalities in the application such as creating a new direction file, allowing multiple sources for the additional information

Slides from an Earlier Presentation

Existing Project: Providing Direction using Images and Speech Recognizer

Take pictures of landmarks along a route, at the same time, speak the instruction to the handhelde.g. image: picture of the sign “Kendall Square”; instruction: “Get off at the Kendall Square station.”

The handheld processes the speech input, associates the instruction with the picture User can also manually input or correct the instructions

A complete direction with image and text

The Question is -How to make this software agent smarter?

One Possible Approach: Predict Users’ Actions according to the Destination

A direction is packaged with some extra information that u

sers might find useful The following rules can be add into Openmind:ActionsGo to grocery store or flower store on the way to the party

Sample Destinations (do not need to be final dest.)Party at friend’s house

Shopping mallAirport, train station, etc.

Have a look at the current sales or promotions in this particular mallCheck changes/updates regarding arrival/departure times

Hotel

Inquire dinner menu in this hotel or in restaurants/stores nearby

User Scenario I

Lisa goes to South Station to catch a train to NYC She has the direction from her apartment to her friend’s place in NYC (through South Station and Grand Central) on her handheld On her way to South Station, her handheld tells her that her train will be delayed for 20 minutes She now has time to stop by a gift store on the way and get a little present for her friend

User Scenario II

Bob arrived in Logan at 8pm, with an empty stomach Using the direction stored on his handheld, he heads to Hyatt On his way, he checks the dinner menu in Hyatt (it comes with the direction) He does not like what Hyatt provide, so he looks at menus of restaurants nearby He picks the restaurant he likes and makes a reservation even before he arrives in the hotel

…… 此处隐藏:2096字,全部文档内容请下载后查看。喜欢就下载吧 ……
Providing direction using images and speech recognizer.doc 将本文的Word文档下载到电脑

    精彩图片

    热门精选

    大家正在看

    × 游客快捷下载通道(下载后可以自由复制和排版)

    限时特价:7 元/份 原价:20元

    支付方式:

    开通VIP包月会员 特价:29元/月

    注:下载文档有可能“只有目录或者内容不全”等情况,请下载之前注意辨别,如果您已付费且无法下载或内容有问题,请联系我们协助你处理。
    微信:fanwen365 QQ:370150219