Introduction
|
The goal of the interactive
task is not to compare participants systems in a competitive environment, but
rather for participants to explore variations of their retrieval system within
a given scenario. There are at least four aspects of a Cross-Language image
retrieval system we could investigate including:
| |
|
| (1) |
How the CLIR system supports user query formulation for
images with English captions, particularly for users in their native language
which may be non-English. This is also an opportunity to study how the images
themselves could also be used as part of the query formulation.
|
|
|
| (2)* |
Whether the CLIR system supports
query re-formulation, e.g. the support of positive and negative feedback to
improve the user's search experience, and how this affects
retrieval.
|
| |
|
| (3)* |
Browsing the image collection. This might include
support for summarising the image results set through categorising images
according to pre-defined categories (which must also be translated) or visually
based on the images themselves (e.g. by shape, colour etc.). Browsing becomes
particularly important in a CLIR system when query translation fails and
returns irrelevant or no results. |
| |
|
| (4) |
How well the CLIR system presents the retrieval results
to the user to enable selection of relevant images. This might include how the
system presents the caption to the user (particularly if they are not familiar
with English or some of the specific and colloquial language used in the
captions) and investigate the relationship between the image and caption for
retrieval purposes.
|
Each participant
will compare two interactive Cross-Language image retrieval systems (one
intended as a baseline) that differ in the facilities provided for interactive
query refinement (this includes points 2 and 3 from above). For example the user is searching for a picture of an
arched bridge and starts with the query "bridge". Through query modification (e.g. query expansion based on the
captions), or perhaps browsing for similar images and using feedback based on
visual features, the user refines the query
until relevant images are found.
As a CL image retrieval task,
the initial query should be in a language different from the collection (i.e.
not English) and translated into English for retrieval. The
simplest approach is to translate the query
and display only images to the user (assuming relevance can be based on the
image only and images are language independent), maybe using relevance feedback
on visual features only, enabling browsing, or categorising the images in some
way and allowing the user to narrow their search through selecting these
categories. Any text displayed to the user
must be translated into the user's source language. This might include
captions, summaries, pre-defined image categories etc.
A minimum
of 8 users (who can search with non-English queries) and 16 example images
(topics) are required for this task (we supply the topics) which is described
below. Although the goal is to experiment with users who search with
non-English queries, we are willing to relax this condition if required (but
please discuss with Paul Clough).
|
Scenario and example
images
|
Given
an image (not including the caption) from the St Andrews
collection, the goal for the searcher is to find the same image again using
a Cross-Language image retrieval system. This aims to allow researchers to
study how users describe images and their methods of searching the collection
for particular images, e.g. browsing or by conducting specific searches.
The scenario models the
situation in which a user searches with a specific image in mind (perhaps they
have seen it before) but without knowing key information thereby requiring them
to describe the image instead, e.g. searches for a familiar painting whose
title and painter are unknown.
This task can be used to determine whether the retrieval system is
being used in the manner intended by the system designers and determine how the
interface helps users reformulate and refine their search requests.
The following images have been selected for
this task (click to view larger image):
| TOPIC 1 |
TOPIC
2 |
TOPIC
3 |
TOPIC
4 |
 |
 |
 |
 |
| TOPIC 5 |
TOPIC 6 |
TOPIC 7 |
TOPIC 8 |
 |
 * |
 |
 |
| TOPIC 9 |
TOPIC 10 |
TOPIC 11 |
TOPIC 12 |
 |
 * |
 |
 |
| TOPIC 13 |
TOPIC 14 |
TOPIC 15 |
TOPIC 16 |
 |
 |
 |
 * |
*(NB - the
black box in the larger version of these images is to hide text on the
postcard) The scenario should be described to users before starting the
experiments. You can use some text like this:
In this task
we will show you 16 different images, one at a time, using two different
Cross-Language image retrieval systems. The pictures cover a variety of topics
and are taken from the St Andrews historic photographic collection. When we
show you each image, we will ask you to search the collection and try and find
that same image again. We will let you keep the image to refer to during your
search. This known-item search is aimed at modelling the scenario in which you
know the image you want from the collection, but don't have it to hand; you
know it exists in the collection but can't remember the exact person, location
or name of the object in the image. You can browse and search for the image any
way you want and you have a maximum of 5 minutes to find each image. You can
stop searching when you have found it. We want to observe how our system
supports this kind of task, what words/phrases you use to describe the images
and whether you are successful in finding the required images or not.
Please
note that it is a good idea to let users search the collection prior to
starting the experiments to let them get a feel for its contents. More
information about the collection which you could give to people can be found
here. It is also a good idea to iterate to users that
they can search using any part of the image, i.e. objects in the foreground and
background.
|
Experiment instructions for
participants
|
The interactive ImageCLEF
task is run similar to
iCLEF 2003
using a similar experimental procedure. However, because of the type of
evaluation (i.e. whether known items are found or not), the experimental
procedure for iCLEF 2004
(Q&A) is also very relevant and we make use of both iCLEF
procedures.
Given the 16 topics shown above, participants get the 8 users to test each system with 8
topics. Users are given a maximum of 5 mins only to find each image.
Topics and systems will be presented to the
user in combinations following a latin-square design to ensure user/topic and
system/topic interactions are minimised. The
experimental
procedure given in iCLEF 2004 is to be followed.
The experiment
duration is slightly different than for iCLEF and participants should use the
following as a guideline:
| Introductory
stuff |
10 minutes
|
| Initial survey
|
5 minutes |
| Tutorials (2 systems)
|
30 minutes total
|
| Break |
10 minutes |
| Searching (system A, 8
topics) |
40 minutes (5
mins/img) |
| Post/system survey
|
5 minutes |
| Break |
10 minutes |
| Searching (system B, 8
topics) |
40 minutes (5
mins/img) |
| Post/system survey
|
5 minutes |
| Final survey
|
10 minutes |
The user
questionnaires are a recommended way of obtaining feedback from the user about
their level of satisfaction with the system. There is no fixed questionnaire,
but you can use the
questionnaires from
iCLEF 2003 to give you some ideas for ImageCLEF. These correspond to the
surveys suggested in the above procedure, but may need some modification to
suit the image retrieval task.
To measure the performance of this task, the following metrics will
be used: whether the user could find the intended image or not, the time taken
to find the image, the number of steps/iterations required to reach the
solution (e.g. the number of clicks or the number of queries), and the number
of images displayed to the user. For each topic, we require that you summarise
your system and provide us with this information.
These factors help to measure the
efficiency with which a cross language image retrieval search could be
performed, e.g. how quickly or how many clicks were necessary to find the
relevant image. Information about how the interface was useful for the user can
be obtained from performing a user questionnaire after the
task.
|
What to submit
|
Please provide us with a basic description of your two systems, the
language used for searching, and the main feature of your system which supports
query refinement, e.g. combining visual and text for relevance feedback.
For each topic, please state information for the measures given
above: whether the user found the image or not, the time taken to find the
image, the number of steps/iterations, and the number of images displayed to
the user. We will normalise some of these scores (e.g. the time taken) across
all submissions to compare systems.
Please can you submit your
results by: June 10th.
We follow the timetable given
in iCLEF
2004.
|
|