How to format your GeoCLEF results

Results have to be submitted in ASCII format, with one line per document retrieved.
The lines have six fields to be formatted as follows:

1

2

3

4

5

6

10

Q0

document.00072

0

0.017416

runidex1

  The fields must be separated by ONE blank and have the following meanings:

1) Query number (eliminate any identifying letters). Please only use SIMPLE numbers ("1", not "001") GeoCLEF participants – Please adjust your topic numbers from GC026 to 26 when submitting your results
INPUT MUST BE SORTED NUMERICALLY BY QUERY NUMBER.

2) Query iteration (will be ignored. Please choose "Q0" for all experiments).

3) Document number (content of the <DOCNO> tag.).

4) Rank 0..n (0 is best matching document. If you retrieve 1000 documents per query, rank will be 0..999, with 0 best and 999 worst). Note that rank starts at 0 (zero) and not 1 (one).
MUST BE SORTED IN INCREASING ORDER PER QUERY.

5) RSV value (system specific value that expresses how relevant your system deems a document to be. This is a floating point value. High relevance should be expressed with a high value). If a document D1 is considered more relevant than a document D2, this must be reflected in the fact that RSV1 > RSV2. If RSV1 = RSV2, the documents may be randomly reordered during calculation of the evaluation measures. Please use a decimal point ".", not a comma. Do not use any form of separators for thousands. The only legal characters for the RSV values are 0-9 and the decimal point.
MUST BE SORTED IN DECREASING ORDER PER QUERY.

6) Run identifier (please chose an unique ID for each experiment you submit). Only use a-z, A-Z and 0-9. No special characters, accents, etc.

The fields are separated by a single space.
The file contains nothing but lines formatted in the way described above.
You are expected to retrieve 1000 documents per query. An experiment that retrieves a maximum of 1000 documents each for 20 queries therefore produces a file that contains a maximum of 20000 lines.

You should know that the effectiveness measures used in CLEF evaluate the performance of systems at various points of recall. Participants must thus return at most 1000 documents per query in their results. Please note that by its nature, the average precision measure does not penalize systems that return extra irrelevant documents at the bottom of their result lists. Therefore, you will usually want to use the maximum number of allowable documents in your official submissions.

Example: The following is a snapshot of part of a run submitted by Berkeley to GeoCLEF 2005;

1 Q0 SDA.950613.0223 0 0.744537 BKGeoED1
1 Q0 SDA.940805.0057 1 0.644537 BKGeoED1
1 Q0 FR940911-000351 2 0.544537 BKGeoED1
1 Q0 FR940904-000441 3 0.544537 BKGeoED1
1 Q0 SDA.940117.0015 4 0.392188 BKGeoED1
1 Q0 SDA.940117.0058 5 0.341676 BKGeoED1
1 Q0 SDA.940912.0047 6 0.312085 BKGeoED1
1 Q0 FR940123-000131 7 0.295260 BKGeoED1
1 Q0 SDA.940806.0011 8 0.285681 BKGeoED1
1 Q0 SDA.950906.0270 9 0.277957 BKGeoED1
1 Q0 SDA.940120.0021 10 0.272608 BKGeoED1
.
.
.
1 Q0 SDA.940209.0205 997 0.090403 BKGeoED1
1 Q0 FR940918-002546 998 0.090403 BKGeoED1
1 Q0 SDA.951228.0139 999 0.090381 BKGeoED1
2 Q0 SDA.950328.0077 0 0.561243 BKGeoED1
2 Q0 FR940605-000976 1 0.461243 BKGeoED1
2 Q0 FR940724-000195 2 0.361243 BKGeoED1
2 Q0 SDA.950215.0125 3 0.344085 BKGeoED1
2 Q0 SDA.950915.0122 4 0.321175 BKGeoED1
2 Q0 SDA.941206.0242 5 0.285106 BKGeoED1
2 Q0 SDA.950705.0168 6 0.277838 BKGeoED1
2 Q0 SDA.950711.0124 7 0.262405 BKGeoED1
2 Q0 SDA.940712.0054 8 0.257480 BKGeoED1
2 Q0 SDA.950704.0164 9 0.256444 BKGeoED1
2 Q0 FR940626-002320 10 0.254796 BKGeoED1
.
.
.
2 Q0 FR940911-002528 997 0.076474 BKGeoED1
2 Q0 SPIEGEL9495-011824 998 0.076453 BKGeoED1
2 Q0 SDA.950404.0132 999 0.076441 BKGeoED1
3 Q0 SDA.940609.0257 0 0.803262 BKGeoED1
3 Q0 SDA.950327.0006 1 0.703262 BKGeoED1
3 Q0 SDA.950330.0269 2 0.603262 BKGeoED1
3 Q0 SDA.940428.0061 3 0.588835 BKGeoED1
3 Q0 SDA.950130.0033 4 0.531738 BKGeoED1
3 Q0 SDA.951128.0211 5 0.529770 BKGeoED1
3 Q0 SDA.950510.0170 6 0.509732 BKGeoED1
3 Q0 SDA.940815.0073 7 0.504174 BKGeoED1
3 Q0 SDA.950125.0281 8 0.477380 BKGeoED1
3 Q0 SDA.950529.0095 9 0.474632 BKGeoED1
3 Q0 SDA.940512.0001 10 0.467610 BKGeoED1

 

Note the run ID (field 6) is composed to identify the run as from Berkeley (BK) for GeoCLEF (Geo) and bilingual English-German (ED) number 1.