Additional Data

iKAT Year 1 and 2 (2023 and 2024)

File	Description
2024_test_topics.json	Test topics in JSON format.
2023_train_topics.json	Train topics in JSON format.
2023_test_topics.json	Test topics in JSON format.
2023_train_topics_psg_text.jsonl	Text of provenance passages in the train topics.
2023_test_topics_psg_text.jsonl	Text of provenance passages in the test topics.
2023_passages_hashes.tsv.bz2	TSV file containing MD5 hashes of passage texts. The `.tsv` file has this format: `doc_id passage_number passage_MD5`. Total download size is 2.2GB.
2023_top_1000_query_results.zip	This zip file has queries from both training and testing topics, saved in `queries_train.txt` and `queries_test.txt` respectively. The results from the iKAT searcher (`BM25` using manually resolved queries) are saved in the `query_results_train` and `query_results_test` folders. Each result file, with up to 1000 results, corresponds to a query based on line numbers, starting from zero. For instance, the results for the first query in `queries_train.txt` can be found in `query_results_train/query_results_000.txt`. In each result file, every line shows the `ClueWeb22 ID` followed by the `URL`.
ret_bm25_rm3--type_automatic--num_ptkb_3--k_100--num_psg_3.official.run.json	Automatic Run
ret_bm25_rm3--type_manual--num_ptkb_3--k_100--num_psg_3.official.run.json	Manual Run

Baseline Runs iKAT 2023

We provide two baseline runs (linked above).

Method.
- BM25+RM3 (Pyserini default) as the initial retrieval (denoted by ret_bm25_rm3 in the file name) method to retrieve 100 passages per query (denoted by k_100).
- The query in each turn was re-written using:
  1. The context, and
  2. The top-3 relevant PTKB statements (denoted by num_ptkb_3 in the file name).
Query re-writing. In all cases, the re-written query was construted by appending the relevant PTKB statements to the (manually or automatically) resolved query.
Response generation. A response was generated using the top-3 passages retrieved with the re-written query (denoted by num_psg_3 in the file name). We use the T5 model mrm8488/t5-base-finetuned-summarize-news available on HuggingFace for this purpose.
For automatic runs.
- The relevant PTKB statements were determined automatically by re-ranking the statements using SentenceTransformers, specifically, the model cross-encoder/ms-marco-MiniLM-L-6-v2 available on HuggingFace.
- The query was re-written automatically using the castorini/t5-base-canard model available on HuggingFace.
For manual runs.
- The relevant PTKB statements provided in the ptkb_provenance field were used.
- The manually re-written query provided in the resolved_utterance field was used.

Data from TREC CAsT

We provide the data from previous years' TREC CAsT below. The iKAT topics are similar, with the addition of the Personal Text Knowledge Base. For more information on TREC CAsT, see the website and read the overview papers [2019] [2020] [2021] [2022]

Note. TREC CAsT did not include a PTKB but you can be creative and modify the data according to your needs. Also, TREC CAsT used different collections (Wikipedia, KILT, MS MARCO, etc.) at different stages. iKAT is using a subset of the recently released ClueWeb22-B.

CAsT Year 4 (2022)

File	Description
2022_automatic_evaluation_topics_tree_v1.0.json	Contains each conversation tree (topic) with an automatic rewrite generated for each user utterance.
2022_evaluation_topics_turn_ids.json	Contains each conversation tree (topic) with the resolved query for each user utterance.
2022_evaluation_topics_tree_v1.0.json	Contains all ids that responses/ranked passages need to be returned for.
2022_evaluation_topics_flattened_duplicated_v1.0.json	Contains all possible conversation paths across all the conversation trees.

CAsT Year 3 (2021)

File	Description
2021_automatic_evaluation_topics_v1.0.json	25 primary evaluation topics in JSON format. Variant: Automatic
2021_manual_evaluation_topics_v1.0.json	25 primary evaluation topics in JSON format. Variant: Manual
2021qrels.txt	Qrels file for passage ranking task.

CAsT Year 2 (2020)

File	Description
2020_automatic_evaluation_topics_v1.0.json	25 primary evaluation topics in JSON format. Variant: Automatic
2020_manual_evaluation_topics_v1.0.json	25 primary evaluation topics in JSON format. Variant: Manual
2020qrels.txt	Qrels file for passage ranking task.

CAsT Year 1 (2019)

File	Description
train_topics_v1.0.json	30 example training topics in JSON format.
evaluation_topics_v1.0.json	50 evaluation topics in JSON format.
2019qrels.txt	Official evaluation qrels file for passage ranking task.
train_qrels.txt	Limited (incomplete) training judegements for 5 topics (approximately 50 turns). The judgments are graded on a three point scale (2 very relevant, 1 relevant, and 0 not relevant).