Additional Data
iKAT Year 1 (2023)
File | Description |
---|---|
2023_train_topics.json | Train topics in JSON format. |
2023_test_topics.json | Test topics in JSON format. |
2023_train_topics_psg_text.jsonl | Text of provenance passages in the train topics. |
2023_test_topics_psg_text.jsonl | Text of provenance passages in the test topics. |
2023_passages_hashes.tsv.bz2 | TSV file containing MD5 hashes of passage texts. The .tsv file has this format: doc_id passage_number passage_MD5 . Total download size is 2.2GB. |
2023_top_1000_query_results.zip | This zip file has queries from both training and testing topics, saved in queries_train.txt and queries_test.txt respectively. The results from the iKAT searcher (BM25 using manually resolved queries) are saved in the query_results_train and query_results_test folders. Each result file, with up to 1000 results, corresponds to a query based on line numbers, starting from zero. For instance, the results for the first query in queries_train.txt can be found in query_results_train/query_results_000.txt . In each result file, every line shows the ClueWeb22 ID followed by the URL . |
ret_bm25_rm3--type_automatic--num_ptkb_3--k_100--num_psg_3.official.run.json | Automatic Run |
ret_bm25_rm3--type_manual--num_ptkb_3--k_100--num_psg_3.official.run.json | Manual Run |
Baseline Runs iKAT 2023
We provide two baseline runs (linked above).
- Method.
BM25+RM3
(Pyserini default) as the initial retrieval (denoted byret_bm25_rm3
in the file name) method to retrieve 100 passages per query (denoted byk_100
).- The query in each turn was re-written using:
- The context, and
- The top-3 relevant PTKB statements (denoted by
num_ptkb_3
in the file name).
- Query re-writing. In all cases, the re-written query was construted by appending the relevant PTKB statements to the (manually or automatically) resolved query.
- Response generation. A response was generated using the top-3 passages retrieved with the re-written query (denoted by
num_psg_3
in the file name). We use theT5
modelmrm8488/t5-base-finetuned-summarize-news
available on HuggingFace for this purpose. - For automatic runs.
- The relevant PTKB statements were determined automatically by re-ranking the statements using
SentenceTransformers
, specifically, the modelcross-encoder/ms-marco-MiniLM-L-6-v2
available on HuggingFace. - The query was re-written automatically using the
castorini/t5-base-canard
model available on HuggingFace.
- The relevant PTKB statements were determined automatically by re-ranking the statements using
- For manual runs.
- The relevant PTKB statements provided in the
ptkb_provenance
field were used. - The manually re-written query provided in the
resolved_utterance
field was used.
- The relevant PTKB statements provided in the
Data from TREC CAsT
We provide the data from previous years' TREC CAsT below. The iKAT topics are similar, with the addition of the Personal Text Knowledge Base. For more information on TREC CAsT, see the website and read the overview papers [2019] [2020] [2021] [2022]
Note. TREC CAsT did not include a PTKB but you can be creative and modify the data according to your needs. Also, TREC CAsT used different collections (Wikipedia, KILT, MS MARCO, etc.) at different stages. iKAT is using a subset of the recently released ClueWeb22-B.
CAsT Year 4 (2022)
File | Description |
---|---|
2022_automatic_evaluation_topics_tree_v1.0.json | Contains each conversation tree (topic) with an automatic rewrite generated for each user utterance. |
2022_evaluation_topics_turn_ids.json | Contains each conversation tree (topic) with the resolved query for each user utterance. |
2022_evaluation_topics_tree_v1.0.json | Contains all ids that responses/ranked passages need to be returned for. |
2022_evaluation_topics_flattened_duplicated_v1.0.json | Contains all possible conversation paths across all the conversation trees. |
CAsT Year 3 (2021)
File | Description |
---|---|
2021_automatic_evaluation_topics_v1.0.json | 25 primary evaluation topics in JSON format. Variant: Automatic |
2021_manual_evaluation_topics_v1.0.json | 25 primary evaluation topics in JSON format. Variant: Manual |
2021qrels.txt | Qrels file for passage ranking task. |
CAsT Year 2 (2020)
File | Description |
---|---|
2020_automatic_evaluation_topics_v1.0.json | 25 primary evaluation topics in JSON format. Variant: Automatic |
2020_manual_evaluation_topics_v1.0.json | 25 primary evaluation topics in JSON format. Variant: Manual |
2020qrels.txt | Qrels file for passage ranking task. |
CAsT Year 1 (2019)
File | Description |
---|---|
train_topics_v1.0.json | 30 example training topics in JSON format. |
evaluation_topics_v1.0.json | 50 evaluation topics in JSON format. |
2019qrels.txt | Official evaluation qrels file for passage ranking task. |
train_qrels.txt | Limited (incomplete) training judegements for 5 topics (approximately 50 turns). The judgments are graded on a three point scale (2 very relevant, 1 relevant, and 0 not relevant). |