Update src/assets/text_content.py
src/assets/text_content.py CHANGED
@@ -6,6 +6,10 @@ INTRODUCTION_TEXT = f"""
 🐨 KG LLM Leaderboard aims to track, rank, and evaluate the performance of released Large Language Models on traditional KBQA/KGQA datasets.
 
 The data on this page is sourced from a research paper. If you intend to use the data from this page, please remember to cite the following source: https://arxiv.org/abs/2303.07992
+
+We compare the current SOTA traditional KBQA models (fine-tuned (FT) and zero-shot (ZS)),
+LLMs in the GPT family, and other non-GPT LLMs. In QALD-9 and LC-quad2, the evaluation metric used is F1, while other datasets use Accuracy (Exact match).
+
 """
 
 LLM_BENCHMARKS_TEXT = f"""
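For context on the metric names the added text mentions, here is a minimal sketch of how F1 and Accuracy (Exact match) are commonly computed for KBQA-style evaluation, assuming each question's answer is a set of entity strings. The answer format and function names are illustrative assumptions, not the evaluation code from the cited paper (https://arxiv.org/abs/2303.07992):

```python
# Illustrative sketch only: assumes each predicted/gold answer is a set of
# strings per question. Names are hypothetical, not taken from the paper.

def exact_match_accuracy(preds: list[set[str]], golds: list[set[str]]) -> float:
    """Accuracy (Exact match): a question scores 1 only if the predicted
    answer set equals the gold answer set exactly."""
    return sum(p == g for p, g in zip(preds, golds)) / len(golds)

def answer_f1(pred: set[str], gold: set[str]) -> float:
    """Set-level F1 between one predicted and one gold answer set."""
    if not pred or not gold:
        return float(pred == gold)  # convention: two empty sets count as a match
    tp = len(pred & gold)           # true positives: answers present in both sets
    if tp == 0:
        return 0.0
    precision = tp / len(pred)
    recall = tp / len(gold)
    return 2 * precision * recall / (precision + recall)

def macro_f1(preds: list[set[str]], golds: list[set[str]]) -> float:
    """Per-question F1 averaged over the dataset, one common convention
    for QALD-9 and LC-quad2 style evaluation."""
    return sum(answer_f1(p, g) for p, g in zip(preds, golds)) / len(golds)
```

Note that exact match is the stricter of the two: partial overlap with the gold answer set earns no credit, so scores reported under the two metric families are not directly comparable across datasets.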