Question Answering on HotpotQA

Evaluation Metrics

ANS-EM
ANS-F1
JOINT-EM
JOINT-F1
SUP-EM
SUP-F1
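
The metrics listed above can be illustrated with a minimal per-example sketch. It assumes the convention commonly used for HotpotQA: answer EM/F1 are computed over normalized answer tokens, supporting-fact EM/F1 over sets of (title, sentence index) pairs, and the joint scores combine the two by multiplying EM values and multiplying precision and recall before taking F1. The function names are hypothetical, the special handling of yes/no answers is omitted, and leaderboard values would be averages of these per-example scores.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def prf(num_same, num_pred, num_gold):
    """Precision, recall and F1 from overlap counts (all zero if no overlap)."""
    if num_same == 0:
        return 0.0, 0.0, 0.0
    p = num_same / num_pred
    r = num_same / num_gold
    return p, r, 2 * p * r / (p + r)


def answer_scores(pred, gold):
    """ANS-EM plus token-level precision, recall and ANS-F1 for one example."""
    pred_toks = normalize(pred).split()
    gold_toks = normalize(gold).split()
    em = float(pred_toks == gold_toks)
    common = Counter(pred_toks) & Counter(gold_toks)  # multiset overlap
    p, r, f1 = prf(sum(common.values()), len(pred_toks), len(gold_toks))
    return em, p, r, f1


def support_scores(pred_facts, gold_facts):
    """SUP-EM plus set-level precision, recall and SUP-F1 for one example."""
    pred_set, gold_set = set(pred_facts), set(gold_facts)
    em = float(pred_set == gold_set)
    p, r, f1 = prf(len(pred_set & gold_set), len(pred_set), len(gold_set))
    return em, p, r, f1


def joint_scores(ans, sup):
    """JOINT-EM and JOINT-F1 from the answer and supporting-fact tuples."""
    ans_em, ans_p, ans_r, _ = ans
    sup_em, sup_p, sup_r, _ = sup
    p = ans_p * sup_p
    r = ans_r * sup_r
    f1 = 2 * p * r / (p + r) if p + r > 0 else 0.0
    return ans_em * sup_em, f1


if __name__ == "__main__":
    ans = answer_scores("The Eiffel Tower", "Eiffel Tower")
    sup = support_scores([("A", 0), ("B", 1)], [("A", 0), ("B", 2)])
    print(ans, sup, joint_scores(ans, sup))
```

Under this convention a system gets joint credit only when both the answer and its supporting facts are right, which is why the JOINT columns in the results are consistently lower than the ANS and SUP columns.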

Evaluation Results

Performance of each model on this benchmark

| Model name | ANS-EM | ANS-F1 | JOINT-EM | JOINT-F1 | SUP-EM | SUP-F1 | Paper Title | Repository |
|---|---|---|---|---|---|---|---|---|
| Beam Retrieval | 0.727 | 0.850 | 0.505 | 0.775 | 0.663 | 0.901 | End-to-End Beam Retrieval for Multi-Hop Question Answering | |
| BigBird-etc | - | 0.755 | - | 0.736 | - | 0.891 | Big Bird: Transformers for Longer Sequences | |
| AISO | 0.675 | 0.805 | 0.449 | 0.720 | 0.612 | 0.860 | Adaptive Information Seeking for Open-Domain Question Answering | |
| Chain-of-Skills | 0.674 | 0.801 | 0.457 | 0.717 | 0.613 | 0.853 | Chain-of-Skills: A Configurable Model for Open-domain Question Answering | |
| TPRR | 0.670 | 0.795 | 0.444 | 0.708 | 0.594 | 0.843 | - | - |
| HopRetriever + Sp-search | 0.671 | 0.799 | 0.432 | 0.706 | 0.574 | 0.835 | HopRetriever: Retrieve Hops over Wikipedia to Answer Complex Questions | - |
| EBS-Large | 0.662 | 0.793 | 0.420 | 0.700 | 0.573 | 0.840 | - | - |
| HopRetriever | 0.671 | 0.799 | 0.431 | 0.698 | 0.572 | 0.826 | - | - |
| IRRR+ | 0.663 | 0.791 | 0.428 | 0.696 | 0.569 | 0.832 | Answering Open-Domain Questions of Varying Reasoning Steps from Text | |
| EBS-SH | 0.655 | 0.786 | 0.409 | 0.689 | 0.559 | 0.831 | - | - |
| IRRR | 0.657 | 0.782 | 0.421 | 0.686 | 0.559 | 0.821 | Answering Open-Domain Questions of Varying Reasoning Steps from Text | |
| HopRetriever-V2 | 0.648 | 0.778 | 0.410 | 0.678 | 0.561 | 0.818 | - | - |
| AFSGraph-retriever | 0.646 | 0.778 | 0.411 | 0.670 | 0.557 | 0.812 | - | - |
| Recursive Dense Retriever | 0.623 | 0.753 | 0.418 | 0.666 | 0.575 | 0.809 | Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval | |
| Step-by-Step Retriever | 0.630 | 0.754 | 0.404 | 0.662 | 0.546 | 0.800 | - | - |
| HopRetriever-V1 | 0.608 | 0.739 | 0.380 | 0.639 | 0.531 | 0.793 | - | - |
| DDRQA | 0.625 | 0.759 | 0.360 | 0.639 | 0.510 | 0.789 | Answering Any-hop Open-domain Questions with Iterative Document Reranking | - |
| DR model large | 0.620 | 0.753 | 0.354 | 0.630 | 0.499 | 0.778 | - | - |
| HopAns | 0.617 | 0.746 | 0.368 | 0.629 | 0.500 | 0.772 | - | - |