Question Answering on PIQA ; 1. Unicorn 11B (fine-tuned). 90.1 ; 2. LLaMA3 8B+MoSLoRA. 89.7 ; 3. CompassMTL 567M with Tailor. 88.3 ; 4. LLaMA-3 8B + MixLoRA. 87.6. |
Ranked list of submissions for the Physical IQa: Physical Interaction QA Leaderboard. |
The PIQA dataset introduces the task of physical commonsense reasoning and a corresponding benchmark dataset Physical Interaction: Question Answering or PIQA. |
The AI2 Leaderboard platform hosts public leaderboards for a variety of AI challenges across multiple research domains. |
PIQA is a dataset for commonsense reasoning, and was created to investigate the physical knowledge of existing models in NLP. |
16 дек. 2022 г. · A new commonsense QA benchmark for naive physics reasoning focusing on how we interact with everyday objects in everyday situations. |
10 мая 2024 г. · The low-bit quantized open LLM leaderboard is a valuable tool for finding high-quality models that can be deployed efficiently on a given client. |
Leaderboard. We provide OpenCompass Leaderboard for the community to rank all public models and API models. If you would like to join the evaluation ... opencompass/README_zh... · Issues 208 · Pull requests 29 · Discussions |
In this paper, we introduce the task of physical commonsense reasoning and a corresponding benchmark dataset Physical. Interaction: Question Answering or PIQA . |
Novbeti > |
Axtarisha Qayit Anarim.Az Anarim.Az Sayt Rehberliyi ile Elaqe Saytdan Istifade Qaydalari Anarim.Az 2004-2023 |