The How2 dataset contains 13,500 videos, or 300 hours of speech, and is split into 185,187 training, 2022 development (dev), and 2361 test utterances. |
Dataset contains article and summary pairs extracted and constructed from an online knowledge base written by different human authors. |
XL-Sum is a comprehensive and diverse dataset for abstractive summarization comprising 1 million professionally annotated article-summary pairs from BBC, ... |
For this task, we are going to use the SamSum Dataset, which contains three csv files for training, testing, and validation. All these files are structured ... |
News Articles and summary from CNN-DailyMail Dataset. |
The How2 dataset contains 13,500 videos, or 300 hours of speech, and is split into 185,187 training, 2022 development (dev), and 2361 test utterances. |
Pn-summary is a dataset for Persian abstractive text summarization. A well-structured summarization dataset for the Persian language consists of 93,207 records ... |
Dataset for summarization of long documents. Adapted from this repo. Note that original data are pre-tokenized so this dataset returns " ".join(text) and add " ... |
Abstractive text summarization summarizes the text maintaining coherent information in a similar amount of words as human generated summary. |
The dataset contains online news articles (781 tokens on average) paired with multi-sentence summaries (3.75 sentences or 56 tokens on average). The processed ... |
Novbeti > |
Axtarisha Qayit Anarim.Az Anarim.Az Sayt Rehberliyi ile Elaqe Saytdan Istifade Qaydalari Anarim.Az 2004-2023 |