Dataset containing synthetically generated (by GPT-3.5 and GPT-4) short stories that only use a small vocabulary. |
12 мая 2023 г. · We introduce TinyStories, a synthetic dataset of short stories that only contain words that a typical 3 to 4-year-olds usually understand. |
A Diverse, Richly Annotated Corpus of Short-Form Stories. |
12 авг. 2024 г. · Model trained on the TinyStories Dataset, see https://arxiv.org/abs/2305.07759. Based on GPT-Neo architecture. |
Kaggle is the world's largest data science community with powerful tools and resources to help you achieve your data science goals. |
23 мая 2023 г. · It introduces TinyStories, a synthetic dataset which is a collection of short stories that consist of words that 3 to 4-year-olds can usually understand. |
A re-implementation of GPT language model in PyTorch, both training and inference. The model is trained on the TinyStories dataset with GPT-2 tokeniser. |
16 мая 2023 г. · A dataset for training tiny models to produce coherent English text with small vocabulary. R, T, Emp, Data, Smol, MS Attempts to produce KB-level TinyStories models : r/LocalLLaMA The Smallest GPT with Coherent English (by Microsoft) - Reddit Другие результаты с сайта www.reddit.com |
4 июл. 2024 г. · The Small Language Model from Microsoft, called Phi-3, was trained using a novel dataset called TinyStories. |
The new constrained dataset, Tiny Stories, is for analyzing core AI language capabilities. Researchers created this focused corpus of short, simple stories. |
Novbeti > |
Axtarisha Qayit Anarim.Az Anarim.Az Sayt Rehberliyi ile Elaqe Saytdan Istifade Qaydalari Anarim.Az 2004-2023 |