The GitHub Code dataset consists of 115M code files from GitHub in 32 programming languages with 60 extensions totaling in 1TB of data. The dataset was created ... |
Fashion-MNIST is a dataset comprising of 28×28 grayscale images of 70,000 fashion products from 10 categories, with 7,000 images per category. The training set ... |
A collection of datasets for machine learning for big code - CUHK-ARISE/ml4code-dataset. |
The Django dataset is a dataset for code generation comprising of 16000 training, 1000 development and 1805 test annotations. Each data point consists of a line ... |
This dataset is released as a part of the Machine Learning for Programming project that aims to create new kinds of programming tools and techniques based on ... |
30 мая 2023 г. · The GitHub Code dataset consists of 115M code files from GitHub in 32 programming languages with 60 extensions totaling in 1TB of data. 7. MBPP. |
This is a cleaner version of Github-code dataset, we add the following filters: Average line length < 100; Alpha numeric characters fraction > ... |
11 апр. 2023 г. · Datasets, tools, and benchmarks for representation learning of code. - github/CodeSearchNet. Code of Conduct · Instructions · README.md · MIT License |
Take a course with Kaggle Notebooks ; Data Visualization course logo. Data Visualization. Make great data visualizations. A great way to see the power of coding! |
This dataset includes the Java source code and JSON files containing the names and the tokens of the methods of 11 of the most popular GitHub Java projects. |
Novbeti > |
Axtarisha Qayit Anarim.Az Anarim.Az Sayt Rehberliyi ile Elaqe Saytdan Istifade Qaydalari Anarim.Az 2004-2023 |