datasets

src-d

source{d} datasets ("big code") for source code analysis and machine learning on source code

github git

View on GitHub

345 Stars

85 Forks

345 Watchers

Jupyter Notebook Language

other License

100 SrcLog Score

Cost to Build

$360.6K

Market Value

$786.9K

How is this calculated?

Growth over time

15 data points · 2021-07-01 → 2026-04-01

Stars Forks Watchers

💬

How do you feel about this project?

Ask AI about datasets

Question copied to clipboard

What is the src-d/datasets GitHub project? Description: "source{d} datasets ("big code") for source code analysis and machine learning on source code". Written in Jupyter Notebook. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone datasets

Clone via HTTPS

git clone https://github.com/src-d/datasets.git

Clone via SSH

[email protected]:src-d/datasets.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the datasets issue tracker:

Open GitHub Issues

Similar to datasets

gitignore gogs tips hub git-extras diff-so-fancy phabricator gitea husky ungit git-recipes gitbucket libgit2 tig git-lfs decap-cms gitsome desktop pure git-standup githug legit docker-gitlab vim-gitgutter gitql my-git semantic-release gitpitch bfg-repo-cleaner git-style-guide