Data version control, made easy

DAGsHub is a web platform for data version control and collaboration for data scientists and machine learning engineers.

Haven't heard of DVC? Want to see our awesomeness in action?

Based on open source software

DAGsHub is built on DVC, an open source version control system for machine learning projects, which works seamlessly with Git.
We are completely agnostic to your ecosystem, programming language and libraries. Fully modular and awesome!
Also, we are completely free for open source projects!

Reproduce your experiments

Reproduce any experiment from previous versions of your data pipeline. It’s as simple as switching a branch and running a single command.
Experiment with different hyperparameters in parallel. Each run will automagically save its results in an organized, easy-to-access location.
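As a rough illustration of the "switch a branch, run a single command" workflow above, here is a minimal Python sketch. It assumes that `git` and `dvc` are installed, that the single command in question is `dvc repro`, and that the branch name is one of your own. The helper names `reproduce_commands` and `reproduce` are hypothetical and not part of DVC or DAGsHub.

```python
import subprocess
from typing import Callable, List, Optional, Sequence


def reproduce_commands(branch: str) -> List[List[str]]:
    """Return the two commands used to reproduce the experiment on `branch`."""
    return [
        ["git", "checkout", branch],  # switch to the experiment's branch
        ["dvc", "repro"],             # assumption: the "single command" is dvc repro
    ]


def reproduce(branch: str,
              run: Optional[Callable[[Sequence[str]], object]] = None) -> None:
    """Run the commands in order; `run` can be swapped out for a dry run."""
    if run is None:
        # Default runner: execute the command and raise if it fails.
        def run(cmd: Sequence[str]) -> object:
            return subprocess.run(cmd, check=True)
    for cmd in reproduce_commands(branch):
        run(cmd)
```

Calling `reproduce("my-experiment", run=print)` prints the commands without executing them, which is a cheap way to check the sketch before pointing it at a real repository.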

Data & pipeline versioning

Giant spreadsheets of hyperparameters and inconsistent naming conventions are a thing of the past.
DVC tracks data & code versions. Every time you run a stage in your pipeline, your data is saved automatically, so you don’t need to worry about accidentally ruining your data.
You can visualize your pipeline as a DAG (directed acyclic graph), see how it changes over time, and get any version of your model with a click.
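To give a feel for how data versions can be saved and restored, here is a simplified Python sketch of content-addressed storage: each version of a file is stored under a hash of its contents, so an older version can always be brought back. This only illustrates the general idea. The cache layout and the function names `file_md5`, `save_version`, and `restore_version` are assumptions made for this example and do not reflect DVC's actual internals.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def file_md5(path: Path) -> str:
    """Hash a file's contents in chunks so large files need not fit in memory."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def save_version(data_file: Path, cache_dir: Path) -> str:
    """Copy `data_file` into `cache_dir` under its content hash; return the hash."""
    checksum = file_md5(data_file)
    cache_dir.mkdir(parents=True, exist_ok=True)
    target = cache_dir / checksum  # hypothetical flat layout, for illustration only
    if not target.exists():
        shutil.copyfile(data_file, target)
    return checksum


def restore_version(checksum: str, cache_dir: Path, destination: Path) -> None:
    """Overwrite `destination` with the cached version identified by `checksum`."""
    shutil.copyfile(cache_dir / checksum, destination)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        data = Path(tmp) / "data.csv"
        cache = Path(tmp) / "cache"
        data.write_text("a,b\n1,2\n")
        first = save_version(data, cache)
        data.write_text("a,b\n3,4\n")      # "accidentally" change the data
        save_version(data, cache)
        restore_version(first, cache, data)  # bring the first version back
        print(data.read_text())
```

Because the stored copy is keyed by its hash, only the short hash string needs to be recorded next to the code in order to pin down exactly which data a given version used.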

Collaborate more effectively

Leverage the same best practices used in software engineering to get more done. DAGsHub helps automate your workflow, so you can focus on work instead of coordination.
Share intermediate results from your pipeline with any collaborator, so nobody has to re-run all the pre-processing code. Re-run only the parts of the pipeline that you changed.
Ease of use also means it takes less time to get new team members and collaborators caught up and contributing.
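The idea of sharing intermediate results and re-running only the changed parts of a pipeline can be sketched in a few lines of Python. The example below is a toy model under stated assumptions, not how DAGsHub or DVC is implemented: stages are plain functions on strings, each stage carries a hand-written version label, and the shared cache is an ordinary dictionary. The names `fingerprint` and `run_pipeline` are hypothetical.

```python
import hashlib
from typing import Callable, Dict, List, Tuple

# A stage is (name, version label, function from input text to output text).
Stage = Tuple[str, str, Callable[[str], str]]


def fingerprint(name: str, version: str, stage_input: str) -> str:
    """Identify one stage execution by its name, its version, and its input."""
    payload = "\n".join([name, version, stage_input]).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def run_pipeline(stages: List[Stage], raw_input: str,
                 cache: Dict[str, str]) -> Tuple[str, List[str]]:
    """Run stages in order, reusing cached outputs where nothing changed.

    Returns the final output and the names of the stages that actually ran.
    """
    executed: List[str] = []
    current = raw_input
    for name, version, func in stages:
        key = fingerprint(name, version, current)
        if key not in cache:
            cache[key] = func(current)  # changed stage or changed input: run it
            executed.append(name)
        current = cache[key]            # otherwise reuse the stored result
    return current, executed


if __name__ == "__main__":
    shared_cache: Dict[str, str] = {}
    pipeline: List[Stage] = [("clean", "v1", str.strip), ("upper", "v1", str.upper)]
    print(run_pipeline(pipeline, "  hello ", shared_cache))
    # A collaborator edits only the second stage and reuses the same cache.
    pipeline[1] = ("upper", "v2", lambda text: text.upper() + "!")
    print(run_pipeline(pipeline, "  hello ", shared_cache))
```

In this sketch, a collaborator who receives the cache skips the unchanged `clean` stage entirely, and only the edited stage executes on the second run.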