Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

General

open-data-registry aws-pds sustainability agriculture earth observation geospatial life sciences + 745

Task

disaster response classification image classification object detection autonomous vehicles machine translation vision + 490

 Open Source Data Science Datasets

Path: .

Identify promoter regions by applying machine learning methods of binary classification to read FASTA files with DNA samples with length of 81 bp or 251 bp of promoter regions and non-promoter regions.

dataset dvc git mlflow github

Path: .

Aim to create a reliable skin cancer diagnosis model with extensive experimentation and handling imbalenced dataset.

dataset computer vision dvc git mlflow github

nsd8888 / mlops-mlflow

Updated 3 months ago

Path: .

This project is for practicing mlflow on cloud

dataset dvc git mlflow github

Dean / Bookdata-tools

Updated 3 months ago

Path: .

This repository contains the code to import and integrate the book and rating data that we work with. It imports and integrates data from several sources in a homogenous tabular outputs; import scripts are primarily Rust, with Python implement analyses.

dataset nlp dvc git github

morrisalp / unikud

Updated 6 months ago

Path: . data

UNIKUD is an open-source tool for adding vowel signs (nikud) to Hebrew text with deep learning, using absolutely no rule-based logic.

dataset model nlp dvc git mlflow github

DagsHub / triviaqa

Updated 1 year ago

Path: .

Code for the TriviaQA reading comprehension dataset

dataset nlp dvc git github