Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
General:  remote working file system Data Domain:  nlp tabular Integration:  dvc git
Dean 2e0f2b35fb
Merge branch 'localize-wfs' of Dean/RPPP into master
3 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
raw
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
d9735a5dd8
Add contributing guide
4 years ago
0aa625a470
Update 'README.md'
4 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
b2eb747da1
Finished training of text based classifier.
4 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
b54b8e2185
Added calculation of training metrics, modified metric filenames to relate to stage
3 years ago
b6c2981a67
This completes training for the numerical and categorical base model.
4 years ago
b6c2981a67
This completes training for the numerical and categorical base model.
4 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
50aefb4765
Added make dataset stage the runs a query on BigQuery and saves the raw data to the remote working file system.
3 years ago
cac18be035
Add 'remote-wfs-setup.md'
4 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
7942d96f38
- Removed remote-wfs elements of the project. All dvc managed files are local.
3 years ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

You have to be logged in to leave a comment. Sign In

RPPP - Reddit Post Popularity Predictor

This Project attempts to predict whether a reddit submission will be popular or not according to it's features.

We currently provide models for r/MachineLearning only, base on submission title and body.

DVC Remote Working File System

This project is also an exploration of DVC remote WFS workflow. To setup your remote WFS – read here: Remote WFS Setup

Contributing

Contributions Are Very Welcome!

Read the Contribution Guide for more information.

Ideas to work on:

  • Combine textual and numerical classifier into one model!
  • Add UI to test if your post is going to be successful!
  • Add MOAR data! (other subreddits, more from r/ML)
  • Improve model performance (there is a lotttt to improve)!
  • Add memes: Add MOAR MEMES
Tip!

Press p or to see the previous file or, n or to see the next file

About

RPPP – Reddit Post Popularity Predictor
A project with two goals:
1. Given a Reddit post, predict how popular it's going to be (what it's score will be)
2. Showcasing a remote working file system with DVC

Collaborators 1

Comments

Loading...