Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Integration:  dvc git github
Patrick Frank 656d8b994c
Merge branch 'fixes/update-dependencies'
3 months ago
86cb5d4692
Add dagshub s3 remote for big file upload
8 months ago
2fa274c11d
Add vscode debug configurations
3 months ago
6f95f42f5e
Changes for colab environment
8 months ago
f7aa7aef46
Add more high quality training positions
8 months ago
db45655c6c
Install milvus vector db and run hello milvus notebook
7 months ago
c9db055d93
Fix most liniting errors
3 months ago
src
c9db055d93
Fix most liniting errors
3 months ago
019a028fa7
Initialize dvc
10 months ago
a81f66083b
Add flak8 configuration
3 months ago
47a6f4181d
Update gitignore
3 months ago
97265870f6
Update poetry configuration
3 months ago
5c73d8b7b5
Document past experiments
7 months ago
5f79e7b305
Configure poetry to use venv in repository
7 months ago
db45655c6c
Install milvus vector db and run hello milvus notebook
7 months ago
9425b33ca5
Push model from colab
9 months ago
97265870f6
Update poetry configuration
3 months ago
5f79e7b305
Configure poetry to use venv in repository
7 months ago
97265870f6
Update poetry configuration
3 months ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

You have to be logged in to leave a comment. Sign In

Chess Embedding Experiments

Cheat Sheet

  • Generate training positions

    python -m src.run.generate_positions

  • Train a neural network

    python -m src.run.train

  • Evaluate a trained network by starting the notebook: src/run/evaluate.ipynb

Tools

  • Start ML Flow UI, in correct python venv

    mlflow ui

  • Export dependencies to requirements.txt

    poetry export > requirements.txt

Notes

Milvus

  • could only get milvus 2.3.1 to work, so use that for now
  • but had to downgrade python to 3.9, because of compatibility issues
  • and only works with recent tensorflow version, so it's incompatible with aws sage maker
    • maybe I need to build a different toolchain for different python versions

TODOs

  • Document findings of up to current model training
  • Write to db from .npy files
    • write tokenized positions with some metadata and id
    • write embeddings generated from a model
  • Write to db from .pgn file
    • maybe some refactoring is needed
  • make embeddings better for search
    • document approaches
    • make a plan
Tip!

Press p or to see the previous file or, n or to see the next file

About

Experimentation

Collaborators 1

Comments

Loading...