Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

dvc.yaml 1.7 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
  1. #after performing the replace operation in csv and saving it in RAW folder now we update
  2. # dvc.yaml
  3. stages:
  4. load_data:
  5. # terminal command to execute
  6. cmd: python src/load_data.py --config=params.yaml
  7. # dependency for load_data.py
  8. deps:
  9. - src/get_data.py
  10. - src/load_data.py
  11. - data_given/winequality.csv
  12. # output -> csv file present in raw folder
  13. outs:
  14. - data/raw/winequality.csv
  15. # execute with -> "dvc repro"
  16. # this command will generate dvc.lock file will contain dependency mentioned will be tracked as a code
  17. # if we execute it again it will say data and pipelines are uptodate (if no changes are made in data or load_data.py)
  18. #'data_given/winequality.csv.dvc' didn't change, skipping
  19. #Stage 'load_data' didn't change, skipping
  20. #Data and pipelines are up to date.
  21. # if changes made , if sense the changes and rerun the pipeline
  22. # even dvc.loc file contains the size of thee file also
  23. ###############################
  24. # executing the split_data.py
  25. split_data:
  26. cmd: python src/split_data.py --config=params.yaml
  27. deps:
  28. - src/split_data.py
  29. - data/raw/winequality.csv
  30. outs:
  31. - data/processed/train_winequality.csv
  32. - data/processed/test_winequality.csv
  33. train_and_evaluate:
  34. cmd: python src/train_and_evaluate.py --config=params.yaml
  35. deps:
  36. - data/processed/train_winequality.csv
  37. - data/processed/test_winequality.csv
  38. - src/train_and_evaluate.py
  39. params:
  40. - estimators.ElasticNet.params.alpha
  41. - estimators.ElasticNet.params.l1_ratio
  42. metrics:
  43. - report/scores.json:
  44. cache: false #cache memory
  45. - report/params.json:
  46. cache: false
  47. outs:
  48. - saved_models/model.joblib
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...