extract-25k-vocab-corpus.dvc 1.0 KB

md5: fafaeb49b8b72dfab33ecc8142993f6b
cmd: scripts/extract-corpus.sh data/gigatoken-corpus.tar.gz script-data/25k-vocab-corpus-projects/train.txt
  script-data/25k-vocab-corpus-projects/valid.txt script-data/25k-vocab-corpus-projects/test.txt
  Clara/clara/src/main/java/org/vaadin/teemu/clara/Clara.java data/25k-vocab-corpus
wdir: ..
deps:
- md5: f23195d351918f103c01930427230207
  path: scripts/extract-corpus.sh
- md5: 5ad5ab1f45baa6eb502066df96d00380
  path: data/gigatoken-corpus.tar.gz
- md5: b501d6377a596022e5bec91ca057485e
  path: script-data/25k-vocab-corpus-projects/train.txt
- md5: b62ed5a3d3624b9ea572d29dedc0c472
  path: script-data/25k-vocab-corpus-projects/valid.txt
- md5: ff6c3e364155e331a6e3b39c46b90a21
  path: script-data/25k-vocab-corpus-projects/test.txt
- path: params/25k-vocab-corpus.yml
  params:
    demo-file: Clara/clara/src/main/java/org/vaadin/teemu/clara/Clara.java
outs:
- md5: ca2e8d6be677c14bd408fca1ad2c08cc.dir
  path: data/25k-vocab-corpus
  cache: true
  metric: false
  persist: false