Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

dvc.jsonnet 969 B

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
  1. local bd = import '../lib.jsonnet';
  2. bd.pipeline({
  3. 'scan-books': {
  4. cmd: bd.cmd('scan-marc --book-mode --glob "../data/loc-books/BooksAll.2016*.xml.gz"'),
  5. deps: [
  6. '../src/cli/scan_marc.rs',
  7. '../src/marc',
  8. '../data/loc-books',
  9. ],
  10. outs: [
  11. 'book-fields.parquet',
  12. 'book-ids.parquet',
  13. 'book-isbns.parquet',
  14. 'book-authors.parquet',
  15. ],
  16. },
  17. 'scan-names': {
  18. cmd: bd.cmd('scan-marc --glob "../data/loc-names/Names.2016*.xml.gz" -o name-fields.parquet'),
  19. deps: [
  20. '../src/cli/scan_marc.rs',
  21. '../src/marc',
  22. '../data/loc-names',
  23. ],
  24. outs: [
  25. 'name-fields.parquet',
  26. ],
  27. },
  28. 'book-isbn-ids': {
  29. wdir: '..',
  30. cmd: bd.cmd('link-isbn-ids -R rec_id -o loc-mds/book-isbn-ids.parquet loc-mds/book-isbns.parquet'),
  31. deps: [
  32. 'loc-mds/book-isbns.parquet',
  33. 'book-links/all-isbns.parquet',
  34. ],
  35. outs: [
  36. 'loc-mds/book-isbn-ids.parquet',
  37. ],
  38. },
  39. })
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...