Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

dvc.jsonnet 905 B

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
  1. local bd = import '../lib.jsonnet';
  2. bd.pipeline({
  3. 'scan-authors': {
  4. cmd: bd.cmd('scan-marc -L -o viaf.parquet ../data/viaf-clusters-marc21.xml.gz'),
  5. deps: [
  6. '../src/cli/scan_marc.rs',
  7. '../src/marc',
  8. '../data/viaf-clusters-marc21.xml.gz',
  9. ],
  10. outs: [
  11. 'viaf.parquet',
  12. ],
  13. },
  14. 'author-genders': {
  15. cmd: bd.cmd('filter-marc --tag=375 --subfield=a --trim --lower -n gender -o author-genders.parquet viaf.parquet'),
  16. deps: [
  17. '../src/cli/filter_marc.rs',
  18. 'viaf.parquet',
  19. ],
  20. outs: [
  21. 'author-genders.parquet',
  22. ],
  23. },
  24. 'index-names': {
  25. cmd: bd.cmd('index-names --marc-authorities viaf.parquet author-name-index.parquet'),
  26. deps: [
  27. '../src/cli/index_names.rs',
  28. '../src/cleaning/names',
  29. 'viaf.parquet',
  30. ],
  31. outs: [
  32. 'author-name-index.parquet',
  33. 'author-name-index.csv.gz',
  34. ],
  35. },
  36. })
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...