Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

dvc.lock 2.9 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
  1. schema: '2.0'
  2. stages:
  3. scan-authors:
  4. cmd: python ../run.py --rust scan-marc -L -o viaf.parquet ../data/viaf-clusters-marc21.xml.gz
  5. deps:
  6. - path: ../data/viaf-clusters-marc21.xml.gz
  7. md5: 83bfbacff8b72b5d4163c796941d6234
  8. size: 12655431316
  9. - path: ../src/cli/scan_marc.rs
  10. md5: bcbc2c6cfebc08fbd4072a6b01fba27d
  11. size: 3706
  12. - path: ../src/marc
  13. md5: 066418c6e3db231224a89a97aa94da9f.dir
  14. size: 19381
  15. nfiles: 5
  16. outs:
  17. - path: viaf.parquet
  18. md5: e267c52cb6ead92d5b51b80b3dcd28af
  19. size: 11318952681
  20. author-names:
  21. cmd: python ../run.py --rust fusion author-names.tcl
  22. deps:
  23. - path: author-names.tcl
  24. md5: d0ebf76118c885cd85ce01ddc87cdb64
  25. size: 223
  26. - path: viaf.parquet
  27. md5: 8c93292c182dcfba2a5960eac57c6faa
  28. size: 12519632430
  29. outs:
  30. - path: author-names.csv.gz
  31. md5: 3823edd34fc084a9f8cef5cd8ec43da2
  32. size: 494355416
  33. author-genders:
  34. cmd: python ../run.py --rust filter-marc --tag=375 --subfield=a --trim --lower
  35. -n gender -o author-genders.parquet viaf.parquet
  36. deps:
  37. - path: ../src/cli/filter_marc.rs
  38. md5: b2b2e285ede35d3ea9a2f950b6bb079f
  39. size: 5905
  40. - path: viaf.parquet
  41. md5: e267c52cb6ead92d5b51b80b3dcd28af
  42. size: 11318952681
  43. outs:
  44. - path: author-genders.parquet
  45. md5: f22e08510e3b70e749400b5019760350
  46. size: 115076921
  47. index-names:
  48. cmd: python ../run.py --rust index-names --marc-authorities viaf.parquet author-name-index.parquet
  49. deps:
  50. - path: ../src/cleaning/names
  51. md5: bc2cfaeffef29d4e5e8f345322789c1d.dir
  52. size: 10745
  53. nfiles: 5
  54. - path: ../src/cli/index_names.rs
  55. md5: 4c059e9edf4439f0062e43ec477d0e47
  56. size: 3858
  57. - path: viaf.parquet
  58. md5: e267c52cb6ead92d5b51b80b3dcd28af
  59. size: 11318952681
  60. outs:
  61. - path: author-name-index.csv.gz
  62. md5: 31d281aacd49d67a30093da93dc42f31
  63. size: 657718645
  64. - path: author-name-index.parquet
  65. md5: 254a39021dedbd8c7a994fd2c4fbdc67
  66. size: 516451943
  67. schema@author-name-index:
  68. cmd: python ../run.py --rust pq-info -o author-name-index.json author-name-index.parquet
  69. deps:
  70. - path: author-name-index.parquet
  71. md5: 254a39021dedbd8c7a994fd2c4fbdc67
  72. size: 516451943
  73. outs:
  74. - path: author-name-index.json
  75. md5: 4e608b7c650d04ed5e77c3fc3312b44d
  76. size: 247
  77. schema@viaf:
  78. cmd: python ../run.py --rust pq-info -o viaf.json viaf.parquet
  79. deps:
  80. - path: viaf.parquet
  81. md5: e267c52cb6ead92d5b51b80b3dcd28af
  82. size: 11318952681
  83. outs:
  84. - path: viaf.json
  85. md5: 55485f5bc2034323462fa644d92d820d
  86. size: 695
  87. schema@author-genders:
  88. cmd: python ../run.py --rust pq-info -o author-genders.json author-genders.parquet
  89. deps:
  90. - path: author-genders.parquet
  91. md5: f22e08510e3b70e749400b5019760350
  92. size: 115076921
  93. outs:
  94. - path: author-genders.json
  95. md5: e6ac261bc04b94f1ee581ae5ee31844f
  96. size: 249
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...