Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

dvc.lock 3.1 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
  1. schema: '2.0'
  2. stages:
  3. scan-authors:
  4. cmd: cargo run --release -- scan-marc -L -o viaf.parquet ../data/viaf-clusters-marc21.xml.gz
  5. deps:
  6. - path: ../data/viaf-clusters-marc21.xml.gz
  7. hash: md5
  8. md5: 62075083cf4ec72bf0ec841e4f46665f
  9. size: 14163590455
  10. - path: ../src/cli/scan_marc.rs
  11. hash: md5
  12. md5: 0663aa5a5d2fe2a3c2fdb505170a5cc2
  13. size: 3934
  14. - path: ../src/marc
  15. hash: md5
  16. md5: 874e3a2ea08a2d41e3c54b9d7a2032c1.dir
  17. size: 22963
  18. nfiles: 5
  19. outs:
  20. - path: viaf.parquet
  21. hash: md5
  22. md5: 9e58fa2d32e55dd94328917aef3c5f16
  23. size: 15013721675
  24. author-names:
  25. cmd: python ../run.py --rust fusion author-names.tcl
  26. deps:
  27. - path: author-names.tcl
  28. md5: d0ebf76118c885cd85ce01ddc87cdb64
  29. size: 223
  30. - path: viaf.parquet
  31. md5: 8c93292c182dcfba2a5960eac57c6faa
  32. size: 12519632430
  33. outs:
  34. - path: author-names.csv.gz
  35. md5: 3823edd34fc084a9f8cef5cd8ec43da2
  36. size: 494355416
  37. author-genders:
  38. cmd: cargo run --release -- filter-marc --tag=375 --subfield=a --trim --lower
  39. -n gender -o author-genders.parquet viaf.parquet
  40. deps:
  41. - path: ../src/cli/filter_marc.rs
  42. hash: md5
  43. md5: 87ea27014ee2006924cc041cbfbcfa1c
  44. size: 5682
  45. - path: viaf.parquet
  46. hash: md5
  47. md5: 9e58fa2d32e55dd94328917aef3c5f16
  48. size: 15013721675
  49. outs:
  50. - path: author-genders.parquet
  51. hash: md5
  52. md5: e5398fd1755b812bf8b43551f56994d7
  53. size: 134925962
  54. index-names:
  55. cmd: cargo run --release -- index-names --marc-authorities viaf.parquet author-name-index.parquet
  56. deps:
  57. - path: ../src/cleaning/names
  58. md5: c5591171b9fef6a491e33b4b7f52ba2c.dir
  59. size: 10362
  60. nfiles: 5
  61. - path: ../src/cli/index_names.rs
  62. hash: md5
  63. md5: 3ea83851695cebe5cb78faab379e89ad
  64. size: 4067
  65. - path: viaf.parquet
  66. hash: md5
  67. md5: 9e58fa2d32e55dd94328917aef3c5f16
  68. size: 15013721675
  69. outs:
  70. - path: author-name-index.csv.gz
  71. hash: md5
  72. md5: e224a3148eeb0e2cf5fed19e1d9fd966
  73. size: 859649637
  74. - path: author-name-index.parquet
  75. hash: md5
  76. md5: 382caf194f20c9f9ba74317c05ae4083
  77. size: 565011474
  78. schema@author-name-index:
  79. cmd: python ../run.py --rust pq-info -o author-name-index.json author-name-index.parquet
  80. deps:
  81. - path: author-name-index.parquet
  82. md5: 28bdb3cdaf4f193087e566662a880b22
  83. size: 515538267
  84. outs:
  85. - path: author-name-index.json
  86. md5: fa29532090232226cf30bd5c53b2566a
  87. size: 247
  88. schema@viaf:
  89. cmd: python ../run.py --rust pq-info -o viaf.json viaf.parquet
  90. deps:
  91. - path: viaf.parquet
  92. md5: 3b436c52b269e5ec33af33a24c1af1c0
  93. size: 11540970920
  94. outs:
  95. - path: viaf.json
  96. md5: 5cd18bf6344942be879199d5a1f393a7
  97. size: 695
  98. schema@author-genders:
  99. cmd: python ../run.py --rust pq-info -o author-genders.json author-genders.parquet
  100. deps:
  101. - path: author-genders.parquet
  102. md5: 505a5a8f30082f297cb79995435a82c3
  103. size: 115121052
  104. outs:
  105. - path: author-genders.json
  106. md5: ef9b90520bd446297359938de27a9de1
  107. size: 249
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...