Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

dvc.lock 3.1 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
  1. schema: '2.0'
  2. stages:
  3. scan-ratings:
  4. cmd: cargo run --release -- amazon scan-ratings -o ratings.parquet --swap-id-columns
  5. ../data/az2018/Books.csv
  6. deps:
  7. - path: ../data/az2018/Books.csv
  8. md5: bcdcbbf336eb0d410e7a7894efa904ab
  9. size: 2140933459
  10. - path: ../src/amazon.rs
  11. hash: md5
  12. md5: 21b5d02a0fcb3f494163ed41cb6dd295
  13. size: 1345
  14. - path: ../src/cli/amazon/
  15. hash: md5
  16. md5: a7dffc6b923be125d80365fa51520376.dir
  17. size: 5736
  18. nfiles: 4
  19. outs:
  20. - path: ratings.parquet
  21. hash: md5
  22. md5: fdc3832ac453570056ca27500f9d4033
  23. size: 342486205
  24. schema@ratings:
  25. cmd: python ../run.py --rust pq-info -o ratings.json ratings.parquet
  26. deps:
  27. - path: ratings.parquet
  28. md5: 060809287f39a08c63724a1e8ae0fd8d
  29. size: 316701214
  30. outs:
  31. - path: ratings.json
  32. md5: e122a398de9e64654720f792598b7e1e
  33. size: 427
  34. cluster-ratings:
  35. cmd: cargo run --release -- amazon cluster-ratings -o az2018/az-cluster-ratings.parquet
  36. az2018/ratings.parquet
  37. deps:
  38. - path: az2018/ratings.parquet
  39. hash: md5
  40. md5: 293d4ebd867bc7dd126a052572477e38
  41. size: 342513929
  42. - path: book-links/isbn-clusters.parquet
  43. hash: md5
  44. md5: 1a87f47db64785678a022a264c8603be
  45. size: 487825142
  46. - path: src/cli/amazon
  47. hash: md5
  48. md5: a7dffc6b923be125d80365fa51520376.dir
  49. size: 5736
  50. nfiles: 4
  51. outs:
  52. - path: az2018/az-cluster-ratings.parquet
  53. hash: md5
  54. md5: 334d5e45ada5091f895dff4bc3a084ae
  55. size: 452366669
  56. schema@az-cluster-ratings:
  57. cmd: python ../run.py --rust pq-info -o az-cluster-ratings.json az-cluster-ratings.parquet
  58. deps:
  59. - path: az-cluster-ratings.parquet
  60. md5: c9ecf365a84cfb2ff36b87ec9c393c35
  61. size: 661901612
  62. outs:
  63. - path: az-cluster-ratings.json
  64. md5: da713f0303e3384dfb8624edf8924005
  65. size: 702
  66. cluster-ratings-5core:
  67. cmd: cargo run --release -- kcore -o az-cluster-ratings-5core.parquet az-cluster-ratings.parquet
  68. deps:
  69. - path: ../src/cli/kcore.rs
  70. hash: md5
  71. md5: 9a64f2beb19d2053d9c2386609beafe9
  72. size: 4874
  73. - path: az-cluster-ratings.parquet
  74. hash: md5
  75. md5: 334d5e45ada5091f895dff4bc3a084ae
  76. size: 452366669
  77. outs:
  78. - path: az-cluster-ratings-5core.parquet
  79. hash: md5
  80. md5: 3631bd4d445e9a2da927d053af94d419
  81. size: 241092593
  82. scan-reviews:
  83. cmd: cargo run --release -- amazon scan-reviews --rating-output ratings.parquet
  84. --review-output reviews.parquet ../data/az2018/Books.json.gz
  85. deps:
  86. - path: ../data/az2018/Books.json.gz
  87. hash: md5
  88. md5: 38bd00a67dd98902741eebfaf64f08dc
  89. size: 11813848069
  90. - path: ../src/amazon.rs
  91. hash: md5
  92. md5: 21b5d02a0fcb3f494163ed41cb6dd295
  93. size: 1345
  94. - path: ../src/cli/amazon/
  95. hash: md5
  96. md5: a7dffc6b923be125d80365fa51520376.dir
  97. size: 5736
  98. nfiles: 4
  99. outs:
  100. - path: ratings.parquet
  101. hash: md5
  102. md5: 293d4ebd867bc7dd126a052572477e38
  103. size: 342513929
  104. - path: reviews.parquet
  105. hash: md5
  106. md5: edde381af20b17f6012af7c5315d1286
  107. size: 9477326910
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...