Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

dvc.lock 2.1 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
  1. schema: '2.0'
  2. stages:
  3. preprocess:
  4. cmd: python scripts/preprocess.py
  5. deps:
  6. - path: data/raw/arxiv_raw.csv
  7. md5: 700b601ba33941b1c43b50da539211b5
  8. size: 7025160
  9. - path: data/raw/biorxiv_raw.csv
  10. md5: e08273e5c158ae8d4f780b6131d74ca7
  11. size: 53702451
  12. - path: data/raw/pubmed_raw.csv
  13. md5: 3adbc5b6eb565fa98a26d67dbee40a14
  14. size: 458192680
  15. - path: data/raw/scopus_raw.csv
  16. md5: b710ece2fff58f13adc49163380f5fc0
  17. size: 1314681357
  18. - path: scripts/preprocess.py
  19. md5: fea1318dfb56b1728390247a0060f26e
  20. size: 3125
  21. outs:
  22. - path: data/prepared/arxiv_covid_19.csv
  23. md5: 965d50cd513b13b77a96792e991ae1c3
  24. size: 7602135
  25. - path: data/prepared/biorxiv_covid_19.csv
  26. md5: 70c775b087889316fbd611aff91a0336
  27. size: 65913909
  28. - path: data/prepared/pubmed_covid_19.csv
  29. md5: 3aa8bb59e40d2031eee8c830d3d8686b
  30. size: 441641348
  31. - path: data/prepared/scopus_covid_19.csv
  32. md5: 044aa33045e265d0ecf86fb430002890
  33. size: 1135141523
  34. merge:
  35. cmd: python scripts/merge_datasets.py
  36. deps:
  37. - path: data/prepared/arxiv_covid_19.csv
  38. md5: 965d50cd513b13b77a96792e991ae1c3
  39. size: 7602135
  40. - path: data/prepared/biorxiv_covid_19.csv
  41. md5: 70c775b087889316fbd611aff91a0336
  42. size: 65913909
  43. - path: data/prepared/pubmed_covid_19.csv
  44. md5: 3aa8bb59e40d2031eee8c830d3d8686b
  45. size: 441641348
  46. - path: data/prepared/scopus_covid_19.csv
  47. md5: 044aa33045e265d0ecf86fb430002890
  48. size: 1135141523
  49. - path: scripts/merge_datasets.py
  50. md5: 3d650209936a6d96a2bc5f2f93501e12
  51. size: 16804
  52. outs:
  53. - path: data/raw/final_raw.csv
  54. md5: 272185ed7aedf2c1b9befef03963c945
  55. size: 1362214742
  56. preprocess_final:
  57. cmd: python scripts/preprocess.py final
  58. deps:
  59. - path: data/raw/final_raw.csv
  60. md5: 272185ed7aedf2c1b9befef03963c945
  61. size: 1362214742
  62. - path: scripts/preprocess.py
  63. md5: fea1318dfb56b1728390247a0060f26e
  64. size: 3125
  65. outs:
  66. - path: data/prepared/final_covid_19.csv
  67. md5: cf5b42611664c4a34fbe002a1baf377c
  68. size: 1401207482
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...