Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

megamolbart_pretrain_small_span_aug.yaml 192 B

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
  1. defaults:
  2. - megamolbart_pretrain_base
  3. trainer:
  4. devices: 8
  5. num_nodes: 8
  6. model:
  7. name: small_span_aug
  8. # model architecture
  9. num_layers: 6
  10. hidden_size: 512
  11. num_attention_heads: 8
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...