|
allow specifying max_tokens for generation
|
Alexei Baevski
|
|
6 years ago |
|
Support integer learning rates
|
Myle Ott
|
|
6 years ago |
|
fix alignment when using uneven batches and left pad
|
Alexei Baevski
|
|
6 years ago |
|
implement batching in interactive mode
|
Alexei Baevski
|
|
6 years ago |
|
Sampling doesn't work with interactive
|
Sergey Edunov
|
|
6 years ago |
|
Fix --remove-bpe to strip trailing BPE symbols
|
Myle Ott
|
|
6 years ago |
|
Remove padding from --score-reference
|
Myle Ott
|
|
6 years ago |
|
fix flag copy paste (decoder-normalize-before)
|
alexeib
|
|
6 years ago |
|
add support for averaging last n checkpoints
|
Alexei Baevski
|
|
6 years ago |
|
make attn dropout 0.1 default for big en-de transformer
|
Alexei Baevski
|
|
6 years ago |
|
fix to adding tokens to dictionary while thresholding
|
Angela Fan
|
|
6 years ago |
|
Fix --prefix-size
|
Myle Ott
|
|
6 years ago |
|
make sure tensor used to index is cuda if on gpu
|
Alexei Baevski
|
|
6 years ago |
|
Remove src-padding from generation output
|
Myle Ott
|
|
6 years ago |
|
Fix tests
|
Myle Ott
|
|
6 years ago |
|
Support --warmup-updates with fixed LR schedule
|
Myle Ott
|
|
6 years ago |
|
Save and restore wall time in checkpoints
|
Myle Ott
|
|
6 years ago |
|
Simplify train.py (merge with singleprocess_train.py)
|
Myle Ott
|
|
6 years ago |
|
Fix embedding initialization for padding
|
Alexei Baevski
|
|
6 years ago |
|
Use eval() to parse args.lr
|
Myle Ott
|
|
6 years ago |
|
Fix preprocess.py
|
Myle Ott
|
|
6 years ago |
|
Small optimization for LSTM
|
Myle Ott
|
|
6 years ago |
|
Fix Flake8
|
Myle Ott
|
|
6 years ago |
|
remove completed sentences from batch
|
Alexei Baevski
|
|
6 years ago |
|
No more magical --fp16
|
Myle Ott
|
|
6 years ago |
|
Pad dictionary to be a multiple of 8 in preprocessing
|
Myle Ott
|
|
6 years ago |
|
Revert "Make dictionary size a multiple of 8"
|
Myle Ott
|
|
6 years ago |
|
Make dictionary size a multiple of 8
|
Myle Ott
|
|
6 years ago |
|
Add FP16 support
|
Myle Ott
|
|
6 years ago |
|
Fix batching during generation
|
Myle Ott
|
|
6 years ago |