|
Adding README and more parameters to En2De script
|
Sergey Edunov
|
|
6 years ago |
|
Merge branch 'master' of github.com:facebookresearch/fairseq-py into prepare_wmt
|
Sergey Edunov
|
|
6 years ago |
|
Switch to news-commentary-v12
|
Sergey Edunov
|
|
6 years ago |
|
Fixed Weight Decay Regularization in Adam
|
Michael Auli
|
|
6 years ago |
|
Fix tests
|
Myle Ott
|
|
6 years ago |
|
Output correct perplexity when training with --sentence-avg
|
Myle Ott
|
|
6 years ago |
|
Fix max_positions calculation in train.py
|
Myle Ott
|
|
6 years ago |
|
Better warning message for inputs that are too long
|
Myle Ott
|
|
6 years ago |
|
ATen Fix
|
Michael Auli
|
|
6 years ago |
|
Momentum correction
|
Michael Auli
|
|
6 years ago |
|
Report log likelihood for label smoothing
|
Sergey Edunov
|
|
6 years ago |
|
Share input/output embed
|
Sergey Edunov
|
|
6 years ago |
|
Better support for torch.no_grad (since volatile is deprecated)
|
Myle Ott
|
|
6 years ago |
|
Fix training
|
Myle Ott
|
|
6 years ago |
|
Save dictionary in model base classes
|
Myle Ott
|
|
6 years ago |
|
Fix gradient clipping when --clip-norm=0
|
Myle Ott
|
|
6 years ago |
|
Fix LearnedPositionalEmbedding
|
Myle Ott
|
|
6 years ago |
|
Move normalization of model output (e.g., via LSM) into model definition
|
Myle Ott
|
|
6 years ago |
|
Move positional embeddings into LearnedPositionalEmbedding module
|
Myle Ott
|
|
6 years ago |
|
Fix warning about deprecated `volatile` kwarg for Variables
|
Myle Ott
|
|
6 years ago |
|
Add option to SequenceGenerator to retain dropout
|
Myle Ott
|
|
6 years ago |
|
Add --max-sentences-valid to train.py
|
Myle Ott
|
|
6 years ago |
|
Streamline data formatting utils
|
Myle Ott
|
|
6 years ago |
|
Add reduce kwarg to criterions
|
Myle Ott
|
|
6 years ago |
|
Raise FileNotFoundError if dictionary files don't exist
|
Myle Ott
|
|
6 years ago |
|
Output number of model parameters in train.py
|
Myle Ott
|
|
6 years ago |
|
Add explicit dimension to softmax calls
|
Myle Ott
|
|
6 years ago |
|
Support deprecation of volatile Variables in latest PyTorch
|
Myle Ott
|
|
6 years ago |
|
Minor fix for strip_pad functions
|
Myle Ott
|
|
6 years ago |
|
Better error message for --decoder-attention
|
Myle Ott
|
|
6 years ago |